klotz: llm* + rope*

  1. Microsoft researchers introduce LongRoPE2, a method that extends large language model context windows to 128K tokens while retaining over 97% of short-context accuracy, addressing key limitations in positional embeddings.

  2. This article explains the LongRoPE methodology used to extend context lengths in LLMs without significant performance degradation. It discusses why context length matters in LLMs and the limitations of previous positional encoding methods, then introduces Rotary Position Embedding (RoPE) and its limitations, and explains how LongRoPE extends RoPE to longer contexts; a minimal sketch of the mechanism follows below.
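
The core mechanism is easy to sketch. Below is a minimal, illustrative Python example, not the paper's actual algorithm: standard RoPE rotates each (even, odd) pair of query/key dimensions by an angle proportional to the token position, and LongRoPE-style extension rescales the per-dimension rotation frequencies so that positions beyond the trained window map back into the angular range seen during training. The function names and the uniform example scale factor are assumptions for illustration; LongRoPE actually searches for non-uniform per-dimension factors.

```python
import numpy as np

def rope_inv_freq(head_dim: int, base: float = 10000.0) -> np.ndarray:
    """One inverse frequency per (even, odd) dimension pair, as in the RoPE paper."""
    return 1.0 / (base ** (np.arange(0, head_dim, 2) / head_dim))

def apply_rope(x: np.ndarray, positions: np.ndarray, inv_freq: np.ndarray) -> np.ndarray:
    """Rotate each dimension pair of x (seq_len, head_dim) by angle = position * frequency."""
    angles = np.outer(positions, inv_freq)            # (seq_len, head_dim // 2)
    cos, sin = np.cos(angles), np.sin(angles)
    x_even, x_odd = x[:, 0::2], x[:, 1::2]
    out = np.empty_like(x)
    out[:, 0::2] = x_even * cos - x_odd * sin
    out[:, 1::2] = x_even * sin + x_odd * cos
    return out

# LongRoPE-style extension (illustrative): divide each dimension's frequency by a
# rescale factor so an 8x longer position range reuses the trained angle range.
# Real LongRoPE/LongRoPE2 searches for non-uniform per-dimension factors; the
# uniform factor here is an assumption for brevity.
head_dim, trained_len, target_len = 64, 4096, 32768
inv_freq = rope_inv_freq(head_dim)
rescale = np.full(head_dim // 2, target_len / trained_len)  # hypothetical factors
extended_inv_freq = inv_freq / rescale

q = np.random.randn(target_len, head_dim)
q_rot = apply_rope(q, np.arange(target_len), extended_inv_freq)
print(q_rot.shape)  # (32768, 64)
```

With the frequencies scaled down this way, the rotation angle at position 32768 equals the original angle at position 4096, so long-context inference stays inside the angular range the model was trained on; that is the basic reason this family of extension methods can preserve much of the original short-context behavior.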
