microsoft / LongRoPE

LongRoPE is a novel method that extends the context window of pre-trained LLMs to an impressive 2048k tokens.
82 · Updated 3 weeks ago

Related projects: