datamllab/LongLM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/datamllab/LongLM)

datamllab / LongLM

[ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning

☆668

Alternatives and similar repositories for LongLM

Users that are interested in LongLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

HKUNLP / ChunkLlama
View on GitHub
[ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"
☆450Oct 16, 2024Updated last year
sdan / selfextend
View on GitHub
an implementation of Self-Extend, to expand the context window via grouped attention
☆119Jan 7, 2024Updated 2 years ago
dwzhu-pku / PoSE
View on GitHub
Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extremely Length (ICLR 2024)
☆208May 20, 2024Updated 2 years ago
FranxYao / Long-Context-Data-Engineering
View on GitHub
Implementation of paper Data Engineering for Scaling Language Models to 128K Context
☆501Mar 19, 2024Updated 2 years ago
GAIR-NLP / Entropy-ABF
View on GitHub
Official implementation for 'Extending LLMs’ Context Window with 100 Samples'
☆82Jan 18, 2024Updated 2 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
jquesnelle / yarn
View on GitHub
YaRN: Efficient Context Window Extension of Large Language Models
☆1,737Apr 17, 2024Updated 2 years ago
JIA-Lab-research / LongLoRA
View on GitHub
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
☆2,689Aug 14, 2024Updated last year
thunlp / InfLLM
View on GitHub
The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Mem…
☆405Apr 20, 2024Updated 2 years ago
jy-yuan / KIVI
View on GitHub
[ICML 2024] KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache
☆418Nov 20, 2025Updated 8 months ago
THUDM / LongBench
View on GitHub
LongBench v2 and LongBench (ACL 25'&24')
☆1,212Jan 15, 2025Updated last year
jzhang38 / EasyContext
View on GitHub
Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
☆759Sep 27, 2024Updated last year
schwartz-lab-NLP / TOVA
View on GitHub
Token Omission Via Attention
☆131Oct 13, 2024Updated last year
FasterDecoding / SnapKV
View on GitHub
☆324Jul 10, 2025Updated last year
ynchuang / ad2kg
View on GitHub
We have created a new Github repository. Please visit https://github.com/ynchuang/DiscoverPath for the latest update.
☆17Sep 3, 2023Updated 2 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
microsoft / MInference
View on GitHub
[NeurIPS'24 Spotlight, ICLR'25, ICML'25] To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention…
☆1,221Apr 8, 2026Updated 3 months ago
jshuadvd / LongRoPE
View on GitHub
Implementation of the LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper
☆154Jul 20, 2024Updated 2 years ago
henryzhongsc / longctx_bench
View on GitHub
KV Cache Compression, But What Must We Give in Return? A Comprehensive Benchmark of Long Context Capable Approaches. EMNLP Findings 2024
☆89Feb 27, 2025Updated last year
DAMO-NLP-SG / CLEX
View on GitHub
[ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Models
☆78Mar 12, 2024Updated 2 years ago
Leooyii / LCEG
View on GitHub
[COLM'25] A Controlled Study on Long Context Extension and Generalization in LLMs
☆65Mar 9, 2026Updated 4 months ago
arcee-ai / mergekit
View on GitHub
Tools for merging pretrained large language models.
☆7,246Jun 17, 2026Updated last month
guanchuwang / Taylor-Unswift
View on GitHub
☆22Oct 3, 2024Updated last year
tomaarsen / attention_sinks
View on GitHub
Extend existing LLMs way beyond the original training length with constant memory usage, without retraining
☆735Apr 10, 2024Updated 2 years ago
princeton-nlp / ProLong
View on GitHub
Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"
☆260Sep 12, 2025Updated 10 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
Glaciohound / LM-Infinite
View on GitHub
Implementation of NAACL 2024 Outstanding Paper "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"
☆152Mar 13, 2025Updated last year
mit-han-lab / streaming-llm
View on GitHub
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
☆7,248Jul 11, 2024Updated 2 years ago
hao-ai-lab / Consistency_LLM
View on GitHub
[ICML 2024] CLLMs: Consistency Large Language Models
☆416Nov 16, 2024Updated last year
IST-DASLab / qmoe
View on GitHub
Code for the paper "QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models".
☆278Nov 3, 2023Updated 2 years ago
chuanyang-Zheng / DAPE
View on GitHub
The this is the official implementation of "DAPE: Data-Adaptive Positional Encoding for Length Extrapolation"
☆41Oct 11, 2024Updated last year
LuLuLuyi / LongHeads
View on GitHub
[EMNLP'24] LongHeads: Multi-Head Attention is Secretly a Long Context Processor
☆32Apr 8, 2024Updated 2 years ago
yangjianxin1 / LongQLoRA
View on GitHub
LongQLoRA: Extent Context Length of LLMs Efficiently
☆170Nov 12, 2023Updated 2 years ago
booydar / babilong
View on GitHub
BABILong is a benchmark for LLM evaluation using the needle-in-a-haystack approach.
☆250Jun 1, 2026Updated last month
gkamradt / needle-in-a-haystack
View on GitHub
Doing simple retrieval from LLM models at various context lengths to measure accuracy
☆2,346Jun 8, 2026Updated last month
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
XuezheMax / megalodon
View on GitHub
Reference implementation of Megalodon 7B model
☆526May 17, 2025Updated last year
princeton-nlp / CEPE
View on GitHub
[ACL 2024] Long-Context Language Modeling with Parallel Encodings
☆169Jun 13, 2024Updated 2 years ago
ZichengXu / Decoding-Tree-Sketching
View on GitHub
[ICML 2026] Decoding Tree Sketching (DTS): a training-free & model agonistic & plug-in framework for LLM parallel reasoning.
☆71May 12, 2026Updated 2 months ago
databricks / megablocks
View on GitHub
☆1,582Mar 25, 2026Updated 3 months ago
Xnhyacinth / Awesome-LLM-Long-Context-Modeling
View on GitHub
📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥
☆2,146Jul 1, 2026Updated 2 weeks ago
FasterDecoding / Medusa
View on GitHub
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
☆2,757Jun 25, 2024Updated 2 years ago
microsoft / LLMLingua
View on GitHub
[EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which ach…
☆6,454Apr 8, 2026Updated 3 months ago