google-deepmind/loft

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/google-deepmind/loft)

google-deepmind / loft

LOFT: A 1 Million+ Token Long-Context Benchmark

☆237

Alternatives and similar repositories for loft

Users that are interested in loft are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

princeton-nlp / ProLong
View on GitHub
Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"
☆260Sep 12, 2025Updated 10 months ago
princeton-nlp / HELMET
View on GitHub
The HELMET Benchmark
☆220Apr 17, 2026Updated 3 months ago
OpenBMB / InfiniteBench
View on GitHub
Codes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718
☆387Sep 25, 2024Updated last year
TIGER-AI-Lab / LongICLBench
View on GitHub
Code and Data for "Long-context LLMs Struggle with Long In-context Learning" [TMLR2025]
☆113Feb 20, 2025Updated last year
zexuanqiu / CLongEval
View on GitHub
CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models
☆49Mar 7, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
FranxYao / Long-Context-Data-Engineering
View on GitHub
Implementation of paper Data Engineering for Scaling Language Models to 128K Context
☆501Mar 19, 2024Updated 2 years ago
princeton-pli / LongProc
View on GitHub
LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation
☆36Feb 26, 2026Updated 4 months ago
MozerWang / Loong
View on GitHub
[EMNLP 2024 (Oral)] Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA
☆155Dec 22, 2025Updated 6 months ago
bigai-nlco / LooGLE
View on GitHub
ACL 2024 | LooGLE: Long Context Evaluation for Long-Context Language Models
☆199Oct 8, 2024Updated last year
nightdessert / Retrieval_Head
View on GitHub
open-source code for paper: Retrieval Head Mechanistically Explains Long-Context Factuality
☆241Aug 2, 2024Updated last year
HKUNLP / ChunkLlama
View on GitHub
[ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"
☆450Oct 16, 2024Updated last year
salesforce / summary-of-a-haystack
View on GitHub
Codebase accompanying the Summary of a Haystack paper.
☆82Jun 25, 2026Updated 3 weeks ago
NVIDIA / RULER
View on GitHub
This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?
☆1,582Jun 25, 2026Updated 3 weeks ago
Leooyii / LCEG
View on GitHub
[COLM'25] A Controlled Study on Long Context Extension and Generalization in LLMs
☆65Mar 9, 2026Updated 4 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
THUDM / LongBench
View on GitHub
LongBench v2 and LongBench (ACL 25'&24')
☆1,212Jan 15, 2025Updated last year
OpenLMLab / LEval
View on GitHub
[ACL'24 Outstanding] Data and code for L-Eval, a comprehensive long context language models evaluation benchmark
☆406Jul 9, 2024Updated 2 years ago
whyNLP / LCKV
View on GitHub
Layer-Condensed KV cache w/ 10 times larger batch size, fewer params and less computation. Dramatic speed up with better task performance…
☆157Apr 7, 2025Updated last year
FasterDecoding / SnapKV
View on GitHub
☆324Jul 10, 2025Updated last year
Xnhyacinth / Awesome-LLM-Long-Context-Modeling
View on GitHub
📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥
☆2,146Jul 1, 2026Updated 2 weeks ago
booydar / babilong
View on GitHub
BABILong is a benchmark for LLM evaluation using the needle-in-a-haystack approach.
☆250Jun 1, 2026Updated last month
microsoft / FILM
View on GitHub
Official repo for "Make Your LLM Fully Utilize the Context"
☆275May 15, 2024Updated 2 years ago
gkamradt / needle-in-a-haystack
View on GitHub
Doing simple retrieval from LLM models at various context lengths to measure accuracy
☆2,346Jun 8, 2026Updated last month
RulinShao / retrieval-scaling
View on GitHub
Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".
☆226Dec 16, 2025Updated 7 months ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
princeton-nlp / CEPE
View on GitHub
[ACL 2024] Long-Context Language Modeling with Parallel Encodings
☆169Jun 13, 2024Updated 2 years ago
THUDM / LongAlign
View on GitHub
[EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs
☆261Dec 16, 2024Updated last year
google-deepmind / long-form-factuality
View on GitHub
Benchmarking long-form factuality in large language models. Original code for our paper "Long-form factuality in large language models".
☆692Jun 18, 2026Updated last month
evalplus / repoqa
View on GitHub
RepoQA: Evaluating Long-Context Code Understanding
☆136Nov 1, 2024Updated last year
PKU-ML / LongPPL
View on GitHub
Code for ICLR 2025 Paper "What is Wrong with Perplexity for Long-context Language Modeling?"
☆115Oct 11, 2025Updated 9 months ago
VITA-Group / Ms-PoE
View on GitHub
"Found in the Middle: How Language Models Use Long Contexts Better via Plug-and-Play Positional Encoding" Zhenyu Zhang, Runjin Chen, Shiw…
☆35May 7, 2024Updated 2 years ago
bryanchrist / MathNeuro
View on GitHub
Codebase for Math Neurosurgery: Isolating LLMs' Math Reasoning Abilities Using Only Forward Passes
☆23Jun 15, 2025Updated last year
nick7nlp / Counting-Stars
View on GitHub
Counting-Stars (★)
☆83Nov 24, 2025Updated 7 months ago
mozhu621 / LongGenBench
View on GitHub
☆37Oct 4, 2025Updated 9 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
jshuadvd / LongRoPE
View on GitHub
Implementation of the LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper
☆154Jul 20, 2024Updated 2 years ago
assafbk / OPRM
View on GitHub
Overflow Prevention Enhances Long-Context Recurrent LLMs (COLM 2025)
☆18Jul 8, 2025Updated last year
dmis-lab / ANGEL
View on GitHub
Learning from Negative samples for Biomedical Generative Entity Linking
☆18May 25, 2025Updated last year
magicproduct / hash-hop
View on GitHub
Long context evaluation for large language models
☆230Jul 7, 2026Updated last week
dwzhu-pku / PoSE
View on GitHub
Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extremely Length (ICLR 2024)
☆208May 20, 2024Updated 2 years ago
facebookresearch / RAM
View on GitHub
A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).
☆378Jun 25, 2026Updated 3 weeks ago
Hambaobao / Marathon
View on GitHub
Marathon: A Multiple-choice Long Context Evaluation Benchmark for Large Language Models.
☆10May 16, 2024Updated 2 years ago