The code for the paper: "Same Task, More Tokens: the Impact of Input Length on the Reasoning Performance of Large Language Models"
☆56Oct 24, 2025Updated 4 months ago
Alternatives and similar repositories for Same-Task-More-Tokens
Users that are interested in Same-Task-More-Tokens are comparing it to the libraries listed below
- Stick-breaking attention☆62Jul 1, 2025Updated 8 months ago
- Code and Data for "Long-context LLMs Struggle with Long In-context Learning" [TMLR2025]☆112Feb 20, 2025Updated last year
- Dataset for medical question summarization introduced in the ACL 2019 paper "On the Summarization of Consumer Health Questions" (A. Ben A…☆32Jan 27, 2023Updated 3 years ago
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"☆27Apr 17, 2024Updated last year
- Agentic Keyframe Search for Video Question Answering☆16Apr 7, 2025Updated 11 months ago
- [ICLR'25] Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?"☆79Nov 25, 2024Updated last year
- Extract links from Wikipedia pages to create a cross-document coreference dataset (multilingual support)☆11Apr 13, 2023Updated 2 years ago
- self-adaptive in-context learning☆45May 5, 2023Updated 2 years ago
- Code for ICLR 2025 Paper "What is Wrong with Perplexity for Long-context Language Modeling?"☆110Oct 11, 2025Updated 5 months ago
- codebase for the Text-based NP Enrichment (TNE) paper☆19Mar 12, 2024Updated 2 years ago
- ☆63Jun 12, 2025Updated 9 months ago
- LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation☆33Feb 26, 2026Updated 3 weeks ago
- To assess the longtext capabilities more comprehensively, we propose Needle-in-a-Haystack PLUS, which shifts the focus from simple fact r…☆13Mar 4, 2024Updated 2 years ago
- Corpus exploration platform using advanced tools such as interactive summarization and multi document coreference resolution☆12Jun 15, 2023Updated 2 years ago
- Efficient retrieval head analysis with triton flash attention that supports topK probability☆13Jun 15, 2024Updated last year
- Code and data for "Lost in the Middle: How Language Models Use Long Contexts"☆374Jan 4, 2024Updated 2 years ago
- ☆11Nov 5, 2024Updated last year
- This is the official implementation of "DAPE: Data-Adaptive Positional Encoding for Length Extrapolation"☆41Oct 11, 2024Updated last year
- Extended Wikilinks dataset description☆15Apr 1, 2018Updated 7 years ago
- Dataset and baseline for Coling 2022 long paper (oral): "ConFiguRe: Exploring Discourse-level Chinese Figures of Speech"☆13Jul 27, 2023Updated 2 years ago
- [EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs☆259Dec 16, 2024Updated last year
- Code for the EMNLP24 paper "A simple and effective L2 norm based method for KV Cache compression."☆18Dec 13, 2024Updated last year
- ☆14Dec 19, 2024Updated last year
- Training project about Deep Learning☆12Jun 22, 2017Updated 8 years ago
- ☆17Jun 14, 2023Updated 2 years ago
- ☆19Mar 25, 2025Updated 11 months ago
- ACL 2024 | LooGLE: Long Context Evaluation for Long-Context Language Models☆195Oct 8, 2024Updated last year
- Reproducible code for Augmentation paper☆17Jan 23, 2019Updated 7 years ago
- Rationale-enhanced language models are better continual relation learners (EMNLP 2023 Main Conference)☆12Oct 11, 2023Updated 2 years ago
- ☆14Aug 25, 2021Updated 4 years ago
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems☆19Mar 7, 2021Updated 5 years ago
- Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"☆247Sep 12, 2025Updated 6 months ago
- ☆27Feb 23, 2026Updated 3 weeks ago
- Code for "Planning and Generating Natural and Diverse Disfluent Texts as Augmentation for Disfluency Detection"☆16Apr 25, 2022Updated 3 years ago
- LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models☆79Oct 16, 2024Updated last year
- A Large-Scale Gender Bias Dataset for Coreference Resolution and Machine Translation, Levy et al., Findings of EMNLP 2021☆14Apr 3, 2022Updated 3 years ago
- Retrieval Augmented Generation Generalized Evaluation Dataset☆61Jul 16, 2025Updated 8 months ago
- ☆11Oct 11, 2023Updated 2 years ago
- Flash Attention in 300-500 lines of CUDA/C++☆36Aug 22, 2025Updated 6 months ago