The code for the paper: "Same Task, More Tokens: the Impact of Input Length on the Reasoning Performance of Large Language Models"
☆56Oct 24, 2025Updated 5 months ago
Alternatives and similar repositories for Same-Task-More-Tokens
Users that are interested in Same-Task-More-Tokens are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LongAttn :Selecting Long-context Training Data via Token-level Attention☆15Jul 16, 2025Updated 8 months ago
- Code and Data for "Long-context LLMs Struggle with Long In-context Learning" [TMLR2025]☆114Feb 20, 2025Updated last year
- HumanLM: Simulating Users with State Alignment Beats Response Imitation☆70Feb 27, 2026Updated last month
- Dataset for medical question summarization introduced in the ACL 2019 paper "On the Summarization of Consumer Health Questions" (A. Ben A…☆32Jan 27, 2023Updated 3 years ago
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"☆27Apr 17, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- open-source code for paper: Retrieval Head Mechanistically Explains Long-Context Factuality☆236Aug 2, 2024Updated last year
- [EMNLP'24] LongHeads: Multi-Head Attention is Secretly a Long Context Processor☆31Apr 8, 2024Updated 2 years ago
- [ICLR'25] Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?"☆81Nov 25, 2024Updated last year
- Extract links from Wikipedia pages to create a cross-document coreference dataset (multilingual support)☆11Apr 13, 2023Updated 2 years ago
- self-adaptive in-context learning☆45May 5, 2023Updated 2 years ago
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆149Nov 9, 2024Updated last year
- Implementation of a holodeck, written in Pytorch☆19Nov 1, 2023Updated 2 years ago
- Code for ICLR 2025 Paper "What is Wrong with Perplexity for Long-context Language Modeling?"☆110Oct 11, 2025Updated 5 months ago
- codebase for the Text-based NP Enrichment (TNE) paper☆19Mar 12, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆63Jun 12, 2025Updated 9 months ago
- LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation☆33Feb 26, 2026Updated last month
- ☆31Jul 2, 2023Updated 2 years ago
- To assess the longtext capabilities more comprehensively, we propose Needle-in-a-Haystack PLUS, which shifts the focus from simple fact r…☆13Mar 4, 2024Updated 2 years ago
- Corpus exploration platform using advanced tools such as interactive summarization and multi document coreference resolution☆12Jun 15, 2023Updated 2 years ago
- Efficient retrieval head analysis with triton flash attention that supports topK probability☆13Jun 15, 2024Updated last year
- Code and data for "Lost in the Middle: How Language Models Use Long Contexts"☆375Jan 4, 2024Updated 2 years ago
- ☆11Nov 5, 2024Updated last year
- The this is the official implementation of "DAPE: Data-Adaptive Positional Encoding for Length Extrapolation"☆41Oct 11, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆10Jul 18, 2022Updated 3 years ago
- ☆15Aug 3, 2021Updated 4 years ago
- Extended Wikilinks dataset description☆15Apr 1, 2018Updated 8 years ago
- [NeurIPS 2022] DreamShard: Generalizable Embedding Table Placement for Recommender Systems☆28Mar 24, 2023Updated 3 years ago
- Dataset and baseline for Coling 2022 long paper (oral): "ConFiguRe: Exploring Discourse-level Chinese Figures of Speech"☆13Jul 27, 2023Updated 2 years ago
- Official implementation for "Law of the Weakest Link: Cross capabilities of Large Language Models"☆43Oct 1, 2024Updated last year
- [EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs☆260Dec 16, 2024Updated last year
- Code for the EMNLP24 paper "A simple and effective L2 norm based method for KV Cache compression."☆18Dec 13, 2024Updated last year
- ☆14Dec 19, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Training project about Deep Learing☆12Jun 22, 2017Updated 8 years ago
- ☆17Jun 14, 2023Updated 2 years ago
- ☆19Mar 25, 2025Updated last year
- Data and code for "A Question Answering Evaluation Framework for Faithfulness Assessment in Abstractive Summarization" (ACL 2020)☆48Jun 12, 2023Updated 2 years ago
- ACL 2024 | LooGLE: Long Context Evaluation for Long-Context Language Models☆195Oct 8, 2024Updated last year
- Syntax Error-Free and Generalizable Tool Use for LLMs via Finite-State Decoding☆29Jan 28, 2024Updated 2 years ago
- Reproducible code for Augmentation paper☆17Jan 23, 2019Updated 7 years ago