megagonlabs / holobench
🫧 Code for Holistic Reasoning with Long-Context LMs: A Benchmark for Database Operations on Massive Textual Data (Maekawa*, Iso* et al.; ICLR 2025)
☆10Updated 2 months ago
Alternatives and similar repositories for holobench:
Users who are interested in holobench are comparing it to the libraries listed below.
- Common tools for data processing☆12Updated last month
- https://arxiv.org/abs/2404.10917☆14Updated last month
- This repository contains the dataset and code for "WiCE: Real-World Entailment for Claims in Wikipedia" in EMNLP 2023.☆41Updated last year
- GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.☆61Updated 10 months ago
- "FiD-ICL: A Fusion-in-Decoder Approach for Efficient In-Context Learning" (ACL 2023)☆14Updated last year
- [ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Training☆21Updated 8 months ago
- [NeurIPS 2023] Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective☆30Updated last year
- InstructIR, a novel benchmark specifically designed to evaluate the instruction-following ability of information retrieval models. Our foc…☆32Updated 10 months ago
- Suri: Multi-constraint instruction following for long-form text generation (EMNLP’24)☆22Updated 6 months ago
- Evaluate the Quality of Critique☆34Updated 11 months ago
- ☆43Updated 9 months ago
- ☆22Updated 4 months ago
- Code for the ACL 2024 paper "PLUG: Leveraging Pivot Language in Cross-Lingual Instruction Tuning"☆12Updated last year
- Official codebase for permutation self-consistency.☆18Updated last year
- The paper list of multilingual pre-trained models (Continually Updated).☆21Updated 10 months ago
- An Open-Source Knowledge-Enhanced Multilingual Supervised Fine-tuning Dataset☆24Updated 3 months ago
- Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs"☆23Updated last week
- ☆11Updated 6 months ago
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions☆44Updated 10 months ago
- Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment"☆74Updated 11 months ago
- List of papers on Self-Correction of LLMs.☆72Updated 4 months ago
- LongHeads: Multi-Head Attention is Secretly a Long Context Processor☆29Updated last year
- AbstainQA, ACL 2024☆25Updated 7 months ago
- [arXiv preprint] Official Repository for "Evaluating Language Models as Synthetic Data Generators"☆33Updated 4 months ago
- ☆34Updated 10 months ago
- SysBench: Can Large Language Models Follow System Messages?☆29Updated 8 months ago
- ☆25Updated 2 years ago
- Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data☆35Updated 2 months ago
- Codebase for Instruction Following without Instruction Tuning☆34Updated 7 months ago
- Code for preprint "Metadata Conditioning Accelerates Language Model Pre-training (MeCo)"☆37Updated this week