megagonlabs / holobenchLinks
π«§ Code for Holistic Reasoning with Long-Context LMs: A Benchmark for Database Operations on Massive Textual Data (Maekawa*, Iso* et al.; ICLR 2025)
β12Updated 7 months ago
Alternatives and similar repositories for holobench
Users that are interested in holobench are comparing it to the libraries listed below
Sorting:
- Common tools for data processingβ20Updated this week
- Enhancing Legal Case Retrieval via Scaling High-quality Synthetic Query-Candidate Pairs (EMNLP 2024)β15Updated 11 months ago
- SysBench: Can Large Language Models Follow System Messages?β35Updated last year
- [ICLR 2025] BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrievalβ168Updated last month
- Test-time compute in information retrievalβ44Updated 3 months ago
- Official Implementation of "Probing Language Models for Pre-training Data Detection"β20Updated 10 months ago
- List of papers on Self-Correction of LLMs.β78Updated 9 months ago
- β123Updated 2 years ago
- [ICLR'25] Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?"β77Updated 10 months ago
- GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.β63Updated last year
- [EMNLP 2025] LightThinker: Thinking Step-by-Step Compressionβ106Updated 6 months ago
- Implementation for ACL 2024 paper "Meta-Task Prompting Elicits Embeddings from Large Language Models"β12Updated last year
- Official Code Repository for LM-Steer Paper: "Word Embeddings Are Steers for Language Models" (ACL 2024 Outstanding Paper Award)β124Updated 3 months ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modelingβ51Updated 4 months ago
- Exploring the Limitations of Large Language Models on Multi-Hop Queriesβ27Updated 7 months ago
- [ICLR'25] "Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers"β35Updated 6 months ago
- β22Updated 10 months ago
- NAACL 2024: SeaEval for Multilingual Foundation Models: From Cross-Lingual Alignment to Cultural Reasoningβ26Updated 7 months ago
- "FiD-ICL: A Fusion-in-Decoder Approach for Efficient In-Context Learning" (ACL 2023)β15Updated 2 years ago
- AbstainQA, ACL 2024β28Updated last year
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learningβ114Updated 5 months ago
- Code for the ACL 2024 paper "PLUG: Leveraging Pivot Language in Cross-Lingual Instruction Tuning"β13Updated 2 months ago
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructionsβ48Updated last year
- Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrievalβ51Updated 4 months ago
- The repository of the project "Fine-tuning Large Language Models with Sequential Instructions", code base comes from open-instruct and LAβ¦β29Updated 10 months ago
- QRHead: Query-Focused Retrieval Heads Improve Long-Context Reasoning and Re-rankingβ24Updated last month
- The Unreliability of Explanations in Few-shot Prompting for Textual Reasoning (NeurIPS 2022)β16Updated 2 years ago
- Long Context Extension and Generalization in LLMsβ61Updated last year
- β57Updated 10 months ago
- β53Updated last year