megagonlabs / holobenchLinks
π«§ Code for Holistic Reasoning with Long-Context LMs: A Benchmark for Database Operations on Massive Textual Data (Maekawa*, Iso* et al.; ICLR 2025)
β12Updated 5 months ago
Alternatives and similar repositories for holobench
Users that are interested in holobench are comparing it to the libraries listed below
Sorting:
- Common tools for data processingβ17Updated 4 months ago
- Enhancing Legal Case Retrieval via Scaling High-quality Synthetic Query-Candidate Pairs (EMNLP 2024)β13Updated 8 months ago
- SysBench: Can Large Language Models Follow System Messages?β33Updated 11 months ago
- Implementation for ACL 2024 paper "Meta-Task Prompting Elicits Embeddings from Large Language Models"β12Updated last year
- BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrievalβ153Updated 2 months ago
- Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrievalβ50Updated last month
- [ICLR'25] Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?"β77Updated 8 months ago
- Exploring the Limitations of Large Language Models on Multi-Hop Queriesβ27Updated 5 months ago
- β38Updated 2 months ago
- List of papers on Self-Correction of LLMs.β74Updated 7 months ago
- Official Code Repository for LM-Steer Paper: "Word Embeddings Are Steers for Language Models" (ACL 2024 Outstanding Paper Award)β123Updated last month
- The repository of the project "Fine-tuning Large Language Models with Sequential Instructions", code base comes from open-instruct and LAβ¦β29Updated 8 months ago
- Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers".β78Updated last year
- Test-time compute in information retrievalβ38Updated last month
- [ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Trainingβ22Updated 11 months ago
- "FiD-ICL: A Fusion-in-Decoder Approach for Efficient In-Context Learning" (ACL 2023)β15Updated 2 years ago
- Long Context Extension and Generalization in LLMsβ58Updated 10 months ago
- [EMNLP'24] LongHeads: Multi-Head Attention is Secretly a Long Context Processorβ29Updated last year
- An open-source library for contamination detection in NLP datasets and Large Language Models (LLMs).β57Updated last year
- LightThinker: Thinking Step-by-Step Compressionβ68Updated 4 months ago
- β26Updated last month
- NAACL 2024: SeaEval for Multilingual Foundation Models: From Cross-Lingual Alignment to Cultural Reasoningβ25Updated 5 months ago
- A Human-LLM Collaborative Dataset for Generative Information-seeking with Attributionβ35Updated 2 years ago
- β13Updated 8 months ago
- [NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don'tβ¦β114Updated last year
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learningβ108Updated 3 months ago
- Benchmarking Benchmark Leakage in Large Language Modelsβ55Updated last year
- Official Implementation of "Probing Language Models for Pre-training Data Detection"β19Updated 8 months ago
- Library for training process reward modelsβ27Updated 2 months ago
- β115Updated last year