megagonlabs / holobenchLinks
π«§ Code for Holistic Reasoning with Long-Context LMs: A Benchmark for Database Operations on Massive Textual Data (Maekawa*, Iso* et al.; ICLR 2025)
β12Updated 8 months ago
Alternatives and similar repositories for holobench
Users that are interested in holobench are comparing it to the libraries listed below
Sorting:
- Common tools for data processingβ21Updated 3 weeks ago
- SysBench: Can Large Language Models Follow System Messages?β35Updated last year
- Enhancing Legal Case Retrieval via Scaling High-quality Synthetic Query-Candidate Pairs (EMNLP 2024)β15Updated 11 months ago
- Implementation for ACL 2024 paper "Meta-Task Prompting Elicits Embeddings from Large Language Models"β12Updated last year
- Suri: Multi-constraint instruction following for long-form text generation (EMNLPβ24)β26Updated last month
- Official Implementation of "Probing Language Models for Pre-training Data Detection"β20Updated 11 months ago
- Offcial Repo of Paper "Eliminating Position Bias of Language Models: A Mechanistic Approach""β19Updated 4 months ago
- Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models [NeurIPS 2024]β75Updated 11 months ago
- β45Updated 5 months ago
- β22Updated 10 months ago
- β30Updated 10 months ago
- GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.β63Updated last year
- LongProc: Benchmarking Long-Context Language Models on Long Procedural Generationβ30Updated 3 weeks ago
- The paper list of multilingual pre-trained models (Continual Updated).β23Updated last year
- The repository of the project "Fine-tuning Large Language Models with Sequential Instructions", code base comes from open-instruct and LAβ¦β30Updated 11 months ago
- Implementation of AdaCQR(COLING 2025)β12Updated 10 months ago
- [EMNLP'24] LongHeads: Multi-Head Attention is Secretly a Long Context Processorβ31Updated last year
- "FiD-ICL: A Fusion-in-Decoder Approach for Efficient In-Context Learning" (ACL 2023)β15Updated 2 years ago
- [ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Trainingβ22Updated last year
- [ICLR'25] "Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers"β36Updated 7 months ago
- Short RLβ14Updated 5 months ago
- Official Implementation of "DeCoRe: Decoding by Contrasting Retrieval Heads to Mitigate Hallucination"β25Updated 10 months ago
- [ICLR 2025] BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrievalβ172Updated last month
- Official repository for ACL 2025 paper "Model Extrapolation Expedites Alignment"β75Updated 5 months ago
- β40Updated last year
- [Neurips2023] Source code for Lift Yourself Up: Retrieval-augmented Text Generation with Self Memoryβ62Updated 2 years ago
- β55Updated last year
- List of papers on Self-Correction of LLMs.β80Updated 10 months ago
- Towards Systematic Measurement for Long Text Qualityβ37Updated last year
- NAACL 2024: SeaEval for Multilingual Foundation Models: From Cross-Lingual Alignment to Cultural Reasoningβ26Updated 8 months ago