megagonlabs / holobench
🫧 Code for Holistic Reasoning with Long-Context LMs: A Benchmark for Database Operations on Massive Textual Data (Maekawa*, Iso* et al.; ICLR 2025)
☆12 Updated 3 months ago
Alternatives and similar repositories for holobench
Users who are interested in holobench are comparing it to the libraries listed below.
- Common tools for data processing ☆13 Updated 2 months ago
- [ICLR'25] "Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers" ☆19 Updated 2 months ago
- ☆20 Updated last month
- ☆22 Updated 5 months ago
- Benchmarking Benchmark Leakage in Large Language Models ☆51 Updated last year
- PathPiece tokenizer ☆12 Updated 6 months ago
- Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval ☆50 Updated 7 months ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling ☆50 Updated 5 months ago
- AbstainQA, ACL 2024 ☆25 Updated 7 months ago
- SysBench: Can Large Language Models Follow System Messages? ☆30 Updated 9 months ago
- "FiD-ICL: A Fusion-in-Decoder Approach for Efficient In-Context Learning" (ACL 2023) ☆14 Updated last year
- Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering ☆59 Updated 5 months ago
- A probabilistic model for contextual word representation. Accepted to ACL 2023 Findings. ☆23 Updated last year
- ☆29 Updated 5 months ago
- Confidence Regulation Neurons in Language Models (NeurIPS 2024) ☆10 Updated 4 months ago
- List of papers on Self-Correction of LLMs. ☆73 Updated 5 months ago
- ☆53 Updated 11 months ago
- Official Repo of Paper "Eliminating Position Bias of Language Models: A Mechanistic Approach" ☆14 Updated 9 months ago
- The paper list of multilingual pre-trained models (Continually Updated). ☆22 Updated 11 months ago
- ☆45 Updated 9 months ago
- BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval ☆125 Updated 2 weeks ago
- LongHeads: Multi-Head Attention is Secretly a Long Context Processor ☆29 Updated last year
- Official Implementation of "Probing Language Models for Pre-training Data Detection" ☆19 Updated 6 months ago
- [ICLR'25] Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?" ☆75 Updated 6 months ago
- This repository contains the dataset and code for "WiCE: Real-World Entailment for Claims in Wikipedia" in EMNLP 2023. ☆41 Updated last year
- [ACL 2024] Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language Models ☆21 Updated 10 months ago
- ☆16 Updated last year
- Aioli: A unified optimization framework for language model data mixing ☆25 Updated 4 months ago
- The source code for running LLMs on the AAAR-1.0 benchmark. ☆16 Updated last month
- ☆23 Updated 2 weeks ago