nightdessert / Retrieval_Head
open-source code for paper: Retrieval Head Mechanistically Explains Long-Context Factuality
☆172Updated 6 months ago
Alternatives and similar repositories for Retrieval_Head:
Users that are interested in Retrieval_Head are comparing it to the libraries listed below
- Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"☆153Updated 2 months ago
- Official github repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024]☆130Updated 4 months ago
- Code and Data for "Long-context LLMs Struggle with Long In-context Learning"☆100Updated 7 months ago
- ☆129Updated last month
- A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨☆168Updated 9 months ago
- Repo of paper "Free Process Rewards without Process Labels"☆118Updated 3 weeks ago
- The HELMET Benchmark☆114Updated last week
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision☆115Updated 5 months ago
- 🌾 OAT: A research-friendly framework for LLM online alignment, including preference learning, reinforcement learning, etc.☆182Updated last week
- ☆146Updated last week
- [ICLR'25] Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?"☆69Updated 2 months ago
- [NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*☆94Updated 2 months ago
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024)☆103Updated 10 months ago
- [EMNLP 2024 (Oral)] Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA☆111Updated 3 months ago
- Reproducible, flexible LLM evaluations☆158Updated 2 months ago
- Function Vectors in Large Language Models (ICLR 2024)☆137Updated 4 months ago
- Code accompanying the paper "Massive Activations in Large Language Models"☆140Updated 11 months ago
- ☆89Updated last year
- Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extremely Length (ICLR 2024)☆204Updated 8 months ago
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆93Updated 3 months ago
- ☆50Updated 2 months ago
- Code and example data for the paper: Rule Based Rewards for Language Model Safety☆178Updated 6 months ago
- PASTA: Post-hoc Attention Steering for LLMs☆111Updated 2 months ago
- The Paper List on Data Contamination for Large Language Models Evaluation.☆90Updated last month
- Implementation of NAACL 2024 Outstanding Paper "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"☆135Updated 4 months ago
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".☆156Updated this week
- ☆58Updated 9 months ago
- [NeurIPS'24 Spotlight] Observational Scaling Laws☆49Updated 4 months ago
- The official repository of the Omni-MATH benchmark.☆71Updated last month