nightdessert / Retrieval_Head
open-source code for paper: Retrieval Head Mechanistically Explains Long-Context Factuality
☆172 · Updated 5 months ago
Alternatives and similar repositories for Retrieval_Head:
Users who are interested in Retrieval_Head are comparing it to the repositories listed below.
- Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)" ☆150 · Updated last month
- Official github repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024] ☆130 · Updated 4 months ago
- A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨ ☆153 · Updated 9 months ago
- Repo of paper "Free Process Rewards without Process Labels" ☆110 · Updated last week
- ☆250 · Updated last year
- Reproducible, flexible LLM evaluations ☆127 · Updated last month
- ☆129 · Updated last month
- Code and Data for "Long-context LLMs Struggle with Long In-context Learning" ☆97 · Updated 6 months ago
- The HELMET Benchmark ☆106 · Updated this week
- ☆142 · Updated last week
- Benchmarking LLMs with Challenging Tasks from Real Users ☆208 · Updated 2 months ago
- A Survey on Data Selection for Language Models ☆203 · Updated 3 months ago
- [NeurIPS'24 Spotlight] Observational Scaling Laws ☆49 · Updated 3 months ago
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision ☆112 · Updated 4 months ago
- ACL 2024 | LooGLE: Long Context Evaluation for Long-Context Language Models ☆173 · Updated 3 months ago
- Code accompanying the paper "Massive Activations in Large Language Models" ☆138 · Updated 10 months ago
- Code for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718 ☆306 · Updated 4 months ago
- [NeurIPS 2024] Knowledge Circuits in Pretrained Transformers ☆120 · Updated last month
- 🌾 OAT: A research-friendly framework for LLM online alignment, including preference learning, reinforcement learning, etc. ☆98 · Updated this week
- [NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving* ☆90 · Updated last month
- The repo for In-context Autoencoder ☆104 · Updated 8 months ago
- Self-Alignment with Principle-Following Reward Models ☆152 · Updated 11 months ago
- Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?" ☆69 · Updated 2 months ago
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024) ☆103 · Updated 10 months ago
- ☆47 · Updated 2 months ago
- Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied wit… ☆105 · Updated 6 months ago
- Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs. ☆387 · Updated 9 months ago
- Language models scale reliably with over-training and on downstream tasks ☆96 · Updated 9 months ago
- Implementation of NAACL 2024 Outstanding Paper "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models" ☆134 · Updated 3 months ago
- Code and example data for the paper: Rule Based Rewards for Language Model Safety ☆176 · Updated 6 months ago