nightdessert / Retrieval_Head

Open-source code for the paper "Retrieval Head Mechanistically Explains Long-Context Factuality"

☆205

Alternatives and similar repositories for Retrieval_Head

Users who are interested in Retrieval_Head are comparing it to the repositories listed below.

princeton-nlp / ProLong
Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"
☆218 Updated 5 months ago
princeton-nlp / HELMET
The HELMET Benchmark
☆162 Updated 3 months ago
QwenLM / ProcessBench
Official repository for ACL 2025 paper "ProcessBench: Identifying Process Errors in Mathematical Reasoning"
☆167 Updated 2 months ago
hkust-nlp / dart-math
[NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*
☆111 Updated 8 months ago
da03 / Internalize_CoT_Step_by_Step
☆187 Updated 3 months ago
TIGER-AI-Lab / LongICLBench
Code and Data for "Long-context LLMs Struggle with Long In-context Learning" [TMLR2025]
☆105 Updated 5 months ago
haonan3 / AnchorContext
AnchorAttention: Improved attention for LLMs long-context training
☆212 Updated 6 months ago
cmu-l3 / l1
L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning
☆237 Updated 2 months ago
microsoft / rho
Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.
☆428 Updated last year
getao / icae
The repo for In-context Autoencoder
☆133 Updated last year
alisawuffles / proxy-tuning
Code associated with Tuning Language Models by Proxy (Liu et al., 2024)
☆114 Updated last year
thu-wyz / inference_scaling
☆71 Updated 8 months ago
HKUNLP / STRING
[ICLR'25] Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?"
☆77 Updated 8 months ago
ZubinGou / math-evaluation-harness
A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨
☆239 Updated last year
PRIME-RL / ImplicitPRM
Repo of paper "Free Process Rewards without Process Labels"
☆161 Updated 4 months ago
princeton-pli / LongProc
LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation
☆26 Updated last month
genrm-star / genrm-critiques
GenRM-CoT: Data release for verification rationales
☆63 Updated 9 months ago
roeehendel / icl_task_vectors
☆96 Updated last year
eddycmu / demystify-long-cot
☆310 Updated 2 months ago
RZFan525 / Awesome-ScalingLaws
A curated list of awesome resources dedicated to Scaling Laws for LLMs
☆77 Updated 2 years ago
Glaciohound / LM-Infinite
Implementation of NAACL 2024 Outstanding Paper "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"
☆148 Updated 4 months ago
GAIR-NLP / OctoThinker
Revisiting Mid-training in the Era of Reinforcement Learning Scaling
☆161 Updated 2 weeks ago
ericwtodd / function_vectors
Function Vectors in Large Language Models (ICLR 2024)
☆176 Updated 3 months ago
kanishkg / cognitive-behaviors
☆203 Updated 4 months ago
ryoungj / ObsScaling
[NeurIPS'24 Spotlight] Observational Scaling Laws
☆56 Updated 10 months ago
TIGER-AI-Lab / verl-tool
A version of verl to support tool use
☆315 Updated this week
princeton-nlp / CEPE
[ACL 2024] Long-Context Language Modeling with Parallel Encodings
☆157 Updated last year
princeton-nlp / AutoCompressors
[EMNLP 2023] Adapting Language Models to Compress Long Contexts
☆309 Updated 11 months ago
GAIR-NLP / LIMR
☆206 Updated 5 months ago
hkust-nlp / llm-compression-intelligence
Official github repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024]
☆140 Updated 10 months ago