IAAR-Shanghai / Awesome-Attention-Heads
An awesome repository & A comprehensive survey on interpretability of LLM attention heads.
☆352 Updated 3 months ago
Alternatives and similar repositories for Awesome-Attention-Heads
Users that are interested in Awesome-Attention-Heads are comparing it to the libraries listed below
- The official repository of our survey paper: "Towards a Unified View of Preference Learning for Large Language Models: A Survey" ☆181 Updated 7 months ago
- [ICLR 2025] xFinder: Large Language Models as Automated Evaluators for Reliable Evaluation ☆175 Updated 4 months ago
- Explore concepts like Self-Correct, Self-Refine, Self-Improve, Self-Contradict, Self-Play, and Self-Knowledge, alongside o1-like reasonin… ☆169 Updated 6 months ago
- Code for ACL 2024 paper "TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space" ☆146 Updated last year
- A curated list of LLM Interpretability related material - Tutorial, Library, Survey, Paper, Blog, etc.. ☆253 Updated 3 months ago
- All-in-one Web Agent framework for post-training. Start building with a few clicks! ☆260 Updated 2 weeks ago
- [ICLR 2025] MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation Experts ☆227 Updated 8 months ago
- [ACL 2024] RA-ISF: Learning to Answer and Understand from Retrieval Augmentation via Iterative Self-Feedback. ☆190 Updated 9 months ago
- A recipe for online RLHF and online iterative DPO. ☆520 Updated 5 months ago
- ☆208 Updated last month
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning ☆222 Updated last month
- [ICLR 2025] Tool-Planner: Task Planning with Clusters across Multiple Tools ☆113 Updated last month
- Recipes to train the self-rewarding reasoning LLMs. ☆223 Updated 3 months ago
- Collection of Reverse Engineering in Large Model ☆32 Updated 5 months ago
- Toolkit for Prompt Compression ☆266 Updated 4 months ago
- This paper list focuses on the theoretical and empirical analysis of language models, especially large language models (LLMs). The papers… ☆85 Updated 6 months ago
- ☆202 Updated 7 months ago
- ☆355 Updated last week
- ICML 2025 Spotlight ☆203 Updated this week
- ☆363 Updated 3 weeks ago
- The official repo for paper, LLMs-as-Judges: A Comprehensive Survey on LLM-based Evaluation Methods. ☆389 Updated 6 months ago
- Benchmarking LLMs via Uncertainty Quantification ☆234 Updated last year
- A Survey on Data Selection for Language Models ☆237 Updated last month
- AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models, ICLR 2025 (Outstanding Paper) ☆271 Updated last week
- Awesome LLM Self-Consistency: a curated list of Self-consistency in Large Language Models ☆99 Updated 10 months ago
- Controllable Text Generation for Large Language Models: A Survey ☆179 Updated 10 months ago
- Survey of Small Language Models from Penn State, ... ☆183 Updated last month
- Repo for the EMNLP'24 Paper "Dual-Space Knowledge Distillation for Large Language Models". A general white-box KD framework for both same… ☆54 Updated 7 months ago
- ☆109 Updated 3 months ago
- ☆203 Updated 4 months ago