IAAR-Shanghai / Awesome-Attention-Heads
An awesome repository & A comprehensive survey on interpretability of LLM attention heads.
☆265Updated this week
Related projects ⓘ
Alternatives and complementary repositories for Awesome-Attention-Heads
- The official repository of our survey paper: "Towards a Unified View of Preference Learning for Large Language Models: A Survey"☆152Updated 3 weeks ago
- xFinder: Robust and Pinpoint Answer Extraction for Large Language Models☆146Updated 3 weeks ago
- Code for ACL 2024 paper "TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space"☆131Updated 7 months ago
- Explore concepts like Self-Correct, Self-Refine, Self-Improve, Self-Contradict, Self-Play, and Self-Knowledge, alongside o1-like reasonin…☆161Updated this week
- Toolkit for Prompt Compression☆245Updated last month
- [ACL 2024] RA-ISF: Learning to Answer and Understand from Retrieval Augmentation via Iterative Self-Feedback.☆181Updated 2 months ago
- Controllable Text Generation for Large Language Models: A Survey☆142Updated 2 months ago
- MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation Experts☆140Updated last month
- Connect agents to live web environments evaluation.☆202Updated 2 months ago
- ☆326Updated 2 weeks ago
- ☆197Updated 2 weeks ago
- MoH: Multi-Head Attention as Mixture-of-Head Attention☆157Updated 3 weeks ago
- Prompt Learning using Metaheuristics☆135Updated 9 months ago
- [🏆Outstanding Paper Award at ACL 2024] MMToM-QA: Multimodal Theory of Mind Question Answering☆128Updated 3 weeks ago
- Tool-Planner: Dynamic Solution Tree Planning for Large Language Model with Tool Clustering☆102Updated 5 months ago
- A curated list of LLM Interpretability related material - Tutorial, Library, Survey, Paper, Blog, etc..☆174Updated last month
- Benchmarking LLMs via Uncertainty Quantification☆221Updated 9 months ago
- Collection of Reverse Engineering in Large Model☆30Updated 2 weeks ago
- Simple PyTorch implementation of "Libra: Building Decoupled Vision System on Large Language Models" (accepted by ICML 2024)☆143Updated last month
- open-source code for paper: Retrieval Head Mechanistically Explains Long-Context Factuality☆160Updated 3 months ago
- State-of-the-art Parameter-Efficient MoE Fine-tuning Method☆92Updated 2 months ago
- [ACL 2024] User-friendly evaluation framework: Eval Suite & Benchmarks: UHGEval, HaluEval, HalluQA, etc.☆184Updated last week
- Multi-Agent-GPT: 一款基于RAG和agent构建的多模态专家助手GPT。它集成了文本、图像和音频等模态工具。支持本地部署和私有数据库建设。☆221Updated 9 months ago
- MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models☆387Updated 9 months ago
- A recipe for online RLHF and online iterative DPO.☆434Updated last week
- Grimoire is All You Need for Enhancing Large Language Models☆116Updated 8 months ago
- The Paper List on Data Contamination for Large Language Models Evaluation.☆76Updated this week
- Interpretable Contrastive Monte Carlo Tree Search Reasoning☆24Updated last week
- LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models☆69Updated last month
- [ACL 2024] Long-Context Language Modeling with Parallel Encodings☆147Updated 5 months ago