voidism / Lookback-LensLinks
Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"
☆142Updated 3 months ago
Alternatives and similar repositories for Lookback-Lens
Users that are interested in Lookback-Lens are comparing it to the libraries listed below
Sorting:
- ☆161Updated last year
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models☆126Updated last year
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales"☆112Updated last year
- Code for In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering☆197Updated 11 months ago
- Official repository for Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning [ICLR 2025]☆50Updated last year
- [ACL'25 Oral] What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective☆75Updated 7 months ago
- ☆123Updated 11 months ago
- ☆107Updated last year
- Benchmarking LLMs with Challenging Tasks from Real Users☆245Updated last year
- ☆130Updated last year
- Code accompanying "How I learned to start worrying about prompt formatting".☆113Updated 7 months ago
- The first dense retrieval model that can be prompted like an LM☆90Updated 8 months ago
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆146Updated last year
- [NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.☆156Updated 2 years ago
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"☆54Updated last year
- ☆99Updated last year
- [TMLR 2026] When Attention Collapses: How Degenerate Layers in LLMs Enable Smaller, Stronger Models☆121Updated 11 months ago
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"☆91Updated last year
- Official repository for paper "ReasonIR Training Retrievers for Reasoning Tasks".☆217Updated 7 months ago
- Conifer: Improving Complex Constrained Instruction-Following Ability of Large Language Models☆89Updated last year
- ☆52Updated 8 months ago
- Evaluating LLMs with fewer examples☆169Updated last year
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]☆148Updated last year
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".☆224Updated last month
- Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate" [COLM 2025]☆178Updated 6 months ago
- Source code for our paper: "Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction A…☆49Updated 2 years ago
- [NeurIPS 2024] Knowledge Circuits in Pretrained Transformers☆163Updated 2 months ago
- Complex Function Calling Benchmark.☆163Updated last year
- Scalable Meta-Evaluation of LLMs as Evaluators☆43Updated last year
- Code and data releases for the paper -- DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory☆58Updated 11 months ago