voidism / Lookback-LensLinks
Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"
☆127Updated 10 months ago
Alternatives and similar repositories for Lookback-Lens
Users that are interested in Lookback-Lens are comparing it to the libraries listed below
Sorting:
- Code for In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering☆177Updated 4 months ago
- Official repository for Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning [ICLR 2025]☆45Updated 5 months ago
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales"☆106Updated 8 months ago
- Codes and datasets for the paper Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Ref…☆60Updated 3 months ago
- [ACL'25] What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective☆64Updated last week
- ☆36Updated 5 months ago
- ☆85Updated 7 months ago
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models☆115Updated last year
- ☆123Updated 8 months ago
- The official implementation of "Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks"☆54Updated last month
- Official repository for paper "ReasonIR Training Retrievers for Reasoning Tasks".☆170Updated 3 weeks ago
- ☆150Updated last year
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"☆54Updated 8 months ago
- Conifer: Improving Complex Constrained Instruction-Following Ability of Large Language Models☆88Updated last year
- ☆180Updated 2 months ago
- The first dense retrieval model that can be prompted like an LM☆73Updated last month
- [ICLR'25] Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?"☆76Updated 6 months ago
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".☆205Updated 2 weeks ago
- Code and Data for "Long-context LLMs Struggle with Long In-context Learning" [TMLR2025]☆105Updated 4 months ago
- ☆97Updated 11 months ago
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024)☆112Updated last year
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]☆144Updated 7 months ago
- BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval☆144Updated last month
- Critique-out-Loud Reward Models☆66Updated 8 months ago
- RL Scaling and Test-Time Scaling (ICML'25)☆106Updated 5 months ago
- Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate"☆159Updated 2 weeks ago
- ☆115Updated 4 months ago
- [EMNLP 2024 Findings] OneGen: Efficient One-Pass Unified Generation and Retrieval for LLMs.☆148Updated 7 months ago
- [Preprint] Learning to Filter Context for Retrieval-Augmented Generaton☆193Updated last year
- [IJCAI 2024] FactCHD: Benchmarking Fact-Conflicting Hallucination Detection☆87Updated last year