voidism / Lookback-Lens
Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"
☆130 Updated last year
Alternatives and similar repositories for Lookback-Lens
Users who are interested in Lookback-Lens are comparing it to the libraries listed below
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales" ☆110 Updated 11 months ago
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models ☆118 Updated last year
- Official repository for Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning [ICLR 2025] ☆48 Updated 7 months ago
- The first dense retrieval model that can be prompted like an LM ☆86 Updated 4 months ago
- Code for In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering ☆185 Updated 7 months ago
- ☆154 Updated last year
- This is the official repository for Inheritune. ☆113 Updated 7 months ago
- ☆98 Updated 10 months ago
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024) ☆143 Updated 10 months ago
- ☆74 Updated last year
- [NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`. ☆153 Updated last year
- Official repository for paper "ReasonIR Training Retrievers for Reasoning Tasks". ☆198 Updated 2 months ago
- Complex Function Calling Benchmark. ☆132 Updated 7 months ago
- ☆127 Updated 11 months ago
- Official repository for "Scaling Retrieval-Based Language Models with a Trillion-Token Datastore". ☆215 Updated last month
- Code accompanying "How I learned to start worrying about prompt formatting". ☆110 Updated 3 months ago
- [ACL'25 Oral] What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective ☆72 Updated 2 months ago
- Codes and datasets for the paper Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Ref… ☆64 Updated 6 months ago
- Evaluating LLMs with fewer examples ☆161 Updated last year
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024] ☆148 Updated 10 months ago
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning" ☆55 Updated 11 months ago
- ☆100 Updated last year
- [Preprint] Learning to Filter Context for Retrieval-Augmented Generation ☆194 Updated last year
- Scalable Meta-Evaluation of LLMs as Evaluators ☆42 Updated last year
- ☆71 Updated 2 months ago
- HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models ☆51 Updated 9 months ago
- Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate" [COLM 2025] ☆171 Updated 2 months ago
- [IJCAI 2024] FactCHD: Benchmarking Fact-Conflicting Hallucination Detection ☆89 Updated last year
- Conifer: Improving Complex Constrained Instruction-Following Ability of Large Language Models ☆88 Updated last year
- The official implementation of "Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks" ☆54 Updated 3 months ago