voidism / Lookback-Lens
Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"
☆109Updated 2 months ago
Related projects ⓘ
Alternatives and complementary repositories for Lookback-Lens
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆114Updated 6 months ago
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales"☆93Updated last month
- ☆123Updated 6 months ago
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models☆102Updated 6 months ago
- This is the official repository for Inheritune.☆105Updated last month
- The first dense retrieval model that can be prompted like an LM☆62Updated last month
- ☆111Updated last month
- Code and data for CoachLM, an automatic instruction revision approach LLM instruction tuning.☆58Updated 7 months ago
- MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents [EMNLP 2024]☆100Updated 3 weeks ago
- Codebase accompanying the Summary of a Haystack paper.☆71Updated last month
- Benchmarking LLMs with Challenging Tasks from Real Users☆194Updated this week
- A simple unified framework for evaluating LLMs☆138Updated this week
- Code for In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering☆142Updated 3 weeks ago
- ☆41Updated this week
- Self-Evolved Diverse Data Sampling for Efficient Instruction Tuning☆66Updated 10 months ago
- Scalable Meta-Evaluation of LLMs as Evaluators☆41Updated 8 months ago
- Code and Data for "Long-context LLMs Struggle with Long In-context Learning"☆91Updated 4 months ago
- The official implementation of "Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks"☆50Updated 6 months ago
- Conifer: Improving Complex Constrained Instruction-Following Ability of Large Language Models☆80Updated 7 months ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆76Updated 7 months ago
- Scripts for generating synthetic finetuning data for reducing sycophancy.☆105Updated last year
- DSBench: How Far are Data Science Agents from Becoming Data Science Experts?☆34Updated 2 weeks ago
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".☆128Updated this week
- Code accompanying "How I learned to start worrying about prompt formatting".☆92Updated last month
- 🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Pap…☆106Updated 2 weeks ago
- ☆116Updated 5 months ago
- ☆44Updated last month
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆61Updated 3 months ago
- ☆56Updated 8 months ago
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]☆124Updated 2 weeks ago