deep-spin / lmt_hallucinationsLinks
☆18Updated 2 years ago
Alternatives and similar repositories for lmt_hallucinations
Users that are interested in lmt_hallucinations are comparing it to the libraries listed below
Sorting:
- ☆16Updated 2 years ago
- Measuring the Mixing of Contextual Information in the Transformer☆34Updated 2 years ago
- Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers".☆80Updated last year
- Materials for "Quantifying the Plausibility of Context Reliance in Neural Machine Translation" at ICLR'24 🐑 🐑☆15Updated last year
- The geometry of multilingual language model representations (EMNLP 2022).☆22Updated 3 years ago
- ☆43Updated last year
- A curated list of research papers and resources on Cultural LLM.☆53Updated last year
- ☆20Updated 2 years ago
- ☆22Updated 2 years ago
- [NeurIPS'23] Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors☆83Updated last year
- The original Backpack Language Model implementation, a fork of FlashAttention☆71Updated 2 years ago
- Interpreting Language Models with Contrastive Explanations (EMNLP 2022 Best Paper Honorable Mention)☆62Updated 3 years ago
- ☆91Updated last year
- ☆58Updated 2 years ago
- ☆104Updated 2 years ago
- The accompanying code for "Transformer Feed-Forward Layers Are Key-Value Memories". Mor Geva, Roei Schuster, Jonathan Berant, and Omer Le…☆99Updated 4 years ago
- ☆13Updated 2 years ago
- Detect hallucinated tokens for conditional sequence generation.☆64Updated 3 years ago
- Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)☆63Updated 2 years ago
- Teaching Models to Express Their Uncertainty in Words☆39Updated 3 years ago
- "Understanding Dataset Difficulty with V-Usable Information" (ICML 2022, outstanding paper)☆90Updated 2 years ago
- Unofficial re-implementation of "Trusting Your Evidence: Hallucinate Less with Context-aware Decoding"☆33Updated last year
- This is the oficial repository for "Parameter-Efficient Multi-task Tuning via Attentional Mixtures of Soft Prompts" (EMNLP 2022)☆104Updated 3 years ago
- ☆13Updated last year
- Easy-to-use MIRAGE code for faithful answer attribution in RAG applications. Paper: https://aclanthology.org/2024.emnlp-main.347/☆26Updated 11 months ago
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023☆107Updated last year
- Code for "Tracing Knowledge in Language Models Back to the Training Data"☆39Updated 3 years ago
- ☆18Updated 3 years ago
- This repository includes code for the paper "Does Localization Inform Editing? Surprising Differences in Where Knowledge Is Stored vs. Ca…☆60Updated 2 years ago
- Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models☆48Updated 2 years ago