contrastive decoding
☆207 · Nov 14, 2022 · Updated 3 years ago
Alternatives and similar repositories for ContrastiveDecoding
Users who are interested in ContrastiveDecoding compare it to the repositories listed below.
- [NeurIPS'22 Spotlight] A Contrastive Framework for Neural Text Generation ☆475 · Mar 7, 2024 · Updated last year
- Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models" ☆541 · Jan 17, 2025 · Updated last year
- Diffusion-LM ☆1,224 · Aug 8, 2024 · Updated last year
- Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations" ☆69 · Feb 27, 2024 · Updated 2 years ago
- Official Implementation of ACL2023: Don't Parse, Choose Spans! Continuous and Discontinuous Constituency Parsing via Autoregressive Span … ☆14 · Aug 25, 2023 · Updated 2 years ago
- An Empirical Study On Contrastive Search And Contrastive Decoding For Open-ended Text Generation ☆27 · Jun 7, 2024 · Updated last year
- Revisiting Cross-Lingual Summarization: A Corpus-based Study and A New Benchmark with Improved Annotation ☆19 · Mar 23, 2024 · Updated last year
- Repository of the paper "Accelerating Transformer Inference for Translation via Parallel Decoding" ☆123 · Mar 15, 2024 · Updated last year
- An automatic MLLM hallucination detection framework ☆19 · Sep 26, 2023 · Updated 2 years ago
- ☆16 · Jul 11, 2023 · Updated 2 years ago
- [NeurIPS 2023] Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective ☆40 · Oct 17, 2023 · Updated 2 years ago
- [TMLR'23] Contrastive Search Is What You Need For Neural Text Generation ☆123 · Mar 5, 2023 · Updated 2 years ago
- [NeurIPS'22 Spotlight] Data and code for our paper CoNT: Contrastive Neural Text Generation ☆152 · May 10, 2023 · Updated 2 years ago
- ☆98 · Jun 6, 2022 · Updated 3 years ago
- Code for "Discovering Non-monotonic Autoregressive Orderings with Variational Inference" (paper and code updated from ICLR 2021) ☆12 · Mar 7, 2024 · Updated last year
- Official Code for EMNLP2023 Main Conference paper: "KCTS: Knowledge-Constrained Tree Search Decoding with Token-Level Hallucination Detec… ☆30 · Nov 14, 2023 · Updated 2 years ago
- ☆13 · Feb 7, 2023 · Updated 3 years ago
- ☆50 · Feb 5, 2023 · Updated 3 years ago
- A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643 ☆78 · Sep 4, 2023 · Updated 2 years ago
- [ICLR'23] DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models ☆828 · Mar 1, 2024 · Updated last year
- PyTorch + HuggingFace code for RetoMaton: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022), including an… ☆286 · Oct 20, 2022 · Updated 3 years ago
- Momentum Decoding: Open-ended Text Generation as Graph Exploration ☆19 · Jan 27, 2023 · Updated 3 years ago
- ☆89 · Nov 11, 2022 · Updated 3 years ago
- This is the code for neural-Jacana aligner, and the data for MultiMWA dataset. ☆20 · Feb 12, 2023 · Updated 3 years ago
- ☆114 · Jun 9, 2022 · Updated 3 years ago
- In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024) ☆62 · Mar 30, 2024 · Updated last year
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024) ☆127 · Mar 30, 2024 · Updated last year
- A benchmark dataset for evaluating dialog system and natural language generation metrics. ☆39 · Jun 13, 2022 · Updated 3 years ago
- Inference-Time Intervention: Eliciting Truthful Answers from a Language Model ☆571 · Jan 28, 2025 · Updated last year
- [EMNLP 2022] Training Language Models with Memory Augmentation https://arxiv.org/abs/2205.12674 ☆195 · Jun 14, 2023 · Updated 2 years ago
- Knowledge Infused Decoding ☆71 · Dec 31, 2023 · Updated 2 years ago
- Code for T-Few from "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning" ☆457 · Sep 6, 2023 · Updated 2 years ago
- [ICLR 2024] Analyzing and Mitigating Object Hallucination in Large Vision-Language Models ☆155 · Apr 30, 2024 · Updated last year
- ☆1,560 · Feb 20, 2026 · Updated last week
- open-source code for paper: Retrieval Head Mechanistically Explains Long-Context Factuality ☆231 · Aug 2, 2024 · Updated last year
- [EMNLP 2023] Once Upon a *Time* in *Graph*: Relative-Time Pretraining for Complex Temporal Reasoning ☆17 · Oct 31, 2023 · Updated 2 years ago
- Learning to Model Editing Processes ☆26 · Aug 3, 2025 · Updated 6 months ago
- ☆20 · May 30, 2024 · Updated last year
- Code for Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model ☆13 · Feb 15, 2024 · Updated 2 years ago