yxuansu / Contrastive_Search_Is_What_You_Need
[TMLR'23] Contrastive Search Is What You Need For Neural Text Generation
☆119Updated last year
Alternatives and similar repositories for Contrastive_Search_Is_What_You_Need:
Users who are interested in Contrastive_Search_Is_What_You_Need are comparing it to the repositories listed below
- PyTorch implementation of EncT5: Fine-tuning T5 Encoder for Non-autoregressive Tasks☆63Updated 3 years ago
- ☆97Updated 2 years ago
- Official code and model checkpoints for our EMNLP 2022 paper "RankGen - Improving Text Generation with Large Ranking Models" (https://arx…☆136Updated last year
- Long-context pretrained encoder-decoder models☆94Updated 2 years ago
- ☆96Updated 2 years ago
- Detect hallucinated tokens for conditional sequence generation.☆64Updated 2 years ago
- BLOOM+1: Adapting BLOOM model to support a new unseen language☆70Updated 10 months ago
- ☆73Updated last year
- Code base of In-Context Learning for Dialogue State tracking☆45Updated last year
- Token-level Reference-free Hallucination Detection☆93Updated last year
- The official repository for the paper "Efficient Long-Text Understanding Using Short-Text Models" (Ivgi et al., 2022)☆68Updated last year
- [NeurIPS'22 Spotlight] Data and code for our paper CoNT: Contrastive Neural Text Generation☆150Updated last year
- The original Backpack Language Model implementation, a fork of FlashAttention☆66Updated last year
- ☆43Updated last year
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data; it should also work with any Hugging Face text dataset.☆93Updated last year
- Dataset for NAACL 2021 paper: "QMSum: A New Benchmark for Query-based Multi-domain Meeting Summarization"☆116Updated last year
- Transformers at any scale☆41Updated last year
- Dense hybrid representations for text retrieval☆62Updated last year
- [AAAI 2024] Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following☆79Updated 4 months ago
- Official Implementation of "DialogLM: Pre-trained Model for Long Dialogue Understanding and Summarization."☆137Updated 2 years ago
- ☆23Updated last year
- Code, datasets, and checkpoints for the paper "Improving Passage Retrieval with Zero-Shot Question Generation (EMNLP 2022)"☆99Updated 2 years ago
- Repo for "On Learning to Summarize with Large Language Models as References"☆44Updated last year
- ☆37Updated 2 years ago
- Code for the arXiv paper: "LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond"☆59Updated this week
- Code for the paper InstructDial: Improving Zero and Few-shot Generalization in Dialogue through Instruction Tuning☆99Updated last year
- Pre-training BART in Flax on The Pile dataset☆20Updated 3 years ago
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023☆99Updated 9 months ago
- A Multilingual Replicable Instruction-Following Model☆94Updated last year
- ☆25Updated 2 years ago