tqfang / comet-deepspeed
Train large COMET models (T5-3B/GPT2-XL) on GPUs with limited memory (11GB cards like the 1080/2080) using DeepSpeed.
☆14 Updated 3 years ago
Alternatives and similar repositories for comet-deepspeed
Users who are interested in comet-deepspeed are comparing it to the repositories listed below.
- The code implementation of the EMNLP2022 paper: DisCup: Discriminator Cooperative Unlikelihood Prompt-tuning for Controllable Text Gene… ☆27 Updated last year
- Code and Data for NeurIPS2021 Paper "A Dataset for Answering Time-Sensitive Questions" ☆74 Updated 3 years ago
- We construct and introduce DIALFACT, a testing benchmark dataset crowd-annotated conversational claims, paired with pieces of evidence fr… ☆43 Updated 3 years ago
- Code and resources for papers "Generation-Augmented Retrieval for Open-Domain Question Answering" and "Reader-Guided Passage Reranking fo… ☆74 Updated 3 years ago
- Code and data of the EMNLP 2022 Main Conference paper "Reduce Catastrophic Forgetting of Dense Retrieval Training with Teleportation Nega… ☆18 Updated last year
- Code for EMNLP 2021 paper "CLIFF: Contrastive Learning for Improving Faithfulness and Factuality in Abstractive Summarization" ☆46 Updated 3 years ago
- Technical Report: Is ChatGPT a Good NLG Evaluator? A Preliminary Study ☆43 Updated 2 years ago
- [ACL 2022] Ditch the Gold Standard: Re-evaluating Conversational Question Answering ☆44 Updated 3 years ago
- [NeurIPS 2022] Generating Training Data with Language Models: Towards Zero-Shot Language Understanding ☆69 Updated 3 years ago
- Repository for ACL'22 paper: Dynamic Latent Extraction for Abstractive Long-Input Summarization ☆55 Updated 2 years ago
- An Interpretable Neuro-Symbolic Framework for Task-Oriented Dialogue Generation ☆23 Updated 3 years ago
- Code, datasets, and checkpoints for the paper "Improving Passage Retrieval with Zero-Shot Question Generation (EMNLP 2022)" ☆100 Updated 2 years ago
- TBC ☆27 Updated 3 years ago
- Code of ACL 2022 paper Debiased Contrastive Learning of Unsupervised Sentence Representations ☆31 Updated 3 years ago
- Dataset, metrics, and models for TACL 2023 paper MACSUM: Controllable Summarization with Mixed Attributes. ☆34 Updated 2 years ago
- Codes for the EMNLP2021 paper: Benchmarking Commonsense Knowledge Base Population (https://aclanthology.org/2021.emnlp-main.705.pdf). An … ☆26 Updated last year
- Bootstrapped Unsupervised Sentence Representation Learning (ACL 2021) ☆30 Updated 3 years ago
- ☆50 Updated 2 years ago
- Code for NAACL 2021 full paper "Efficient Attentions for Long Document Summarization" ☆68 Updated 4 years ago
- Code repo for SIGIR 2021 paper "Few-Shot Conversational Dense Retrieval" ☆43 Updated 3 years ago
- Code for embedding and retrieval research. ☆17 Updated 2 years ago
- Findings of ACL 2021 ☆24 Updated 4 years ago
- Code and models for the paper "Questions Are All You Need to Train a Dense Passage Retriever (TACL 2023)" ☆62 Updated 2 years ago
- Code for Editing Factual Knowledge in Language Models ☆142 Updated 3 years ago
- [EMNLP 2022] Code and data for "Controllable Dialogue Simulation with In-Context Learning" ☆35 Updated 2 years ago
- Code and data for "Retrieval Enhanced Model for Commonsense Generation" (ACL-IJCNLP 2021). ☆29 Updated 3 years ago
- [EMNLP 2022] Training Language Models with Memory Augmentation https://arxiv.org/abs/2205.12674 ☆196 Updated 2 years ago
- This project maintains a reading list for general text generation tasks ☆66 Updated 3 years ago
- ☆46 Updated last year
- An original implementation of "Noisy Channel Language Model Prompting for Few-Shot Text Classification" ☆131 Updated 3 years ago