tqfang / comet-deepspeedLinks

Train large COMET (T5-3B/GPT2-XL) with small memory (on 11GB memory GPUs like 1080/2080) using DeepSpeed.

☆14

Alternatives and similar repositories for comet-deepspeed

Users that are interested in comet-deepspeed are comparing it to the libraries listed below

Sorting:

princeton-nlp / EvalConvQA
[ACL 2022] Ditch the Gold Standard: Re-evaluating Conversational Question Answering
☆45Updated 3 years ago
TevenLeScao / pet
This repository contains the code for "How many data points is a prompt worth?"
☆48Updated 4 years ago
SiyuanWangw / StepwiseQA
The code of Paper "Locate Then Ask: Interpretable Stepwise Reasoning for Multi-hop Question Answering".
☆21Updated 2 years ago
izhx / uni-rep
Code for embedding and retrieval research.
☆16Updated last year
thunlp / ConvDR
Code repo for SIGIR 2021 paper "Few-Shot Conversational Dense Retrieval"
☆41Updated 3 years ago
HanNight / RE-T5
Code and data for "Retrieval Enhanced Model for Commonsense Generation" (ACL-IJCNLP 2021).
☆28Updated 3 years ago
txsun1997 / Metric-Fairness
EMNLP'2022: BERTScore is Unfair: On Social Bias in Language Model-Based Metrics for Text Generation
☆41Updated 2 years ago
lyutyuh / structured-span-selector
A Structured Span Selector (NAACL 2022). A structured span selector with a WCFG for span selection tasks (coreference resolution, semanti…
☆21Updated 3 years ago
swj0419 / kNN_prompt
TBC
☆27Updated 2 years ago
peterwestuw / surface-form-competition
☆58Updated 3 years ago
wenhuchen / Time-Sensitive-QA
Code and Data for NeurIPS2021 Paper "A Dataset for Answering Time-Sensitive Questions"
☆72Updated 3 years ago
yumeng5 / SuperGen
[NeurIPS 2022] Generating Training Data with Language Models: Towards Zero-Shot Language Understanding
☆67Updated 2 years ago
salesforce / DialFact
We construct and introduce DIALFACT, a testing benchmark dataset crowd-annotated conversational claims, paired with pieces of evidence fr…
☆41Updated 2 years ago
taoshen58 / LexMAE
☆21Updated 2 years ago
jiacheng-ye / ZeroGen
[EMNLP 2022] Code for our paper “ZeroGen: Efficient Zero-shot Learning via Dataset Generation”.
☆48Updated 3 years ago
HKUST-KnowComp / WinoWhy
WinoWhy provides human-annotated reasons for answering WSC questions.
☆18Updated 5 years ago
AkariAsai / evidentiality_qa
The official implemetation of "Evidentiality-guided Generation for Knowledge-Intensive NLP Tasks" (NAACL 2022).
☆44Updated 2 years ago
HKUNLP / ZeroGen
[EMNLP 2022] Code for our paper “ZeroGen: Efficient Zero-shot Learning via Dataset Generation”.
☆16Updated 3 years ago
microsoft / SEED-Encoder
☆45Updated 3 years ago
krystalan / chatgpt_as_nlg_evaluator
Technical Report: Is ChatGPT a Good NLG Evaluator? A Preliminary Study
☆43Updated 2 years ago
OpenMatch / ANCE-Tele
Code and data of the EMNLP 2022 Main Conference paper "Reduce Catastrophic Forgetting of Dense Retrieval Training with Teleportation Nega…
☆18Updated last year
wyu97 / RACo
Resources for Retrieval Augmentation for Commonsense Reasoning: A Unified Approach. EMNLP 2022.
☆22Updated 2 years ago
Liyan06 / AggreFact
Understanding Factual Errors in Summarization: Errors, Summarizers, Datasets, Error Detectors (ACL 2023)
☆25Updated last year
XinyuanLu00 / SciTab
The project page for "SCITAB: A Challenging Benchmark for Compositional Reasoning and Claim Verification on Scientific Tables"
☆22Updated last year
yanzhangnlp / BSL
Bootstrapped Unsupervised Sentence Representation Learning (ACL 2021)
☆30Updated 3 years ago
swarnaHub / ExplaGraphs
[EMNLP 2021] Dataset and PyTorch Code for ExplaGraphs: An Explanation Graph Generation Task for Structured Commonsense Reasoning
☆12Updated 2 years ago
prakharguptaz / Instructdial
Code for the paper Code for the paper InstructDial: Improving Zero and Few-shot Generalization in Dialogue through Instruction Tuning
☆100Updated 2 years ago
microsoft / REINA
☆117Updated 3 years ago
littlehacker26 / Discriminator-Cooperative-Unlikelihood-Prompt-Tuning
The code implementation of the EMNLP2022 paper: DisCup: Discriminator Cooperative Unlikelihood Prompt-tuning for Controllable Text Gene…
☆26Updated last year
jxhe / efficient-knnlm
Pytorch implementation of paper "Efficient Nearest Neighbor Language Models" (EMNLP 2021)
☆73Updated 3 years ago