cybertronai / Megatron-LM
Ongoing research training transformer language models at scale, including: BERT
☆16Updated 6 years ago
Alternatives and similar repositories for Megatron-LM
Users who are interested in Megatron-LM compare it to the libraries listed below.
- A BART version of an open-domain QA model in a closed-book setup☆119Updated 5 years ago
- NAACL'19: "Jointly Optimizing Diversity and Relevance in Neural Response Generation"☆73Updated 5 years ago
- Cross-lingual GLUE☆49Updated 2 years ago
- Temporal Commonsense Reasoning in Dialog☆72Updated 4 years ago
- Few-shot NLP benchmark for unified, rigorous eval☆93Updated 3 years ago
- ☆48Updated 5 years ago
- On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines☆138Updated 2 years ago
- Code from the paper "What do Models Learn from Question Answering Datasets?" (EMNLP 2020)☆54Updated 5 years ago
- Assessing syntactic abilities of BERT☆40Updated 6 years ago
- QED: A Framework and Dataset for Explanations in Question Answering☆119Updated 4 years ago
- Code for the paper "True Few-Shot Learning in Language Models" (https://arxiv.org/abs/2105.11447)☆144Updated 4 years ago
- Data & Code for ACCENTOR: "Adding Chit-Chat to Enhance Task-Oriented Dialogues" (NAACL 2021)☆72Updated 4 years ago
- Code for the CIKM 2019 Paper: How Does BERT Answer Questions? A Layer-Wise Analysis of Transformer Representations☆32Updated 2 years ago
- Useful python NLP tools (evaluation, GUI interface, tokenization)☆45Updated 5 years ago
- The implementation of "Neural Machine Translation without Embeddings", NAACL 2021☆33Updated 4 years ago
- Code and pre-trained models for "ReasonBert: Pre-trained to Reason with Distant Supervision", EMNLP'2021☆29Updated 3 years ago
- An original implementation of EMNLP 2020, "AmbigQA: Answering Ambiguous Open-domain Questions"☆120Updated 3 years ago
- Training T5 to perform numerical reasoning.☆23Updated 4 years ago
- A repository for our AAAI-2020 Cross-lingual-NER paper. Code will be updated shortly.☆47Updated 3 years ago
- ☆40Updated 5 years ago
- Helper scripts and notes that were used while porting various nlp models☆49Updated 3 years ago
- ☆132Updated 2 years ago
- Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper☆51Updated 2 years ago
- Code & data for EMNLP 2020 paper "MOCHA: A Dataset for Training and Evaluating Reading Comprehension Metrics".☆16Updated 3 years ago
- This repository maintains the QAConv dataset, a question-answering dataset on informative conversations including business emails, panel …☆84Updated last year
- A question-answering dataset with a focus on subjective information☆48Updated 2 years ago
- ☆97Updated 3 years ago
- AIS is an evaluation framework for assessing whether the output of natural language models only contains information about the external w…☆31Updated 3 years ago
- PyTorch original implementation of "Unsupervised Question Decomposition for Question Answering"☆122Updated 2 years ago
- The implementation of the papers on dual learning of natural language understanding and generation. (ACL2019,2020; Findings of EMNLP 2020…☆67Updated 5 years ago