cybertronai / Megatron-LMLinks
Ongoing research training transformer language models at scale, including: BERT
☆15Updated 6 years ago
Alternatives and similar repositories for Megatron-LM
Users that are interested in Megatron-LM are comparing it to the libraries listed below
Sorting:
- The implementation of the papers on dual learning of natural language understanding and generation. (ACL2019,2020; Findings of EMNLP 2020…☆66Updated 4 years ago
- Code to support the paper "Question and Answer Test-Train Overlap in Open-Domain Question Answering Datasets"☆66Updated 3 years ago
- Data & Code for ACCENTOR: "Adding Chit-Chat to Enhance Task-Oriented Dialogues" (NAACL 2021)☆71Updated 3 years ago
- ☆47Updated 4 years ago
- Code for the CIKM 2019 Paper: How Does BERT Answer Questions? A Layer-Wise Analysis of Transformer Representations☆32Updated last year
- NAACL'19: "Jointly Optimizing Diversity and Relevance in Neural Response Generation"☆74Updated 4 years ago
- ☆31Updated 4 years ago
- The implementation of "Neural Machine Translation without Embeddings", NAACL 2021☆33Updated 3 years ago
- This is the official repository for NAACL 2021, "XOR QA: Cross-lingual Open-Retrieval Question Answering".☆79Updated 4 years ago
- A BART version of an open-domain QA model in a closed-book setup☆119Updated 4 years ago
- Hyperparameter Search for AllenNLP☆139Updated 2 months ago
- ☆76Updated 3 years ago
- Python code for training models in the ACL paper, "Beyond BLEU:Training Neural Machine Translation with Semantic Similarity".☆52Updated 5 years ago
- Code accompanying our papers on the "Generative Distributional Control" framework☆118Updated 2 years ago
- Official repository for the paper "Question Answering Infused Pre-training of General-Purpose Contextualized Representations" by Robin Ji…☆15Updated 3 years ago
- REALSumm: Re-evaluating Evaluation in Text Summarization☆71Updated 2 years ago
- ReConsider is a re-ranking model that re-ranks the top-K (passage, answer-span) predictions of an Open-Domain QA Model like DPR (Karpukhi…☆49Updated 4 years ago
- The official implementation of ACL 2020, "Logic-Guided Data Augmentation and Regularization for Consistent Question Answering".☆71Updated 10 months ago
- Cross-lingual Alignment vs Joint Training: A Comparative Study and A Simple Unified Framework☆52Updated 5 years ago
- Code and datasets of "Multilingual Extractive Reading Comprehension by Runtime Machine Translation"☆40Updated 6 years ago
- EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering☆38Updated 3 years ago
- Code for EMNLP 2020 paper CoDIR☆41Updated 2 years ago
- ☆46Updated 5 years ago
- This repository contains the code for running the character-level Sandwich Transformers from our ACL 2020 paper on Improving Transformer …☆55Updated 4 years ago
- Factorization of the neural parameter space for zero-shot multi-lingual and multi-task transfer☆39Updated 4 years ago
- Repository for the CODAH dataset☆22Updated 2 years ago
- Progressively Pretrained Dense Corpus Index for Open-Domain QA and Information Retrieval☆43Updated last year
- Fine-tune transformers with pytorch-lightning☆44Updated 3 years ago
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"☆27Updated 3 years ago
- ☆63Updated 3 years ago