cybertronai / Megatron-LMLinks

Ongoing research training transformer language models at scale, including: BERT

☆15

Alternatives and similar repositories for Megatron-LM

Users that are interested in Megatron-LM are comparing it to the libraries listed below

Sorting:

MiuLab / DuaLUG
The implementation of the papers on dual learning of natural language understanding and generation. (ACL2019,2020; Findings of EMNLP 2020…
☆66Updated 4 years ago
facebookresearch / QA-Overlap
Code to support the paper "Question and Answer Test-Train Overlap in Open-Domain Question Answering Datasets"
☆66Updated 3 years ago
facebookresearch / accentor
Data & Code for ACCENTOR: "Adding Chit-Chat to Enhance Task-Oriented Dialogues" (NAACL 2021)
☆71Updated 3 years ago
allenai / sledgehammer
☆47Updated 4 years ago
bvanaken / explain-BERT-QA
Code for the CIKM 2019 Paper: How Does BERT Answer Questions? A Layer-Wise Analysis of Transformer Representations
☆32Updated last year
golsun / SpaceFusion
NAACL'19: "Jointly Optimizing Diversity and Relevance in Neural Response Generation"
☆74Updated 4 years ago
efficientqa / nq-open
☆31Updated 4 years ago
UriSha / EmbeddinglessNMT
The implementation of "Neural Machine Translation without Embeddings", NAACL 2021
☆33Updated 3 years ago
AkariAsai / XORQA
This is the official repository for NAACL 2021, "XOR QA: Cross-lingual Open-Retrieval Question Answering".
☆79Updated 4 years ago
shmsw25 / bart-closed-book-qa
A BART version of an open-domain QA model in a closed-book setup
☆119Updated 4 years ago
allenai / allentune
Hyperparameter Search for AllenNLP
☆139Updated 2 months ago
jwieting / paraphrastic-representations-at-scale
☆76Updated 3 years ago
jwieting / beyond-bleu
Python code for training models in the ACL paper, "Beyond BLEU:Training Neural Machine Translation with Semantic Similarity".
☆52Updated 5 years ago
naver / gdc
Code accompanying our papers on the "Generative Distributional Control" framework
☆118Updated 2 years ago
facebookresearch / quip
Official repository for the paper "Question Answering Infused Pre-training of General-Purpose Contextualized Representations" by Robin Ji…
☆15Updated 3 years ago
neulab / REALSumm
REALSumm: Re-evaluating Evaluation in Text Summarization
☆71Updated 2 years ago
facebookresearch / reconsider
ReConsider is a re-ranking model that re-ranks the top-K (passage, answer-span) predictions of an Open-Domain QA Model like DPR (Karpukhi…
☆49Updated 4 years ago
AkariAsai / logic_guided_qa
The official implementation of ACL 2020, "Logic-Guided Data Augmentation and Regularization for Consistent Question Answering".
☆71Updated 10 months ago
thespectrewithin / joint_align
Cross-lingual Alignment vs Joint Training: A Comparative Study and A Simple Unified Framework
☆52Updated 5 years ago
AkariAsai / extractive_rc_by_runtime_mt
Code and datasets of "Multilingual Extractive Reading Comprehension by Runtime Machine Translation"
☆40Updated 6 years ago
sebastianruder / emnlp2021-multiqa-tutorial
EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering
☆38Updated 3 years ago
intersun / CoDIR
Code for EMNLP 2020 paper CoDIR
☆41Updated 2 years ago
alontalmor / oLMpics
☆46Updated 5 years ago
ofirpress / sandwich_transformer
This repository contains the code for running the character-level Sandwich Transformers from our ACL 2020 paper on Improving Transformer …
☆55Updated 4 years ago
cambridgeltl / parameter-factorization
Factorization of the neural parameter space for zero-shot multi-lingual and multi-task transfer
☆39Updated 4 years ago
Websail-NU / CODAH
Repository for the CODAH dataset
☆22Updated 2 years ago
xwhan / ProQA
Progressively Pretrained Dense Corpus Index for Open-Domain QA and Information Retrieval
☆43Updated last year
sobamchan / pytorch-lightning-transformers
Fine-tune transformers with pytorch-lightning
☆44Updated 3 years ago
adapter-hub / hgiyt
Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"
☆27Updated 3 years ago
nng555 / ssmba
☆63Updated 3 years ago