lonePatient / electra_pytorch
ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators
☆91Updated 3 years ago
Alternatives and similar repositories for electra_pytorch:
Users who are interested in electra_pytorch are comparing it to the libraries listed below.
- Code for the RecAdam paper: Recall and Learn: Fine-tuning Deep Pretrained Language Models with Less Forgetting.☆115Updated 4 years ago
- ⛵️The official PyTorch implementation for "BERT-of-Theseus: Compressing BERT by Progressive Module Replacing" (EMNLP 2020).☆309Updated last year
- Code release for our arXiv paper "Revisiting Few-sample BERT Fine-tuning" (https://arxiv.org/abs/2006.05987).☆184Updated last year
- A PyTorch implementation of "Improving BERT Fine-Tuning via Self-Ensemble and Self-Distillation"☆56Updated 4 years ago
- PyTorch implementation of CoCon: A Self-Supervised Approach for Controlled Text Generation☆93Updated 3 years ago
- Some good (maybe) papers about NMT (Neural Machine Translation).☆84Updated 5 years ago
- [ACL 2020] DeFormer: Decomposing Pre-trained Transformers for Faster Question Answering☆120Updated last year
- [NeurIPS 2021] COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining☆118Updated last year
- ☆176Updated 3 years ago
- ☆78Updated 2 years ago
- CIKM 2020: Speaker-Aware BERT for Multi-Turn Response Selection in Retrieval-Based Chatbots☆74Updated 4 years ago
- Repository for the paper "Fast and Accurate Deep Bidirectional Language Representations for Unsupervised Learning"☆109Updated 4 years ago
- Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper☆133Updated last year
- ☆96Updated 4 years ago
- Sequential Latent Knowledge Selection for Knowledge-Grounded Dialogue. In ICLR, 2020 (spotlight)☆129Updated 4 years ago
- ☆93Updated 4 years ago
- This is the official code repository for NumNet+ (https://leaderboard.allenai.org/drop/submission/blu418v76glsbnh1qvd0)☆177Updated 7 months ago
- The official implementation of the EMNLP 2020 paper "A Simple and Effective Model for Answering Multi-span Questions".☆157Updated last year
- Code for the paper: GCDT: A Global Context Enhanced Deep Transition Architecture for Sequence Labeling☆68Updated 5 years ago
- Leaderboards, Datasets and Papers for Multi-Turn Response Selection in Retrieval-Based Chatbots☆203Updated 3 years ago
- Research code for ACL 2020 paper: "Distilling Knowledge Learned in BERT for Text Generation".☆131Updated 3 years ago
- PyTorch implementation of ALBERT (A Lite BERT for Self-supervised Learning of Language Representations)☆226Updated 3 years ago
- A simple yet complete implementation of the popular BERT model☆127Updated 4 years ago
- SpanNER: Named Entity Re-/Recognition as Span Prediction☆126Updated 2 years ago
- ☆62Updated 5 years ago
- ☆112Updated 3 years ago
- Multi-turn dialogue baselines written in PyTorch☆161Updated 4 years ago
- Resources for the MRQA 2019 Shared Task☆292Updated 3 years ago
- Implementation of Self-adjusting Dice Loss from "Dice Loss for Data-imbalanced NLP Tasks" paper☆107Updated 4 years ago
- ☆66Updated 2 years ago