ZhengZixiang / ATPapersLinks

Worth-reading papers and related resources on attention mechanism, Transformer and pretrained language model (PLM) such as BERT. 值得一读的注意力机制、Transformer和预训练语言模型论文与相关资源集合

☆131

Alternatives and similar repositories for ATPapers

Users that are interested in ATPapers are comparing it to the libraries listed below

Sorting:

eaglenlp / Text-Generation
☆93Updated 5 years ago
lonePatient / BERT-SDA
A PyTorch implementation of "Improving BERT Fine-Tuning via Self-Ensemble and Self-Distillation"
☆56Updated 5 years ago
xcfcode / What-I-Have-Read
Paper Lists, Notes and Slides, Focus on NLP. For summarization, please refer to https://github.com/xcfcode/Summarization-Papers
☆165Updated 3 years ago
ShannonAI / Neural-Semi-Supervised-Learning-for-Text-Classification
Semi-supervised Learning for Sentiment Analysis
☆54Updated 4 years ago
Sanyuan-Chen / RecAdam
Code for the RecAdam paper: Recall and Learn: Fine-tuning Deep Pretrained Language Models with Less Forgetting.
☆118Updated 4 years ago
eaglenlp / Text-Matching
☆24Updated 5 years ago
FreedomIntelligence / complex-order
☆83Updated 5 years ago
intersun / PKD-for-BERT-Model-Compression
pytorch implementation for Patient Knowledge Distillation for BERT Model Compression
☆203Updated 5 years ago
lxk00 / BERT-EMD
☆50Updated 2 years ago
JetRunner / BERT-of-Theseus
⛵️The official PyTorch implementation for "BERT-of-Theseus: Compressing BERT by Progressive Module Replacing" (EMNLP 2020).
☆313Updated 2 years ago
ha-lins / MetaLearning4NLP-Papers
A list of recent papers about Meta / few-shot learning methods applied in NLP areas.
☆231Updated 4 years ago
asappresearch / revisit-bert-finetuning
For the code release of our arXiv paper "Revisiting Few-sample BERT Fine-tuning" (https://arxiv.org/abs/2006.05987).
☆184Updated 2 years ago
jcyk / BERT
a simple yet complete implementation of the popular BERT model
☆127Updated 5 years ago
MC-BERT / MC-BERT
☆96Updated 5 years ago
jinfengr / hcan
EMNLP'19: Bridging the Gap between Relevance Matching and Semantic Matching for Short Text Similarity Modeling
☆77Updated 2 years ago
joongbo / tta
Repository for the paper "Fast and Accurate Deep Bidirectional Language Representations for Unsupervised Learning"
☆109Updated 4 years ago
lonePatient / electra_pytorch
ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators
☆91Updated 3 years ago
xanhho / Reading-Comprehension-Question-Answering-Papers
Survey on Machine Reading Comprehension
☆148Updated 4 years ago
kepei1106 / ARAML
Codes for our paper at EMNLP2019
☆36Updated 5 years ago
shuohangwang / Cross-Thought
☆47Updated 4 years ago
sfzhou5678 / PretrainedLittleBERTs
24*2个预训练的小型BERT模型，NLPer炼丹利器
☆50Updated 5 years ago
zxlzr / MTM
MTM
☆142Updated 2 years ago
zhuchen03 / FreeLB
Adversarial Training for Natural Language Understanding
☆251Updated last year
llamazing / numnet_plus
This is the official code repository for NumNet+(https://leaderboard.allenai.org/drop/submission/blu418v76glsbnh1qvd0)
☆177Updated last year
StonyBrookNLP / deformer
[ACL 2020] DeFormer: Decomposing Pre-trained Transformers for Faster Question Answering
☆120Updated 2 years ago
Gorov / DiverseFewShot_Amazon
☆121Updated 6 years ago
bojone / keras_adversarial_training
Adversarial Training for NLP in Keras
☆46Updated 5 years ago
WangJiuniu / adversarial_training
Pytorch implementation of the methods proposed in **Adversarial Training Methods for Semi-Supervised Text Classification** on IMDB datase…
☆42Updated 6 years ago
52paper / 52paper.github.io
☆75Updated 2 years ago
bernhard2202 / rankqa
This is the PyTorch implementation of the ACL 2019 paper RankQA: Neural Question Answering with Answer Re-Ranking.
☆83Updated 3 years ago