laiguokun / bert-cloth
☆40Updated 4 years ago
Alternatives and similar repositories for bert-cloth:
Users that are interested in bert-cloth are comparing it to the libraries listed below
- ☆78Updated 2 years ago
- Source code of paper "BP-Transformer: Modelling Long-Range Context via Binary Partitioning"☆128Updated 3 years ago
- Code for the RecAdam paper: Recall and Learn: Fine-tuning Deep Pretrained Language Models with Less Forgetting.☆115Updated 4 years ago
- ☆69Updated 4 years ago
- The source code for ACL 2020 paper Exclusive Hierarchical Decoding for Deep Keyphrase Generation☆55Updated 2 years ago
- Pretrain CPM-1☆51Updated 3 years ago
- Code for ACL 2019 paper: "Searching for Effective Neural Extractive Summarization: What Works and What's Next"☆90Updated 3 years ago
- A PyTorch implementation of Google AI's BERT model provided with Google's pre-trained models, examples and utilities.☆71Updated 2 years ago
- Transformer with Untied Positional Encoding (TUPE). Code of paper "Rethinking Positional Encoding in Language Pre-training". Improve exis…☆250Updated 3 years ago
- Differentiable Product Quantization for End-to-End Embedding Compression.☆60Updated 2 years ago
- Code and Models for paper "AutoSeM: Automatic Task Selection and Mixing in Multi-Task Learning. Han Guo, Ramakanth Pasunuru, and Mohit Ba…☆24Updated 5 years ago
- End-To-End Memory Networks in PyTorch☆38Updated 7 years ago
- ☆33Updated 4 years ago
- A list of recent papers on knowledge-based machine reading comprehension.☆26Updated 4 years ago
- Deep learning images developed from nvidia/cuda-cudnn-devel-ubuntu.☆23Updated 2 years ago
- Code for the paper "Are Sixteen Heads Really Better than One?"☆171Updated 4 years ago
- The source code of our ACL2019 paper "Incremental Transformer with Deliberation Decoder for Document Grounded Conversations "☆86Updated 5 years ago
- Source code for "Efficient Training of BERT by Progressively Stacking"☆112Updated 5 years ago
- Notes of my introduction about NLP in Fudan University☆37Updated 3 years ago
- Re-implement "QANet: Combining Local Convolution with Global Self-Attention for Reading Comprehension"☆120Updated 6 years ago
- ☆50Updated last year
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators☆91Updated 3 years ago
- "Cross-lingual Distillation for Text Classification" 55th annual meeting of the Association for Computational Linguistics (ACL 2017)☆18Updated 7 years ago
- Non-Monotonic Sequential Text Generation (ICML 2019)☆72Updated 5 years ago
- ☆38Updated 5 years ago
- Conversational Toolkit. An Open-Source Toolkit for Fast Development and Fair Evaluation of Text Generation☆127Updated 4 years ago
- Visualization for simple attention and Google's multi-head attention.☆68Updated 7 years ago
- Official implementation of "User Attention-guided Multimodal Dialog Systems"☆52Updated 4 years ago
- bert annotation, input and output for people from scratch, 代码注释, 有每一步的输入和输出, 适合初学者☆93Updated 2 years ago
- ☆83Updated 5 years ago