laiguokun / bert-clothLinks
☆39Updated 5 years ago
Alternatives and similar repositories for bert-cloth
Users that are interested in bert-cloth are comparing it to the libraries listed below
Sorting:
- ☆78Updated 2 years ago
- Source code of paper "BP-Transformer: Modelling Long-Range Context via Binary Partitioning"☆128Updated 4 years ago
- Pretrain CPM-1☆51Updated 4 years ago
- Differentiable Product Quantization for End-to-End Embedding Compression.☆62Updated 2 years ago
- ☆50Updated last year
- Deep learning images developed from nvidia/cuda-cudnn-devel-ubuntu.☆23Updated 2 years ago
- Code for the RecAdam paper: Recall and Learn: Fine-tuning Deep Pretrained Language Models with Less Forgetting.☆117Updated 4 years ago
- Transformer with Untied Positional Encoding (TUPE). Code of paper "Rethinking Positional Encoding in Language Pre-training". Improve exis…☆251Updated 3 years ago
- ☆69Updated 4 years ago
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators☆91Updated 3 years ago
- For the code release of our arXiv paper "Revisiting Few-sample BERT Fine-tuning" (https://arxiv.org/abs/2006.05987).☆184Updated last year
- Visualization for simple attention and Google's multi-head attention.☆67Updated 7 years ago
- ☆83Updated 5 years ago
- Source code for "Efficient Training of BERT by Progressively Stacking"☆112Updated 5 years ago
- The source code for ACL 2020 paper Exclusive Hierarchical Decoding for Deep Keyphrase Generation☆55Updated 2 years ago
- ☆252Updated 2 years ago
- LiveBot: Generating Live Video Comments Based on Visual and Textual Contexts (AAAI 2019)☆122Updated 6 years ago
- A plug-in of Microsoft DeepSpeed to fix the bug of DeepSpeed pipeline☆26Updated 4 years ago
- ☆33Updated 5 years ago
- Re-implementation of BiDAF(Bidirectional Attention Flow for Machine Comprehension, Minjoon Seo et al., ICLR 2017) on PyTorch.☆245Updated 9 months ago
- Implementation of Neural Machine Translation by jointly learning to align and translate☆26Updated 7 years ago
- ☆17Updated 2 years ago
- Heterogeneous Graph Transformer for Graph-to-Sequence Learning☆48Updated 4 years ago
- Notes of my introduction about NLP in Fudan University☆37Updated 3 years ago
- A simple implementation of LeakGAN in PyTorch☆63Updated 3 years ago
- codes accompanying ACL 2019 paper Graph Neural Networks with Generated Parameters for Relation Extraction☆12Updated 5 years ago
- ICLR2019, Multilingual Neural Machine Translation with Knowledge Distillation☆70Updated 4 years ago
- Code and Models for paper "AutoSeM: Automatic Task Selection and Mixing in Multi-Task Learning. Han Guo, Ramakanth Pasunuru, and Mohit Ba…☆24Updated 6 years ago
- 📒Record some paper read notes☆20Updated 3 years ago
- ☆38Updated 5 years ago