sigmeta / distillation-BERT
knowledge distillation on BERT
☆30 Updated 5 years ago
Alternatives and similar repositories for distillation-BERT:
Users interested in distillation-BERT are comparing it to the repositories listed below.
- Coupling Distant Annotation and Adversarial Training for Cross-Domain Chinese Word Segmentation ☆22 Updated 4 years ago
- BERT for joint intent classification and slot filling ☆39 Updated 5 years ago
- Unofficial implementation of "Non-Autoregressive Machine Translation with Latent Alignments" https://arxiv.org/abs/2004.07437 ☆24 Updated 4 years ago
- ICLR2019, Multilingual Neural Machine Translation with Knowledge Distillation ☆70 Updated 4 years ago
- ODSQA: OPEN-DOMAIN SPOKEN QUESTION ANSWERING DATASET ☆62 Updated 3 years ago
- This project attempts to maintain the SOTA performance in machine translation ☆108 Updated 4 years ago
- Use BERT to predict punctuation on IWSLT2012 and The People's Daily 2014 ☆66 Updated 4 years ago
- Code for our EMNLP-19 paper, CM-Net: A Novel Collaborative Memory Network for Spoken Language Understanding ☆27 Updated 5 years ago
- Code for the paper "Multi-Task Learning for Domain-General Spoken Disfluency Detection in Dialogue Systems" (Igor Shalyminov, Arash Eshgh… ☆24 Updated 2 years ago
- Code for Synchronous Bidirectional Neural Machine Translation (SB-NMT) ☆66 Updated 5 years ago
- RNNs for Text Normalization ☆39 Updated 7 years ago
- Learning ASR-Robust Contextualized Embeddings for Spoken Language Understanding ☆24 Updated 2 years ago
- Code for NeurIPS2020 "Incorporating BERT into Parallel Sequence Decoding with Adapters" ☆32 Updated 2 years ago
- End-to-end Speech Translation ☆36 Updated 4 years ago
- Some good (maybe) papers about NMT (Neural Machine Translation). ☆84 Updated 5 years ago
- Jointly encoding word confusion networks (WCNs) and dialogue context with BERT for spoken language understanding (SLU). ☆13 Updated last year
- The baseline model code for WMT 2021 Triangular MT ☆13 Updated 4 years ago
- BERT-DST: Scalable End-to-End Dialogue State Tracking with Bidirectional Encoder Representations from Transformer ☆101 Updated 4 years ago
- An "end-to-end trainable task-oriented dialogue model" implementation. ☆37 Updated 2 years ago
- Code for ACL2020 "Jointly Masked Sequence-to-Sequence Model for Non-Autoregressive Neural Machine Translation" ☆39 Updated 4 years ago
- Open-source code for the paper "A Co-Interactive Transformer for Joint Slot Filling and Intent Detection" ☆77 Updated 3 years ago
- datasets for NLP research ☆24 Updated 3 years ago
- The implementation of "Learning Deep Transformer Models for Machine Translation" ☆114 Updated 9 months ago
- A list of resources about Text Style Transfer ☆42 Updated 4 years ago