sigmeta / distillation-BERTLinks
knowledge distillation on BERT
☆29Updated 5 years ago
Alternatives and similar repositories for distillation-BERT
Users that are interested in distillation-BERT are comparing it to the libraries listed below
Sorting:
- This project attempts to maintain the SOTA performance in machine translation☆108Updated 4 years ago
- ODSQA: OPEN-DOMAIN SPOKEN QUESTION ANSWERING DATASET☆62Updated 3 years ago
- The implementation of "Does Multi-Encoder Help? A Case Study on Context-AwareNeural Machine Translation"☆40Updated 4 years ago
- ☆39Updated 4 years ago
- A list of resources about Text Style Transfer☆42Updated 5 years ago
- Repository for the paper "Fast and Accurate Deep Bidirectional Language Representations for Unsupervised Learning"☆109Updated 4 years ago
- Use bert to predict punctuation on IWSLT2012 and The People's Daily 2014☆66Updated 5 years ago
- Jointly encoding word confusion networks (WCNs) and dialogue context with BERT for spoken language understanding (SLU).☆13Updated 2 years ago
- Unofficial implementation of "Non-Autoregressive Machine Translation with Latent Alignments" https://arxiv.org/abs/2004.07437☆24Updated 5 years ago
- A large-scale cleaned Chinese chitchat corpus and Chinese dialogpt models☆34Updated 4 years ago
- Code for the paper "Multi-Task Learning for Domain-General Spoken Disfluency Detection in Dialogue Systems" (Igor Shalyminov, Arash Eshgh…☆24Updated 2 years ago
- ☆49Updated 3 years ago
- Coupling Distant Annotation and Adversarial Training for Cross-Domain Chinese Word Segmentation☆22Updated 4 years ago
- SOTA punctation restoration (for e.g. automatic speech recognition) deep learning model based on BERT pre-trained model☆181Updated 6 years ago
- ICLR2019, Multilingual Neural Machine Translation with Knowledge Distillation☆70Updated 4 years ago
- RNNs for Text Normalization☆39Updated 7 years ago
- The implementation of "Learning Deep Transformer Models for Machine Translation"☆116Updated 11 months ago
- Learning ASR-Robust Contextualized Embeddings for Spoken Language Understanding☆24Updated 2 years ago
- Code for "A Unified Model for Joint Chinese Word Segmentation and Dependency Parsing"☆38Updated 3 years ago
- Chinese GPT2: pre-training and fine-tuning framework for text generation☆187Updated 4 years ago
- BERT for joint intent classification and slot filling☆39Updated 5 years ago
- Code for NeurIPS2020 "Incorporating BERT into Parallel Sequence Decoding with Adapters"☆32Updated 2 years ago
- code of our EMNLP-19 Paper, CM-Net: A Novel Collaborative Memory Network for Spoken Language Understanding☆27Updated 5 years ago
- BERT-DST: Scalable End-to-End Dialogue State Tracking with Bidirectional Encoder Representations from Transformer☆101Updated 4 years ago
- Code for ACL2020 "Jointly Masked Sequence-to-Sequence Model for Non-Autoregressive Neural Machine Translation"☆39Updated 5 years ago
- Code for Synchronous Bidirectional Neural Machine Translation (SB-NMT)☆66Updated 6 years ago
- ☆167Updated 3 years ago
- Open source code for EMNLP-19 Paper "A Stack-Propagation Framework with Token-Level Intent Detection for Spoken Language Understanding".☆147Updated 4 years ago
- Code for ACL 2020 paper "Rigid Formats Controlled Text Generation":https://www.aclweb.org/anthology/2020.acl-main.68/