sigmeta / distillation-BERT
knowledge distillation on BERT
☆29Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for distillation-BERT
- End-to-end Speech Translation☆36Updated 3 years ago
- Code for the paper "Multi-Task Learning for Domain-General Spoken Disfluency Detection in Dialogue Systems" (Igor Shalyminov, Arash Eshgh…☆24Updated last year
- This repository is for the paper Incorporating External POS Tagger for Punctuation Restoration. Proc. Interspeech 2021, 1987-1991, doi: 1…☆11Updated last year
- Use bert to predict punctuation on IWSLT2012 and The People's Daily 2014☆65Updated 4 years ago
- RNNs for Text Normalization☆38Updated 6 years ago
- ☆24Updated 4 years ago
- BERT for joint intent classification and slot filling☆39Updated 5 years ago
- A simple n-gram language model.☆10Updated 6 years ago
- ODSQA: OPEN-DOMAIN SPOKEN QUESTION ANSWERING DATASET☆58Updated 2 years ago
- code of our EMNLP-19 Paper, CM-Net: A Novel Collaborative Memory Network for Spoken Language Understanding☆28Updated 5 years ago
- The baseline model code for WMT 2021 Triangular MT☆13Updated 3 years ago
- Token-Level Supervised Contrastive Learning for Punctuation Restoration☆30Updated 3 years ago
- Learning ASR-Robust Contextualized Embeddings for Spoken Language Understanding☆24Updated last year
- BERT-DST: Scalable End-to-End Dialogue State Tracking with Bidirectional Encoder Representations from Transformer☆101Updated 3 years ago
- A large-scale cleaned Chinese chitchat corpus and Chinese dialogpt models☆34Updated 4 years ago
- The implementation of "Does Multi-Encoder Help? A Case Study on Context-AwareNeural Machine Translation"☆40Updated 4 years ago
- ☆48Updated 2 years ago
- This project attempts to maintain the SOTA performance in machine translation☆108Updated 4 years ago
- This is an implementation of paper "End-to-end Speech Translation via Cross-modal Progressive Training" (Interspeech2021)☆19Updated 2 years ago
- Jointly encoding word confusion networks (WCNs) and dialogue context with BERT for spoken language understanding (SLU).☆13Updated last year
- ☆46Updated 2 months ago
- Pytorch implementation of CoCon: A Self-Supervised Approach for Controlled Text Generation☆94Updated 3 years ago
- DSTC9 Multi-Domain Task-Oriented Dialog Challenge II☆34Updated 3 years ago
- Implementation for paper "A Knowledge-Enhanced Pretraining Model for Commonsense Story Generation"☆24Updated 4 years ago
- ICLR2019, Multilingual Neural Machine Translation with Knowledge Distillation☆70Updated 4 years ago
- The code of ACL 2020 paper "Multi-Domain Dialogue Acts and Response Co-Generation"☆33Updated 4 years ago
- Open source code for Paper "A Co-Interactive Transformer for Joint Slot Filling and Intent Detection"☆75Updated 2 years ago
- Coupling Distant Annotation and Adversarial Training for Cross-Domain Chinese Word Segmentation☆22Updated 4 years ago