sigmeta / distillation-BERT
knowledge distillation on BERT
☆30Updated 5 years ago
Alternatives and similar repositories for distillation-BERT:
Users that are interested in distillation-BERT are comparing it to the libraries listed below
- Coupling Distant Annotation and Adversarial Training for Cross-Domain Chinese Word Segmentation☆22Updated 4 years ago
- ☆38Updated 4 years ago
- ODSQA: OPEN-DOMAIN SPOKEN QUESTION ANSWERING DATASET☆61Updated 3 years ago
- ☆12Updated 6 years ago
- The baseline model code for WMT 2021 Triangular MT☆13Updated 4 years ago
- Code for the paper "Multi-Task Learning for Domain-General Spoken Disfluency Detection in Dialogue Systems" (Igor Shalyminov, Arash Eshgh…☆24Updated 2 years ago
- code of our EMNLP-19 Paper, CM-Net: A Novel Collaborative Memory Network for Spoken Language Understanding☆27Updated 5 years ago
- This project attempts to maintain the SOTA performance in machine translation☆108Updated 4 years ago
- A large-scale cleaned Chinese chitchat corpus and Chinese dialogpt models☆34Updated 4 years ago
- kenlm语言模型,并提供python的rest服务☆29Updated 6 years ago
- ☆49Updated 3 years ago
- BERT for joint intent classification and slot filling☆39Updated 5 years ago
- Latest research advances on semantic slot filling.☆25Updated 2 years ago
- Use bert to predict punctuation on IWSLT2012 and The People's Daily 2014☆66Updated 4 years ago
- Code for "A Unified Model for Joint Chinese Word Segmentation and Dependency Parsing"☆38Updated 2 years ago
- soft_mask_bert model for Chinese Spelling Correction in keras☆21Updated 4 years ago
- Learning ASR-Robust Contextualized Embeddings for Spoken Language Understanding☆24Updated 2 years ago
- BERT-DST: Scalable End-to-End Dialogue State Tracking with Bidirectional Encoder Representations from Transformer☆101Updated 4 years ago
- ☆46Updated 7 months ago
- Some good(maybe) papers about NMT (Neural Machine Translation).☆84Updated 5 years ago
- Code for NeurIPS2020 "Incorporating BERT into Parallel Sequence Decoding with Adapters"☆32Updated 2 years ago
- The dataset and the evaluation tool for NLPCC2018 Shared Task2--Grammatical Error Correction (GEC).☆55Updated 3 years ago
- Open source code for Paper "A Co-Interactive Transformer for Joint Slot Filling and Intent Detection"☆77Updated 3 years ago
- RNNs for Text Normalization☆38Updated 7 years ago
- A Bert-CNN-LSTM model for punctuation restoration☆57Updated last year
- Jointly encoding word confusion networks (WCNs) and dialogue context with BERT for spoken language understanding (SLU).☆13Updated last year
- Code and data for the NAACL 2019 paper "Improving Cross-Domain Chinese Word Segmentation with Word Embeddings"☆10Updated 5 years ago
- ☆46Updated 3 years ago
- End-to-end Speech Translation☆36Updated 4 years ago
- Token-Level Supervised Contrastive Learning for Punctuation Restoration☆29Updated 3 years ago