sigmeta / distillation-BERT
knowledge distillation on BERT
☆30Updated 4 years ago
Alternatives and similar repositories for distillation-BERT:
Users that are interested in distillation-BERT are comparing it to the libraries listed below
- Use bert to predict punctuation on IWSLT2012 and The People's Daily 2014☆65Updated 4 years ago
- Coupling Distant Annotation and Adversarial Training for Cross-Domain Chinese Word Segmentation☆22Updated 4 years ago
- Open source code for Paper "A Co-Interactive Transformer for Joint Slot Filling and Intent Detection"☆76Updated 3 years ago
- Learning ASR-Robust Contextualized Embeddings for Spoken Language Understanding☆24Updated 2 years ago
- code of our EMNLP-19 Paper, CM-Net: A Novel Collaborative Memory Network for Spoken Language Understanding☆27Updated 5 years ago
- A large-scale cleaned Chinese chitchat corpus and Chinese dialogpt models☆34Updated 4 years ago
- This project attempts to maintain the SOTA performance in machine translation☆108Updated 4 years ago
- ☆12Updated 6 years ago
- ☆37Updated 4 years ago
- Code for the paper "Multi-Task Learning for Domain-General Spoken Disfluency Detection in Dialogue Systems" (Igor Shalyminov, Arash Eshgh…☆24Updated 2 years ago
- ICLR2019, Multilingual Neural Machine Translation with Knowledge Distillation☆70Updated 4 years ago
- ☆48Updated 2 years ago
- ☆46Updated 5 months ago
- The code of ACL 2020 paper "Multi-Domain Dialogue Acts and Response Co-Generation"☆33Updated 4 years ago
- Jointly encoding word confusion networks (WCNs) and dialogue context with BERT for spoken language understanding (SLU).☆13Updated last year
- BERT for joint intent classification and slot filling☆39Updated 5 years ago
- ODSQA: OPEN-DOMAIN SPOKEN QUESTION ANSWERING DATASET☆59Updated 2 years ago
- ☆31Updated 2 years ago
- kenlm语言模型,并提供python的rest服务☆29Updated 6 years ago
- The baseline model code for WMT 2021 Triangular MT☆13Updated 3 years ago
- This repository is for the paper "Confusionset-guided Pointer Networks for Chinese Spelling Check"☆58Updated 5 years ago
- Token-Level Supervised Contrastive Learning for Punctuation Restoration☆29Updated 3 years ago
- The dataset and the evaluation tool for NLPCC2018 Shared Task2--Grammatical Error Correction (GEC).☆55Updated 2 years ago
- ☆59Updated 5 years ago
- Code for ACL2020 "Jointly Masked Sequence-to-Sequence Model for Non-Autoregressive Neural Machine Translation"☆39Updated 4 years ago
- End-to-end Speech Translation☆36Updated 3 years ago
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators☆91Updated 3 years ago
- The implementation of "Learning Deep Transformer Models for Machine Translation"☆114Updated 6 months ago
- A pytorch implementation for the MMI-anti model☆33Updated 6 years ago
- BERT-DST: Scalable End-to-End Dialogue State Tracking with Bidirectional Encoder Representations from Transformer☆101Updated 3 years ago