sigmeta / distillation-BERT

knowledge distillation on BERT

☆29

Alternatives and similar repositories for distillation-BERT

Users who are interested in distillation-BERT are comparing it to the repositories listed below.

ZNLP / SOTA-MT
This project attempts to maintain the SOTA performance in machine translation
☆108 Updated 4 years ago
Chia-Hsuan-Lee / ODSQA
ODSQA: OPEN-DOMAIN SPOKEN QUESTION ANSWERING DATASET
☆62 Updated 3 years ago
libeineu / Context-Aware
The implementation of "Does Multi-Encoder Help? A Case Study on Context-Aware Neural Machine Translation"
☆40 Updated 4 years ago
scir-zywang / self-training-self-supervised-disfluency
☆39 Updated 4 years ago
yd1996 / awesome-text-style-transfer
A list of resources about Text Style Transfer
☆42 Updated 5 years ago
joongbo / tta
Repository for the paper "Fast and Accurate Deep Bidirectional Language Representations for Unsupervised Learning"
☆109 Updated 4 years ago
MenNianShi / PunctuationPrediction-BERT
Use BERT to predict punctuation on IWSLT2012 and The People's Daily 2014
☆66 Updated 5 years ago
simplc / WCN-BERT
Jointly encoding word confusion networks (WCNs) and dialogue context with BERT for spoken language understanding (SLU).
☆13 Updated 2 years ago
m3yrin / nar-latent-alignment
Unofficial implementation of "Non-Autoregressive Machine Translation with Latent Alignments" https://arxiv.org/abs/2004.07437
☆24 Updated 5 years ago
lemon234071 / GPT-Chinese
A large-scale cleaned Chinese chitchat corpus and Chinese DialoGPT models
☆34 Updated 4 years ago
ishalyminov / multitask_disfluency_detection
Code for the paper "Multi-Task Learning for Domain-General Spoken Disfluency Detection in Dialogue Systems" (Igor Shalyminov, Arash Eshgh…
☆24 Updated 2 years ago
sonos / spoken-language-understanding-research-datasets
☆49 Updated 3 years ago
Alibaba-NLP / DAAT-CWS
Coupling Distant Annotation and Adversarial Training for Cross-Domain Chinese Word Segmentation
☆22 Updated 4 years ago
nkrnrnk / BertPunc
SOTA punctuation restoration deep learning model (e.g. for automatic speech recognition), based on a pre-trained BERT model
☆181 Updated 6 years ago
RayeRen / multilingual-kd-pytorch
ICLR2019, Multilingual Neural Machine Translation with Knowledge Distillation
☆70 Updated 4 years ago
shauryr / google_text_normalization
RNNs for Text Normalization
☆39 Updated 7 years ago
wangqiangneu / dlcl
The implementation of "Learning Deep Transformer Models for Machine Translation"
☆116 Updated 11 months ago
MiuLab / SpokenVec
Learning ASR-Robust Contextualized Embeddings for Spoken Language Understanding
☆24 Updated 2 years ago
fastnlp / JointCwsParser
Code for "A Unified Model for Joint Chinese Word Segmentation and Dependency Parsing"
☆38 Updated 3 years ago
lipiji / Guyu
Chinese GPT2: pre-training and fine-tuning framework for text generation
☆187 Updated 4 years ago
90217 / joint-intent-classification-and-slot-filling-based-on-BERT
BERT for joint intent classification and slot filling
☆39 Updated 5 years ago
lemmonation / abnet
Code for NeurIPS2020 "Incorporating BERT into Parallel Sequence Decoding with Adapters"
☆32 Updated 2 years ago
Adaxry / CM-Net
Code for our EMNLP-19 paper, "CM-Net: A Novel Collaborative Memory Network for Spoken Language Understanding"
☆27 Updated 5 years ago
guanlinchao / bert-dst
BERT-DST: Scalable End-to-End Dialogue State Tracking with Bidirectional Encoder Representations from Transformer
☆101 Updated 4 years ago
lemmonation / jm-nat
Code for ACL2020 "Jointly Masked Sequence-to-Sequence Model for Non-Autoregressive Neural Machine Translation"
☆39 Updated 5 years ago
wszlong / sb-nmt
Code for Synchronous Bidirectional Neural Machine Translation (SB-NMT)
☆66 Updated 6 years ago
linzehui / mRASP
☆167 Updated 3 years ago
LeePleased / StackPropagation-SLU
Open source code for EMNLP-19 Paper "A Stack-Propagation Framework with Token-Level Intent Detection for Spoken Language Understanding".
☆147 Updated 4 years ago
lipiji / SongNet
Code for ACL 2020 paper "Rigid Formats Controlled Text Generation": https://www.aclweb.org/anthology/2020.acl-main.68/
☆236 Updated 4 years ago
SimulTrans-demo / STACL
Code for STACL (STACL: Simultaneous Translation with Integrated Anticipation and Controllable Latency)
☆8 Updated 5 years ago