matthew-cavener / my-bert-is-too-big
Doing Knowledge Distillation on BERT because the inference time is too damn high!
☆9Updated 5 years ago
Alternatives and similar repositories for my-bert-is-too-big:
Users that are interested in my-bert-is-too-big are comparing it to the libraries listed below
- ☆22Updated 3 years ago
- Hyperparameter search for AllenNLP - powered by Ray TUNE☆28Updated last month
- Backtranslations of IMDB movie reviews for Data Augmentation Purposes☆11Updated 6 years ago
- ☆20Updated 5 years ago
- source code of bison☆26Updated 4 years ago
- Tools for training pytorch language models☆27Updated 4 years ago
- NoiseMix - data generation for natural language☆40Updated 6 years ago
- Code for our ACL '20 paper "Representation Engineering with Natural Language Explanations"☆29Updated 4 years ago
- We summarize the summarization papers presented at major conferences (starting with ACL 2019)☆85Updated 5 years ago
- Stacked Denoising BERT for Noisy Text Classification (Neural Networks 2020)☆32Updated 2 years ago
- Code for ACL '19 paper: Towards Improving Neural Named Entity Recognition with Gazetteers☆32Updated 3 years ago
- Code and data for the paper "Soft Gazetteers for Low-resource Named Entity Recognition"☆19Updated 4 years ago
- ☆83Updated 5 years ago
- A framework for training and evaluating AI models on a variety of openly available dialogue datasets.☆36Updated 4 years ago
- ☆33Updated 6 years ago
- ☆35Updated 3 years ago
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and …☆51Updated 4 months ago
- The Referential Reader: A Recurrent Entity Network for Anaphora Resolution, published at ACL 2019☆19Updated 5 years ago
- Boolean Question Answering with multi-task learning and uses large LM embeddings like BERT, RoBERTa☆18Updated 5 years ago
- Massively Multilingual Transfer for NER☆86Updated 3 years ago
- Official implementation of the models proposed in paper "Improving Neural Response Diversity with Frequency-Aware Cross-Entropy Loss"☆19Updated 5 years ago
- ☆31Updated 4 years ago
- Phrase-Indexed Question Answering (PIQA)☆94Updated 5 years ago
- Fork of huggingface/pytorch-pretrained-BERT for BERT on STILTs☆107Updated 2 years ago
- ☆46Updated 5 years ago
- [NAACL 2019] code for "Pragmatically Informative Text Generation" https://arxiv.org/abs/1904.01301☆47Updated 5 years ago
- ☆33Updated 5 years ago
- Code for bidirectional sequence generation (BiSon) for generating from BERT pre-trained models.☆51Updated 5 years ago
- Neural (LSTM) version of the partial CRF model☆35Updated 5 years ago
- We are creating a challenging new benchmark MultiReQA: A Cross-Domain Evaluation for Retrieval Question Answering Models. Retrieval quest…☆31Updated 4 years ago