matthew-cavener / my-bert-is-too-big
Doing Knowledge Distillation on BERT because the inference time is too damn high!
☆9Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for my-bert-is-too-big
- source code of bison☆26Updated 4 years ago
- NoiseMix - data generation for natural language☆41Updated 6 years ago
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and …☆49Updated 4 years ago
- ☆32Updated 5 years ago
- Backtranslations of IMDB movie reviews for Data Augmentation Purposes☆11Updated 5 years ago
- NAACL'19: "Jointly Optimizing Diversity and Relevance in Neural Response Generation"☆74Updated 4 years ago
- ☆34Updated 5 years ago
- ☆22Updated 3 years ago
- Fork of huggingface/pytorch-pretrained-BERT for BERT on STILTs☆106Updated last year
- Official implementation of the models proposed in paper "Improving Neural Response Diversity with Frequency-Aware Cross-Entropy Loss"☆19Updated 5 years ago
- Fine-tuned Transformers compatible BERT models for Sequence Tagging☆40Updated 4 years ago
- Assessing syntactic abilities of BERT☆150Updated 5 years ago
- Code for ACL '19 paper: Towards Improving Neural Named Entity Recognition with Gazetteers☆32Updated 3 years ago
- Frame-Semantic and PropBank Semantic Role Labeling with Syntactic Scaffolding.☆50Updated 3 years ago
- Uncovering divergent linguistic information in word embeddings with lessons for intrinsic and extrinsic evaluation☆63Updated 6 years ago
- Boolean Question Answering with multi-task learning and uses large LM embeddings like BERT, RoBERTa☆18Updated 5 years ago
- Multi-level tagger☆23Updated 6 years ago
- Code for bidirectional sequence generation (BiSon) for generating from BERT pre-trained models.☆52Updated 4 years ago
- Repository for the IWCS 2017 paper "Representation Learning for Answer Selection with LSTM-Based Importance Weighting"☆28Updated 6 years ago
- ☆42Updated 5 years ago
- ☆41Updated 5 years ago
- ☆81Updated 4 years ago
- pair2vec: Compositional Word-Pair Embeddings for Cross-Sentence Inference☆61Updated last year
- Tools for training pytorch language models☆27Updated 3 years ago
- A framework for training and evaluating AI models on a variety of openly available dialogue datasets.☆36Updated 4 years ago
- ☆47Updated 4 years ago
- Joint Extraction & Compression text Summarization☆41Updated 5 years ago
- A curated question answering research dataset of factoid questions☆49Updated 5 years ago
- [NAACL 2019] code for "Pragmatically Informative Text Generation" https://arxiv.org/abs/1904.01301☆47Updated 4 years ago