matthew-cavener / my-bert-is-too-bigLinks

Doing Knowledge Distillation on BERT because the inference time is too damn high!

☆9

Alternatives and similar repositories for my-bert-is-too-big

Users that are interested in my-bert-is-too-big are comparing it to the libraries listed below

Sorting:

IBM / superglue-mtl
Boolean Question Answering with multi-task learning and uses large LM embeddings like BERT, RoBERTa
☆18Updated 5 years ago
shoarora / transformers-trainers
Tools for training pytorch language models
☆27Updated 4 years ago
alontalmor / oLMpics
☆46Updated 5 years ago
artetxem / uncovec
Uncovering divergent linguistic information in word embeddings with lessons for intrinsic and extrinsic evaluation
☆63Updated 6 years ago
amazon-science / wqa-cascade-transformers
☆22Updated 3 years ago
mukhal / fairseq-tagging
a Fairseq fork for sequence tagging/labeling tasks
☆31Updated 5 years ago
noisemix / noisemix
NoiseMix - data generation for natural language
☆40Updated 7 years ago
mandarjoshi90 / pair2vec
pair2vec: Compositional Word-Pair Embeddings for Cross-Sentence Inference
☆62Updated 2 years ago
boknilev / nlp-analysis-methods
Companion site for "Analysis Methods in Neural Language Processing: A Survey"
☆66Updated 5 years ago
huggingface / adversarialnlp
A generic library for crafting adversarial NLP examples - WIP
☆41Updated 6 years ago
chinnadhurai / ParlAI
A framework for training and evaluating AI models on a variety of openly available dialogue datasets.
☆36Updated 4 years ago
nyu-dl / dl4mt-seqgen
☆31Updated 6 years ago
tatsuokun / context2vec
PyTorch implementation of context2vec from Melamud et al., CoNLL 2016
☆19Updated 6 years ago
ZeweiChu / MQR
☆20Updated 5 years ago
golsun / SpaceFusion
NAACL'19: "Jointly Optimizing Diversity and Relevance in Neural Response Generation"
☆74Updated 4 years ago
yoavg / bert-syntax
Assessing syntactic abilities of BERT
☆148Updated 6 years ago
carolinlawrence / BiSon
Code for bidirectional sequence generation (BiSon) for generating from BERT pre-trained models.
☆51Updated 5 years ago
MiuLab / DuaLUG
The implementation of the papers on dual learning of natural language understanding and generation. (ACL2019,2020; Findings of EMNLP 2020…
☆66Updated 4 years ago
facebookresearch / TreeNLG
A novel method of constrained decoding for neural NLG (NNLG) models
☆83Updated 4 years ago
gcunhase / StackedDeBERT
Stacked Denoising BERT for Noisy Text Classification (Neural Networks 2020)
☆32Updated 2 years ago
hassyGo / charNgram2vec
Pre-training character n-gram embeddings
☆22Updated last year
ghaddarAbs / NER-with-LS
☆35Updated 3 years ago
zbloss / reformer_lm
a Pytorch implementation of the Reformer Network (https://openreview.net/pdf?id=rkgNKkHtvB)
☆53Updated 2 years ago
sumanbanerjee1 / Code-Mixed-Dialog
☆33Updated 7 years ago
mega002 / annotator_bias
The accompanying code for "Are We Modeling the Task or the Annotator? An Investigation of Annotator Bias in Natural Language Understandin…
☆21Updated 5 years ago
seominjoon / piqa
Phrase-Indexed Question Answering (PIQA)
☆94Updated 6 years ago
uclanlp / NamedEntityLanguageModel
☆32Updated 6 years ago
allenai / allennlp-template-python-script
A template for starting an allennlp project using a python script instead of config files
☆27Updated last year
ShaojieJiang / FACE
Official implementation of the models proposed in paper "Improving Neural Response Diversity with Frequency-Aware Cross-Entropy Loss"
☆19Updated 6 years ago
stefan-it / fine-tuned-berts-seq
Fine-tuned Transformers compatible BERT models for Sequence Tagging
☆40Updated 4 years ago