cxa-unique / Simplified-TinyBERT

ECIR'21: Simplified TinyBERT: Knowledge Distillation for Document Retrieval

☆17

Alternatives and similar repositories for Simplified-TinyBERT

Users that are interested in Simplified-TinyBERT are comparing it to the libraries listed below

Sorting:

xiamengzhou / NLPerf
Performance Prediction for NLP Tasks
☆16Updated 5 years ago
thunlp / Knowledge-Inheritance
Source code for paper: Knowledge Inheritance for Pre-trained Language Models
☆38Updated 3 years ago
intersun / CoDIR
Code for EMNLP 2020 paper CoDIR
☆41Updated 2 years ago
allenai / staged-training
Staged Training for Transformer Language Models
☆32Updated 3 years ago
microsoft / AMOS
[ICLR 2022] Pretraining Text Encoders with Adversarial Mixture of Training Signal Generators
☆24Updated last year
allenai / unifew
Unifew: Unified Fewshot Learning Model
☆18Updated 3 years ago
TevenLeScao / pet
This repository contains the code for "How many data points is a prompt worth?"
☆48Updated 4 years ago
bigganbing / Fairseq_MorphTE
[NeurIPS 2022]MorphTE: Injecting Morphology in Tensorized Embeddings
☆17Updated 2 years ago
MurtyShikhar / ExpBERT
Code for our ACL '20 paper "Representation Engineering with Natural Language Explanations"
☆29Updated 4 years ago
fuzihaofzh / repetition-problem-nlg
Code for the paper "A Theoretical Analysis of the Repetition Problem in Text Generation" in AAAI 2021.
☆53Updated 2 years ago
frankxu2004 / knnlm-why
Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"
☆56Updated 2 years ago
mstrise / seq2label-crossrep
Sequence Labeling Parsing by Learning Across Representations
☆13Updated 5 years ago
HLR / TSLM
The Implementation for the Paper "Time-Stamped Language Model: Teaching Language Models toUnderstand The Flow of Events"
☆11Updated 4 years ago
salesforce / FactLM
☆10Updated 3 years ago
twinkle0331 / Xcompression
[ICLR 2022] Code for paper "Exploring Extreme Parameter Compression for Pre-trained Language Models"(https://arxiv.org/abs/2205.10036)
☆22Updated last year
salesforce / Overture
Library for soft prompt tuning
☆23Updated last year
ThomasScialom / T0_continual_learning
Adding new tasks to T0 without catastrophic forgetting
☆33Updated 2 years ago
renll / SparseLT
[EMNLP 2022] Language Model Pre-Training with Sparse Latent Typing
☆14Updated 2 years ago
cliang1453 / SAGE
No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models (ICLR 2022)
☆30Updated 3 years ago
Noahs-ARK / PaLM
PyTorch implementation for PaLM: A Hybrid Parser and Language Model.
☆10Updated 5 years ago
Noahs-ARK / GroC
Pytorch implementation of models described in "Grounded compositional outputs for adaptive language modeling", EMNLP 2020.
☆18Updated 3 years ago
XinyuHua / textgen-emnlp19
Code for our EMNLP 2019 paper titled "Sentence-Level Content Planning and Style Specification for Neural Text Generation"
☆17Updated 5 years ago
cliang1453 / super-structured-lottery-tickets
Super Tickets in Pre-Trained Language Models: From Model Compression to Improving Generalization (ACL 2021)
☆17Updated 3 years ago
Yifan-Gao / open_retrieval_conversational_machine_reading
Open-Retrieval Conversational Machine Reading: A new setting & OR-ShARC dataset
☆13Updated 2 years ago
castorini / berxit
☆22Updated 4 years ago
LeeSureman / Sequence-Labeling-Early-Exit
Code for ACL 2021 paper: Accelerating BERT Inference for Sequence Labeling via Early-Exit
☆28Updated 2 years ago
RUCAIBox / ELMER
This repository is the official implementation of our EMNLP 2022 paper ELMER: A Non-Autoregressive Pre-trained Language Model for Efficie…
☆26Updated 2 years ago
kyegomez / Reka-Torch
Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch
☆30Updated 3 weeks ago
StonyBrookNLP / teabreac
Repository for Teaching Broad Reasoning Skills for Multi-Step QA by Generating Hard Contexts, EMNLP22
☆19Updated last year
jxhe / sparse-text-prototype
PyTorch Implementation of NeurIPS 2020 paper "Learning Sparse Prototypes for Text Generation"
☆22Updated 3 years ago