cxa-unique / Simplified-TinyBERT
ECIR'21: Simplified TinyBERT: Knowledge Distillation for Document Retrieval
☆17Updated 4 years ago
Alternatives and similar repositories for Simplified-TinyBERT
Users who are interested in Simplified-TinyBERT are comparing it to the libraries listed below.
- Source code for paper: Knowledge Inheritance for Pre-trained Language Models☆38Updated 3 years ago
- Code for EMNLP 2020 paper CoDIR☆41Updated 3 years ago
- [NeurIPS 2022] MorphTE: Injecting Morphology in Tensorized Embeddings☆17Updated 3 years ago
- Code for the paper "A Theoretical Analysis of the Repetition Problem in Text Generation" in AAAI 2021.☆57Updated 3 years ago
- Code associated with the paper **SkipBERT: Efficient Inference with Shallow Layer Skipping**, at ACL 2022☆16Updated 3 years ago
- ☆21Updated 4 years ago
- Code for ACL 2021 paper: Accelerating BERT Inference for Sequence Labeling via Early-Exit☆28Updated 3 years ago
- [ACL'20] Highway Transformer: A Gated Transformer.☆33Updated 4 years ago
- Adding new tasks to T0 without catastrophic forgetting☆33Updated 3 years ago
- The source code of the DR-BERT model and baselines☆38Updated 4 years ago
- EMNLP 2021 Adapting Language Models for Zero-shot Learning by Meta-tuning on Dataset and Prompt Collections☆52Updated 4 years ago
- Lite Self-Training☆30Updated 2 years ago
- WebRED is a large and diverse manually annotated dataset for extracting relationships from a variety of text found on the World Wide Web.☆22Updated 4 years ago
- Apply Iprompt on GLM with innovative new methods. Currently supports Chinese QA, English QA and Chinese poem generation.☆20Updated 3 years ago
- Unifew: Unified Fewshot Learning Model☆18Updated 4 years ago
- ☆67Updated 4 years ago
- Code for the paper "BERT Loses Patience: Fast and Robust Inference with Early Exit".☆66Updated 4 years ago
- A Structured Span Selector (NAACL 2022). A structured span selector with a WCFG for span selection tasks (coreference resolution, semanti…☆21Updated 3 years ago
- Code for ACL 2021 main conference paper "Conversations Are Not Flat: Modeling the Dynamic Information Flow across Dialogue Utterances".☆94Updated 4 years ago
- This repository contains the code for "How many data points is a prompt worth?"☆48Updated 4 years ago
- Source code for NAACL 2021 paper "TR-BERT: Dynamic Token Reduction for Accelerating BERT Inference"☆48Updated 3 years ago
- ☆23Updated 5 years ago
- Code for: "Cutting Down on Prompts and Parameters: Simple Few-Shot Learning with Language Models"☆20Updated 4 years ago
- DQ-BART: Efficient Sequence-to-Sequence Model via Joint Distillation and Quantization (ACL 2022)☆50Updated 2 years ago
- ☆54Updated 8 years ago
- EMNLP 2021: Single-dataset Experts for Multi-dataset Question-Answering☆68Updated 4 years ago
- We are creating a challenging new benchmark MultiReQA: A Cross-Domain Evaluation for Retrieval Question Answering Models. Retrieval quest…☆31Updated 5 years ago
- Transformers at any scale☆42Updated 2 years ago
- Super Tickets in Pre-Trained Language Models: From Model Compression to Improving Generalization (ACL 2021)☆19Updated 4 years ago
- Code and pre-trained models for the paper "Segatron: Segment-aware Transformer for Language Modeling and Understanding"☆18Updated 3 years ago