[ACL-IJCNLP 2021] "EarlyBERT: Efficient BERT Training via Early-bird Lottery Tickets" by Xiaohan Chen, Yu Cheng, Shuohang Wang, Zhe Gan, Zhangyang Wang and Jingjing Liu
☆18 · Dec 30, 2021 · Updated 4 years ago
Alternatives and similar repositories for EarlyBERT
Users who are interested in EarlyBERT are comparing it to the repositories listed below.
- [ECCV 2022] SuperTickets: Drawing Task-Agnostic Lottery Tickets from Supernets via Jointly Architecture Searching and Parameter Pruning ☆20 · Jul 7, 2022 · Updated 3 years ago
- Code for our AAAI2021 paper: Token-Aware Virtual Adversarial Training For Language Understanding. ☆25 · Dec 3, 2020 · Updated 5 years ago
- ☆14 · Mar 18, 2022 · Updated 3 years ago
- Zero-shot Learning by Generating Task-specific Adapters ☆14 · Apr 2, 2021 · Updated 4 years ago
- Learning adapter weights from task descriptions ☆19 · Nov 12, 2023 · Updated 2 years ago
- Code of Robust Lottery Tickets for Pre-trained Language Models (ACL2022) ☆20 · Jul 18, 2022 · Updated 3 years ago
- ☆26 · Nov 23, 2023 · Updated 2 years ago
- A plug-in of Microsoft DeepSpeed to fix the bug of DeepSpeed pipeline ☆25 · Apr 16, 2021 · Updated 4 years ago
- Official Implementation of PL-FMS ☆11 · Sep 30, 2023 · Updated 2 years ago
- ☆28 · Sep 28, 2021 · Updated 4 years ago
- This is implementation of the paper 'Toward Diverse Text Generation with Inverse Reinforcement Learning' https://arxiv.org/abs/1804.11258… ☆34 · Nov 29, 2018 · Updated 7 years ago
- Source code for paper on commonsense reasoning for 2020 Annual Conference of the Association for Computational Linguistics (ACL) 2020. ☆29 · Aug 2, 2024 · Updated last year
- This is the pytorch implementation of the long paper on ACL 2020: A Self-Training Method for Machine Reading Comprehension with Soft Evid… ☆34 · Aug 14, 2020 · Updated 5 years ago
- [NeurIPS 2020] "The Lottery Ticket Hypothesis for Pre-trained BERT Networks", Tianlong Chen, Jonathan Frankle, Shiyu Chang, Sijia Liu, Ya… ☆142 · Dec 30, 2021 · Updated 4 years ago
- FlowDelta: Modeling Flow Information Gain in Reasoning for Conversational Machine Comprehension ☆35 · Oct 4, 2022 · Updated 3 years ago
- A Python package for Data Interchange for Geotechnical and Geoenvironmental Specialists (DIGGS). ☆11 · Feb 7, 2025 · Updated last year
- Residual Quantization Autoencoder, used for interpreting LLMs ☆14 · Jan 1, 2025 · Updated last year
- Official repository for "DYPLOC: Dynamic Planning of Content Using Mixed Language Models for Opinion Text Generation" ☆10 · May 20, 2022 · Updated 3 years ago
- A friendly, afternoon introduction to html and css. ☆21 · Jul 18, 2014 · Updated 11 years ago
- ☆12 · Aug 15, 2023 · Updated 2 years ago
- Broadcom Robo Chip firmware cross compilation ☆11 · Oct 16, 2018 · Updated 7 years ago
- Social coding with Git and GitHub. ☆18 · Mar 21, 2015 · Updated 10 years ago
- A tutorial to help you make the move to GitHub ☆10 · Jun 2, 2023 · Updated 2 years ago
- Code for ACL 2022 paper "BERT Learns to Teach: Knowledge Distillation with Meta Learning". ☆87 · Aug 4, 2022 · Updated 3 years ago
- golang tun nat ☆11 · Jul 20, 2022 · Updated 3 years ago
- ☆10 · Dec 10, 2021 · Updated 4 years ago
- Machine Learning Reading Group ☆11 · Sep 15, 2023 · Updated 2 years ago
- ☆12 · Feb 19, 2025 · Updated last year
- CommonsenseQA ☆10 · Mar 28, 2020 · Updated 5 years ago
- BachDuet enables a human performer to improvise a duet counterpoint with a computer agent in real time. ☆14 · Aug 8, 2022 · Updated 3 years ago
- <Img src={url}> - Image processing/resizing, CDNs, blur-in, progressive/lazy-loading superpowers. ☆13 · Jan 31, 2018 · Updated 8 years ago
- List of Regions, Districts, Communes, and Fokontany ☆13 · Mar 2, 2020 · Updated 6 years ago
- configs for my eclipse IDE (sts) ☆12 · Dec 9, 2025 · Updated 2 months ago
- triton ver of gqa flash attn, based on the tutorial ☆12 · Aug 4, 2024 · Updated last year
- Canvas that explodes after a click (particles burst and spray ink when clicked). ☆10 · Jan 22, 2025 · Updated last year
- ☆11 · Nov 13, 2024 · Updated last year
- Microbiome Analysis Plotting and Visualization ☆12 · Updated this week
- ☆10 · Aug 18, 2022 · Updated 3 years ago
- Pytorch implementation of HCNAF: Hyper-Conditioned Neural Autoregressive Flow (CVPR 2020) ☆15 · Jun 14, 2020 · Updated 5 years ago