VITA-Group/EarlyBERT

[ACL-IJCNLP 2021] "EarlyBERT: Efficient BERT Training via Early-bird Lottery Tickets" by Xiaohan Chen, Yu Cheng, Shuohang Wang, Zhe Gan, Zhangyang Wang and Jingjing Liu

☆18

Alternatives and similar repositories for EarlyBERT

Users who are interested in EarlyBERT are comparing it to the repositories listed below.

GATECH-EIC / SuperTickets
[ECCV 2022] SuperTickets: Drawing Task-Agnostic Lottery Tickets from Supernets via Jointly Architecture Searching and Parameter Pruning
☆20 · Jul 7, 2022 · Updated 4 years ago
LinyangLee / Token-Aware-VAT
Code for our AAAI2021 paper: Token-Aware Virtual Adversarial Training For Language Understanding.
☆25 · Dec 3, 2020 · Updated 5 years ago
ruizheng20 / robust_ticket
Code of Robust Lottery Tickets for Pre-trained Language Models (ACL2022)
☆20 · Jul 18, 2022 · Updated 4 years ago
INK-USC / hypter
Zero-shot Learning by Generating Task-specific Adapters
☆14 · Apr 2, 2021 · Updated 5 years ago
VITA-Group / instant_soup
[ICML2023] Instant Soup Cheap Pruning Ensembles in A Single Pass Can Draw Lottery Tickets from Large Models. Ajay Jaiswal, Shiwei Liu, Ti…
☆11 · Nov 28, 2023 · Updated 2 years ago
allenai / hyper-task-descriptions
Learning adapter weights from task descriptions
☆20 · Nov 12, 2023 · Updated 2 years ago
prateeky2806 / ComPEFT
☆26 · Nov 23, 2023 · Updated 2 years ago
MANGA-UOFA / PTfer
☆11 · Nov 13, 2024 · Updated last year
VITA-Group / Lifelong-Learning-LTH
[ICLR 2021] "Long Live the Lottery: The Existence of Winning Tickets in Lifelong Learning" by Tianlong Chen*, Zhenyu Zhang*, Sijia Liu, S…
☆26 · Dec 30, 2021 · Updated 4 years ago
qinliu9 / Flooding-X
☆14 · Jul 13, 2022 · Updated 4 years ago
HarlynDN / WebCiteS
[ACL'24] WebCiteS: Attributed Query-Focused Summarization on Chinese Web Search Results with Citations
☆13 · Sep 11, 2024 · Updated last year
SAP-archive / acl2020-commonsense
Source code for paper on commonsense reasoning for 2020 Annual Conference of the Association for Computational Linguistics (ACL) 2020.
☆29 · Aug 2, 2024 · Updated last year
MajorDavidZhang / MCL
code for Learning the Unlearned: Mitigating Feature Suppression in Contrastive Learning
☆20 · Jul 16, 2024 · Updated 2 years ago
VITA-Group / BERT-Tickets
[NeurIPS 2020] "The Lottery Ticket Hypothesis for Pre-trained BERT Networks", Tianlong Chen, Jonathan Frankle, Shiyu Chang, Sijia Liu, Ya…
☆141 · Dec 30, 2021 · Updated 4 years ago
intersun / CoDIR
Code for EMNLP 2020 paper CoDIR
☆41 · Oct 4, 2022 · Updated 3 years ago
VITA-Group / Diverse-ViT
[CVPR 2022] "The Principle of Diversity: Training Stronger Vision Transformers Calls for Reducing All Levels of Redundancy" by Tianlong C…
☆25 · Mar 9, 2022 · Updated 4 years ago
tencent-ailab / ICML21_OAXE
☆28 · Sep 28, 2021 · Updated 4 years ago
ArminAzizi98 / LaMDA
☆15 · Nov 7, 2024 · Updated last year
GATECH-EIC / torchshiftadd
An open-sourced PyTorch library for developing energy efficient multiplication-less models and applications.
☆14 · Feb 3, 2025 · Updated last year
TsinghuaAI / TDS
A plug-in of Microsoft DeepSpeed to fix the bug of DeepSpeed pipeline
☆25 · Apr 16, 2021 · Updated 5 years ago
WantD998 / ICC-vx2478-4k
Color calibration files
☆12 · Aug 27, 2020 · Updated 5 years ago
zhuchen03 / FreeLB
Adversarial Training for Natural Language Understanding
☆252 · Sep 6, 2023 · Updated 2 years ago
RahulSChand / Weighted-low-rank-factorization-Pytorch
PyTorch implementation of Language model compression with weighted low-rank factorization
☆14 · Jun 28, 2023 · Updated 3 years ago
Torment123 / DFS
☆15 · Jan 8, 2020 · Updated 6 years ago
xydaytoy / EVA
☆14 · Apr 16, 2024 · Updated 2 years ago
AlanAnsell / peft
☆22 · Jul 5, 2024 · Updated 2 years ago
ych133 / How2R-and-How2QA
A video retrieval dataset How2R and a video QA dataset How2QA
☆24 · Oct 15, 2020 · Updated 5 years ago
llyx97 / TAMT
[NAACL 2022] "Learning to Win Lottery Tickets in BERT Transfer via Task-agnostic Mask Training", Yuanxin Liu, Fandong Meng, Zheng Lin, Pe…
☆15 · Oct 18, 2022 · Updated 3 years ago
AwesomeTang / TensorFLow-MINST
MNIST classification implemented with CNN, LSTM, and other models; updated on an ongoing basis 🚵‍♀️🚵‍♀️🚵‍♀️
☆13 · May 2, 2019 · Updated 7 years ago
jiaxue-ai / GTN
PyTorch implementation for "Gated Transfer Network for Transfer Learning"
☆11 · Jun 3, 2019 · Updated 7 years ago
TIGER-AI-Lab / VISTA
The code for "VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by VIdeo SpatioTemporal Augmentation" [CVPR2025]
☆20 · Feb 27, 2025 · Updated last year
TanayNarshana / DFPC-Pruning
[ICLR 2023] PyTorch code for DFPC: Data flow driven pruning of coupled channels without data.
☆15 · Aug 25, 2023 · Updated 2 years ago
GATECH-EIC / FracTrain
[NeurIPS 2020] "FracTrain: Fractionally Squeezing Bit Savings Both Temporally and Spatially for Efficient DNN Training" by Yonggan Fu, Ha…
☆10 · Feb 13, 2022 · Updated 4 years ago
peterbhase / ExplanationRoles
Code for paper "When Can Models Learn From Explanations? A Formal Framework for Understanding the Roles of Explanation Data"
☆14 · Feb 16, 2021 · Updated 5 years ago
Kampi / Zybo-Linux
A complete Linux project for the ZYBO. This project helps me during my first steps with embedded Linux. You can find anything necessary t…
☆13 · Oct 8, 2020 · Updated 5 years ago
XMUDeepLIT / QGC
Code for "Retaining Key Information under High Compression Rates: Query-Guided Compressor for LLMs" (ACL 2024)
☆20 · Jun 12, 2024 · Updated 2 years ago
NonvolatileMemory / flash_attn_gqa
triton ver of gqa flash attn, based on the tutorial
☆12 · Aug 4, 2024 · Updated last year
Mikivishy / FullFront
The official code repository for the FullFront benchmark
☆27 · May 16, 2025 · Updated last year
lucidrains / distilled-retriever-pytorch
Implementation of the retriever distillation procedure as outlined in the paper "Distilling Knowledge from Reader to Retriever"
☆32 · Dec 16, 2020 · Updated 5 years ago