VITA-Group/BERT-Tickets

[NeurIPS 2020] "The Lottery Ticket Hypothesis for Pre-trained BERT Networks", Tianlong Chen, Jonathan Frankle, Shiyu Chang, Sijia Liu, Yang Zhang, Zhangyang Wang, Michael Carbin

☆141

Alternatives and similar repositories for BERT-Tickets

Users who are interested in BERT-Tickets are comparing it to the libraries listed below.

llyx97 / TAMT
[NAACL 2022] "Learning to Win Lottery Tickets in BERT Transfer via Task-agnostic Mask Training", Yuanxin Liu, Fandong Meng, Zheng Lin, Pe…
☆15 · Oct 18, 2022 · Updated 3 years ago
facebookresearch / SentAugment
SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…
☆359 · Feb 22, 2022 · Updated 4 years ago
VITA-Group / L2O-Minimax
[ICLR 2021] "Learning a Minimax Optimizer: A Pilot Study" by Jiayi Shen*, Xiaohan Chen*, Howard Heaton*, Tianlong Chen, Jialin Liu, Wotao…
☆15 · Dec 30, 2021 · Updated 4 years ago
VITA-Group / Junk_DNA_Hypothesis
[ICML 2024] Junk DNA Hypothesis: A Task-Centric Angle of LLM Pre-trained Weights through Sparsity; Lu Yin*, Ajay Jaiswal*, Shiwei Liu, So…
☆16 · Apr 21, 2025 · Updated last year
princeton-nlp / DinkyTrain
Princeton NLP's pre-training library based on fairseq with DeepSpeed kernel integration 🚃
☆117 · Oct 27, 2022 · Updated 3 years ago
huggingface / block_movement_pruning
Block Sparse movement pruning
☆83 · Nov 26, 2020 · Updated 5 years ago
UriSha / EmbeddinglessNMT
The implementation of "Neural Machine Translation without Embeddings", NAACL 2021
☆33 · Jun 9, 2021 · Updated 5 years ago
naver-ai / MetricMT
The official code repository for MetricMT - a reward optimization method for NMT with learned metrics
☆25 · Apr 24, 2021 · Updated 5 years ago
iedwardwangi / MetaAdapter
☆22 · Apr 21, 2021 · Updated 5 years ago
VITA-Group / PrAC-LTH
[ICML 2021] "Efficient Lottery Ticket Finding: Less Data is More" by Zhenyu Zhang*, Xuxi Chen*, Tianlong Chen*, Zhangyang Wang
☆26 · Dec 30, 2021 · Updated 4 years ago
LiyuanLucasLiu / Transformer-Clinic
Understanding the Difficulty of Training Transformers
☆332 · May 31, 2022 · Updated 4 years ago
VITA-Group / EarlyBERT
[ACL-IJCNLP 2021] "EarlyBERT: Efficient BERT Training via Early-bird Lottery Tickets" by Xiaohan Chen, Yu Cheng, Shuohang Wang, Zhe Gan, …
☆18 · Dec 30, 2021 · Updated 4 years ago
VITA-Group / CV_LTH_Pre-training
[CVPR 2021] "The Lottery Tickets Hypothesis for Supervised and Self-supervised Pre-training in Computer Vision Models" Tianlong Chen, Jon…
☆69 · Dec 17, 2022 · Updated 3 years ago
ptlmasking / maskbert
☆20 · Dec 16, 2020 · Updated 5 years ago
VITA-Group / SViTE
[NeurIPS'21] "Chasing Sparsity in Vision Transformers: An End-to-End Exploration" by Tianlong Chen, Yu Cheng, Zhe Gan, Lu Yuan, Lei Zhang…
☆87 · Dec 1, 2023 · Updated 2 years ago
mitchellgordon95 / bert-prune
☆17 · May 14, 2020 · Updated 6 years ago
namisan / exdeep-nmt
☆32 · Sep 27, 2021 · Updated 4 years ago
cliang1453 / super-structured-lottery-tickets
Super Tickets in Pre-Trained Language Models: From Model Compression to Improving Generalization (ACL 2021)
☆19 · Jul 28, 2021 · Updated 4 years ago
VITA-Group / Lifelong-Learning-LTH
[ICLR 2021] "Long Live the Lottery: The Existence of Winning Tickets in Lifelong Learning" by Tianlong Chen*, Zhenyu Zhang*, Sijia Liu, S…
☆26 · Dec 30, 2021 · Updated 4 years ago
cheneydon / efficient-bert
This repository contains the code for the paper in Findings of EMNLP 2021: "EfficientBERT: Progressively Searching Multilayer Perceptron …
☆32 · Jun 14, 2023 · Updated 3 years ago
facebookresearch / open_lth
A repository in preparation for open-sourcing lottery ticket hypothesis code.
☆640 · Sep 6, 2022 · Updated 3 years ago
llyx97 / Rosita
[AAAI 2021] "ROSITA: Refined BERT cOmpreSsion with InTegrAted techniques", Yuanxin Liu, Zheng Lin, Fengcheng Yuan
☆14 · Oct 18, 2022 · Updated 3 years ago
harvardnlp / cascaded-generation
Cascaded Text Generation with Markov Transformers
☆130 · Mar 20, 2023 · Updated 3 years ago
mingdachen / disentangle-semantics-syntax
Code for "A Multi-Task Approach for Disentangling Syntax and Semantics in Sentence Representations" (NAACL 2019)
☆67 · Mar 5, 2021 · Updated 5 years ago
seilna / CNN-Units-in-NLP
Repository for our ICLR 2019 paper: Discovery of Natural Language Concepts in Individual Units of CNNs
☆26 · Mar 9, 2019 · Updated 7 years ago
microsoft / MetaXL
Meta Representation Transformation for Low-resource Cross-lingual Learning
☆41 · May 5, 2021 · Updated 5 years ago
yagays / glyph-aware-character-embedding
☆14 · Aug 28, 2018 · Updated 7 years ago
sai-prasanna / bert-experiments
☆19 · Oct 6, 2020 · Updated 5 years ago
lifu-tu / ENGINE
ENGINE: Energy-Based Inference Networks for Non-Autoregressive Machine Translation
☆25 · Oct 2, 2020 · Updated 5 years ago
yoonkim / neural-qcfg
☆45 · Oct 11, 2021 · Updated 4 years ago
princeton-nlp / CoFiPruning
[ACL 2022] Structured Pruning Learns Compact and Accurate Models https://arxiv.org/abs/2204.00408
☆199 · May 9, 2023 · Updated 3 years ago
monologg / kakaotrans
[Unofficial] Kakaotrans: Kakao translate API for python
☆16 · Mar 29, 2020 · Updated 6 years ago
MadhumithaKannan / linear-regression-using-only-numpy
Implementation of unregularized, l1 regularized and l2 regularized linear regression using numpy and without sklearn
☆11 · Oct 4, 2019 · Updated 6 years ago
shuohangwang / Cross-Thought
☆47 · Jan 21, 2021 · Updated 5 years ago
VITA-Group / GAN-LTH
[ICLR 2021] "GANs Can Play Lottery Too" by Xuxi Chen, Zhenyu Zhang, Yongduo Sui, Tianlong Chen
☆26 · Feb 18, 2022 · Updated 4 years ago
harvardnlp / urnng
☆179 · Jul 31, 2020 · Updated 5 years ago
inspire-group / hydra
Code and checkpoints of compressed networks for the paper titled "HYDRA: Pruning Adversarially Robust Neural Networks" (NeurIPS 2020) (ht…
☆91 · Dec 22, 2022 · Updated 3 years ago
princeton-nlp / OptiPrompt
[NAACL 2021] Factual Probing Is [MASK]: Learning vs. Learning to Recall https://arxiv.org/abs/2104.05240
☆168 · Oct 7, 2022 · Updated 3 years ago
clovaai / length-adaptive-transformer
Official Pytorch Implementation of Length-Adaptive Transformer (ACL 2021)
☆102 · Nov 2, 2020 · Updated 5 years ago