Skhaki18/optin-transformer-pruning

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Skhaki18/optin-transformer-pruning)

Skhaki18 / optin-transformer-pruning

[ICLR 2024] The Need for Speed: Pruning Transformers with One Recipe

☆29

Alternatives and similar repositories for optin-transformer-pruning

Users that are interested in optin-transformer-pruning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

mit-han-lab / sparserefine
View on GitHub
[ECCV 2024] SparseRefine: Sparse Refinement for Efficient High-Resolution Semantic Segmentation
☆16Jan 10, 2025Updated last year
Cattalyya / 3DCoMPaT-challenge
View on GitHub
A repo for publishing solution to 3DCoMPaT++ challenge on an improved large-scale 3D vision dataset for compositional recognition
☆14Jun 22, 2023Updated 3 years ago
tiandunx / loss_function_search
View on GitHub
Loss Function Search for Face Recognition
☆41Jan 9, 2021Updated 5 years ago
z-lab / flash-colreduce
View on GitHub
Fast, memory-efficient attention column reduction (e.g., sum, mean, max)
☆49Feb 10, 2026Updated 5 months ago
git-disl / recap
View on GitHub
Code for CVPR24 Paper - Resource-Efficient Transformer Pruning for Finetuning of Large Models
☆12Oct 31, 2025Updated 8 months ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
z-lab / sparselora
View on GitHub
[ICML 2025] SparseLoRA: Accelerating LLM Fine-Tuning with Contextual Sparsity
☆76Mar 10, 2026Updated 4 months ago
Z7zuqer / model-compression-and-acceleration-4-DNN
View on GitHub
model-compression-and-acceleration-4-DNN
☆21Nov 29, 2018Updated 7 years ago
yaolu-zjut / DDInterpreter
View on GitHub
☆15May 28, 2024Updated 2 years ago
choiHkk / nix-tts
View on GitHub
End-To-End SpeechSynthesis system with knowledge distillation
☆18Jul 16, 2022Updated 4 years ago
joey-wang123 / DRO-Task-free
View on GitHub
Code for Improving Task-free Continual Learning by Distributionally Robust Memory Evolution (ICML 2022)
☆11Aug 20, 2022Updated 3 years ago
yfujimura / nlos-neus
View on GitHub
The official pytorch implementation of "NLOS-NeuS: Non-line-of-sight Neural Implicit Surface," ICCV2023.
☆17Sep 29, 2023Updated 2 years ago
ZyoungInc / LVGL_RK_RGA
View on GitHub
LVGL RGA/DRM Acceleration Patches for Rockchip Linux (RK3506B / RK3588 / RK356x) Hardware-accelerated LVGL (8.4 / 9.1) rendering for Roc…
☆19Oct 12, 2025Updated 9 months ago
wangbo-zhao / 2022CVPR-MMMMTBVS
View on GitHub
This is the code for CVPR2022 paper "Modeling Motion with Multi-Modal Features for Text-Based Video Segmentation"
☆19Feb 19, 2023Updated 3 years ago
atul-1511 / RL-Recommendation-System
View on GitHub
Recommendation System using Deep Q-Networks and Double Deep Q-Networks
☆13May 23, 2020Updated 6 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
YaqianZhang / RepeatedAugmentedRehearsal
View on GitHub
☆11Jul 21, 2023Updated 3 years ago
radarFudan / Curse-of-memory
View on GitHub
Curse-of-memory phenomenon of RNNs in sequence modelling
☆19May 8, 2025Updated last year
zehao-wang / iPPD-sem
View on GitHub
☆21Apr 23, 2025Updated last year
omnia-postech / Miro
View on GitHub
Miro[ACM MobiCom '23] Cost-effective On-device Continual Learning over Memory Hierarchy with Miro
☆16Feb 1, 2024Updated 2 years ago
NUS-HPC-AI-Lab / Helen
View on GitHub
The official implementation of "Helen: Optimizing CTR Prediction Models with Frequency-wise Hessian Eigenvalue Regularization"
☆16Mar 14, 2024Updated 2 years ago
tyler-hayes / Embedded-CL
View on GitHub
PyTorch code for our CoLLAs-2022 paper "Online Continual Learning for Embedded Devices"
☆13Aug 4, 2022Updated 3 years ago
falcon-xu / LGViT
View on GitHub
Official PyTorch implementation of "LGViT: Dynamic Early Exiting for Accelerating Vision Transformer" (ACM MM 2023)
☆16Nov 18, 2024Updated last year
VITA-Group / BackRazor_Neurips22
View on GitHub
[Neurips 2022] “ Back Razor: Memory-Efficient Transfer Learning by Self-Sparsified Backpropogation”, Ziyu Jiang*, Xuxi Chen*, Xueqin Huan…
☆19Mar 14, 2023Updated 3 years ago
NUS-HPC-AI-Lab / InfoGrowth
View on GitHub
Efficient and Online Dataset Growth Algorithm (with cleanness and diversity awareness) to deal with growing web data
☆20Aug 6, 2024Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
OscarXZQ / delta_activations
View on GitHub
Official code release for Delta Activations: A Representation for Finetuned Large Language Models
☆20Sep 5, 2025Updated 10 months ago
HSG-AIML / NeurIPS_2022-Generative_Hyper_Representations
View on GitHub
Code Repository for the NeurIPS 2022 paper: "Hyper-Representations as Generative Models: Sampling Unseen Neural Network Weights".
☆19Jul 10, 2024Updated 2 years ago
SIJIEJI / JPTS
View on GitHub
Code for JPTS:Enhancing Deep Learning Performance of Massive MIMO CSI Feedback
☆17Jan 18, 2023Updated 3 years ago
PeihaoChen / WS-MGMap
View on GitHub
Official Pytorch implementation for NeurIPS 2022 paper "Weakly-Supervised Multi-Granularity Map Learning for Vision-and-Language Navigati…
☆35Apr 23, 2023Updated 3 years ago
princetonvisualai / icons
View on GitHub
☆22Apr 24, 2025Updated last year
kimihe / Octo
View on GitHub
Create tiny ML systems for on-device learning.
☆19Jul 14, 2021Updated 5 years ago
INK-USC / GMED
View on GitHub
Source code for "Gradient Based Memory Editing for Task-Free Continual Learning", 4th Lifelong ML Workshop@ICML 2020
☆17Dec 8, 2022Updated 3 years ago
danzeng1990 / Face2Exp
View on GitHub
☆15Mar 19, 2022Updated 4 years ago
boyazeng / weight_memorization
View on GitHub
Code release for "Generative Modeling of Weights: Generalization or Memorization?"
☆23Apr 9, 2026Updated 3 months ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
jinpeng0528 / STAR
View on GitHub
Code release for "Saving 100x Storage: Prototype Replay for Reconstructing Training Sample Distribution in Class-Incremental Semantic Seg…
☆20Mar 19, 2025Updated last year
zlab-princeton / UEval
View on GitHub
UEval: A Benchmark for Unified Multimodal Generation
☆24Apr 20, 2026Updated 3 months ago
KAIST-Visual-AI-Group / Diffusion-Assignment4-Distillation
View on GitHub
☆28Feb 8, 2025Updated last year
tobna / TaylorShift
View on GitHub
This repository contains the code for the paper "TaylorShift: Shifting the Complexity of Self-Attention from Squared to Linear (and Back)…
☆15Feb 25, 2026Updated 5 months ago
Paramathic / slim
View on GitHub
SLiM: One-shot Quantized Sparse Plus Low-rank Approximation of LLMs (ICML 2025)
☆37Nov 28, 2025Updated 8 months ago
thu-nics / VGDFR
View on GitHub
VGDFR: Diffuison-based Video Generation with Dynamic Frame Rate
☆18May 16, 2025Updated last year
lrzpellegrini / Latent-Replay
View on GitHub
Implementation of Latent Replay, a Continual Learning strategy for Real-Time / On The Edge applications
☆14May 7, 2020Updated 6 years ago