[ICLR 2024] The Need for Speed: Pruning Transformers with One Recipe
☆30Sep 2, 2024Updated last year
Alternatives and similar repositories for optin-transformer-pruning
Users that are interested in optin-transformer-pruning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Prioritize Alignment in Dataset Distillation☆21Dec 3, 2024Updated last year
- A repo for publishing solution to 3DCoMPaT++ challenge on an improved large-scale 3D vision dataset for compositional recognition☆14Jun 22, 2023Updated 2 years ago
- ☆23Nov 16, 2024Updated last year
- Fast, memory-efficient attention column reduction (e.g., sum, mean, max)☆44Feb 10, 2026Updated 2 months ago
- Official code for paper "OpenCIL: Benchmarking Out-of-Distribution Detection in Class-Incremental Learning"☆11Jun 19, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Loss Function Search for Face Recognition☆41Jan 9, 2021Updated 5 years ago
- [ICCV 2025] EA-ViT: Efficient Adaptation for Elastic Vision Transformer☆27Jul 28, 2025Updated 8 months ago
- Code for CVPR24 Paper - Resource-Efficient Transformer Pruning for Finetuning of Large Models☆12Oct 31, 2025Updated 5 months ago
- [WACV 2025] Official code release for Transientangelo: Few-Viewpoint Surface Reconstruction Using Single-Photon Lidar☆20Oct 29, 2024Updated last year
- ☆19Apr 22, 2022Updated 3 years ago
- ☆21Apr 23, 2025Updated 11 months ago
- 아주대학교 연습용 수강신청 사이트입니다 :] (로컬 버전)☆13Feb 2, 2026Updated 2 months ago
- model-compression-and-acceleration-4-DNN☆21Nov 29, 2018Updated 7 years ago
- Dataset Quantization with Active Learning based Adaptive Sampling [ECCV 2024]☆10Jul 9, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- PyTorch implementation of "Learning from Students: Online Contrastive Distillation Network for General Continual Learning" (IJCAI 2022)☆11Dec 29, 2022Updated 3 years ago
- PyTorch Implementation of Self-Supervised Learning models☆13Apr 25, 2021Updated 4 years ago
- Official PyTorch implementation of "LGViT: Dynamic Early Exiting for Accelerating Vision Transformer" (ACM MM 2023)☆16Nov 18, 2024Updated last year
- A repository to keep track of literature on catastrophic forgetting☆37Mar 10, 2020Updated 6 years ago
- mycloudhome is a cli tool for Western Digital MY CLOUD HOME☆20Feb 24, 2022Updated 4 years ago
- This is the code for CVPR2022 paper "Modeling Motion with Multi-Modal Features for Text-Based Video Segmentation"☆19Feb 19, 2023Updated 3 years ago
- Curse-of-memory phenomenon of RNNs in sequence modelling☆19May 8, 2025Updated 11 months ago
- ☆11Jul 21, 2023Updated 2 years ago
- The official implementation of "Helen: Optimizing CTR Prediction Models with Frequency-wise Hessian Eigenvalue Regularization"☆16Mar 14, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Miro[ACM MobiCom '23] Cost-effective On-device Continual Learning over Memory Hierarchy with Miro☆16Feb 1, 2024Updated 2 years ago
- PyTorch code for our CoLLAs-2022 paper "Online Continual Learning for Embedded Devices"☆13Aug 4, 2022Updated 3 years ago
- Code for Improving Task-free Continual Learning by Distributionally Robust Memory Evolution (ICML 2022)☆11Aug 20, 2022Updated 3 years ago
- Efficient and Online Dataset Growth Algorithm (with cleanness and diversity awareness) to deal with growing web data☆21Aug 6, 2024Updated last year
- Official Implementation of paper "Distilling Long-tailed Datasets" [CVPR 2025]☆21Aug 13, 2025Updated 8 months ago
- Code release for "Understanding Bias in Large-Scale Visual Datasets"☆23Dec 4, 2024Updated last year
- Official Pytorch implementation for NeurIPS 2022 paper "Weakly-Supervised Multi-Granularity Map Learning for Vision-and-Language Navigati…☆33Apr 23, 2023Updated 2 years ago
- ☆30Feb 27, 2025Updated last year
- Recommendation System using Deep Q-Networks and Double Deep Q-Networks☆13May 23, 2020Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- MoE-Visualizer is a tool designed to visualize the selection of experts in Mixture-of-Experts (MoE) models.☆16Apr 8, 2025Updated last year
- This repository contains the code for the paper "TaylorShift: Shifting the Complexity of Self-Attention from Squared to Linear (and Back)…☆14Feb 25, 2026Updated last month
- Source code for "Gradient Based Memory Editing for Task-Free Continual Learning", 4th Lifelong ML Workshop@ICML 2020☆17Dec 8, 2022Updated 3 years ago
- [AAAI2024] Summarizing Stream Data for Memory-Restricted Online Continual Learning☆21Apr 30, 2024Updated last year
- A whisper repo for TPU☆11Jun 4, 2024Updated last year
- Code release for "Saving 100x Storage: Prototype Replay for Reconstructing Training Sample Distribution in Class-Incremental Semantic Seg…☆20Mar 19, 2025Updated last year
- simple and efficient baselines for practical semantic segmentation with plain ViTs☆20Mar 9, 2024Updated 2 years ago