[Neurips 2022] “ Back Razor: Memory-Efficient Transfer Learning by Self-Sparsified Backpropogation”, Ziyu Jiang*, Xuxi Chen*, Xueqin Huang, Xianzhi Du, Denny Zhou, Zhangyang Wang
☆19Mar 14, 2023Updated 3 years ago
Alternatives and similar repositories for BackRazor_Neurips22
Users that are interested in BackRazor_Neurips22 are comparing it to the libraries listed below
Sorting:
- This is the official PyTorch implementation for "Mesa: A Memory-saving Training Framework for Transformers".☆120Dec 12, 2021Updated 4 years ago
- [NAACL 2022] "Learning to Win Lottery Tickets in BERT Transfer via Task-agnostic Mask Training", Yuanxin Liu, Fandong Meng, Zheng Lin, Pe…☆15Oct 18, 2022Updated 3 years ago
- ☆20Dec 16, 2020Updated 5 years ago
- Code release for Deep Incubation (https://arxiv.org/abs/2212.04129)☆92Mar 16, 2023Updated 3 years ago
- Prior Knowledge Guided Unsupervised Domain Adaptation (ECCV 2022)☆17Sep 6, 2022Updated 3 years ago
- IntLLaMA: A fast and light quantization solution for LLaMA☆18Jul 21, 2023Updated 2 years ago
- Exploring Lottery Ticket Hypothesis in Sparse Spiking Neural Networks (ECCV2022, oral presentation)☆36Jul 18, 2022Updated 3 years ago
- ☆29Dec 5, 2023Updated 2 years ago
- Official PyTorch implementation of CD-MOE☆12Updated this week
- 一个基于Python的智能量化虚拟货币交易分析工具,支持okx和binance,集成持仓分析、网格策略、机器学习预测等功能,为交易决策提供全方位支持。☆36Oct 24, 2025Updated 4 months ago
- PyTorch implementation of "Learning from Students: Online Contrastive Distillation Network for General Continual Learning" (IJCAI 2022)☆11Dec 29, 2022Updated 3 years ago
- Official PyTorch implementation of "LGViT: Dynamic Early Exiting for Accelerating Vision Transformer" (ACM MM 2023)☆15Nov 18, 2024Updated last year
- Randomized algorithm class at CU☆15Jul 8, 2025Updated 8 months ago
- [ICLR 2025] Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models Better☆16Feb 15, 2025Updated last year
- [ICASSP2022] RATE CODING OR DIRECT CODING: WHICH ONE IS BETTER FOR ACCURATE, ROBUST, and ENERGY-EFFICIENT SPIKING NEURAL NETWORKS☆20Sep 26, 2023Updated 2 years ago
- [ICLR 2023] “ Layer Grafted Pre-training: Bridging Contrastive Learning And Masked Image Modeling For Better Representations”, Ziyu Jian…☆24Feb 16, 2023Updated 3 years ago
- ☆20Dec 14, 2023Updated 2 years ago
- ACL 2023☆39Jun 6, 2023Updated 2 years ago
- ☆11Jul 21, 2023Updated 2 years ago
- This is official code implementation of the <Revisiting Neural Networks for Continual Learning: An Architectural Perspective> in IJCAI 20…☆13Nov 25, 2024Updated last year
- ☆12Jul 6, 2022Updated 3 years ago
- ☆14Feb 2, 2021Updated 5 years ago
- Miro[ACM MobiCom '23] Cost-effective On-device Continual Learning over Memory Hierarchy with Miro☆16Feb 1, 2024Updated 2 years ago
- [ICASSP 2022] Official PyTorch Implementation for "Attention Probe: Vision Transformer Distillation in the Wild" (ICASSP 2022)☆11Jan 23, 2022Updated 4 years ago
- DropIT: Dropping Intermediate Tensors for Memory-Efficient DNN Training (ICLR 2023)☆32Apr 8, 2023Updated 2 years ago
- PyTorch code for our CoLLAs-2022 paper "Online Continual Learning for Embedded Devices"☆13Aug 4, 2022Updated 3 years ago
- Code for Improving Task-free Continual Learning by Distributionally Robust Memory Evolution (ICML 2022)☆11Aug 20, 2022Updated 3 years ago
- ☆14Oct 30, 2021Updated 4 years ago
- Benchmarking Attention Mechanism in Vision Transformers.☆20Oct 10, 2022Updated 3 years ago
- Official Pytorch Implementation of "Outlier-weighed Layerwise Sampling for LLM Fine-tuning" by Pengxiang Li, Lu Yin, Xiaowei Gao, Shiwei …☆35Jun 3, 2025Updated 9 months ago
- Mutual attention model for matching QA pairs in dialogues☆11Sep 20, 2020Updated 5 years ago
- [ICML 2021] "Do We Actually Need Dense Over-Parameterization? In-Time Over-Parameterization in Sparse Training" by Shiwei Liu, Lu Yin, De…☆45Nov 11, 2023Updated 2 years ago
- Code for the paper "On the Road to Online Adaptation for Semantic Image Segmentation", CVPR 2022☆29Oct 18, 2022Updated 3 years ago
- ☆11Jul 31, 2022Updated 3 years ago
- My solution code to parallel architecture and programming Spring 2016☆12Aug 15, 2016Updated 9 years ago
- PyTorch code for the CVPR'23 paper: "ConStruct-VL: Data-Free Continual Structured VL Concepts Learning"☆14Feb 5, 2024Updated 2 years ago
- Code for NeurIPS 2021 paper "Flattening Sharpness for Dynamic Gradient Projection Memory Benefits Continual Learning".☆16Oct 18, 2021Updated 4 years ago
- [CVPR 2023] Official repository for paper "Stare at What You See: Masked Image Modeling without Reconstruction"☆70Jul 2, 2025Updated 8 months ago
- ☆20Updated this week