iurada / px-ntk-pruningLinks
Official repository of our work "Finding Lottery Tickets in Vision Models via Data-driven Spectral Foresight Pruning" accepted at CVPR 2024
☆25Updated 8 months ago
Alternatives and similar repositories for px-ntk-pruning
Users that are interested in px-ntk-pruning are comparing it to the libraries listed below
Sorting:
- [ICCV23] Robust Mixture-of-Expert Training for Convolutional Neural Networks by Yihua Zhang, Ruisi Cai, Tianlong Chen, Guanhua Zhang, Hua…☆65Updated 2 years ago
- Code for 'Multi-level Logit Distillation' (CVPR2023)☆70Updated last year
- 这里陈列了我编写的一些关于深度学习的画图工具,如果觉得有帮助可以给个star.☆42Updated 3 years ago
- [AAAI 2023] Official PyTorch Code for "Curriculum Temperature for Knowledge Distillation"☆179Updated 10 months ago
- [CVPR 2023 Highlight] This is the official implementation of "Stitchable Neural Networks".☆249Updated 2 years ago
- Official implementation for paper "Knowledge Diffusion for Distillation", NeurIPS 2023☆90Updated last year
- PyTorch code and checkpoints release for OFA-KD: https://arxiv.org/abs/2310.19444☆131Updated last year
- [NeurIPS'23] DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions☆62Updated last year
- ☆27Updated 2 years ago
- [ICCV 23]An approach to enhance the efficiency of Vision Transformer (ViT) by concurrently employing token pruning and token merging tech…☆101Updated 2 years ago
- ImageNet-1K data download, processing for using as a dataset☆114Updated 2 years ago
- Awesome-Low-Rank-Adaptation☆116Updated last year
- ICLR 2024, Towards Lossless Dataset Distillation via Difficulty-Aligned Trajectory Matching☆100Updated last year
- CVPR 2023, Class Attention Transfer Based Knowledge Distillation☆46Updated 2 years ago
- Learning Efficient Vision Transformers via Fine-Grained Manifold Distillation. NeurIPS 2022.☆32Updated 3 years ago
- [ICLR 2024 Oral] Improving Convergence and Generalization Using Parameter Symmetries☆29Updated last year
- The official repo for CVPR2023 highlight paper "Gradient Norm Aware Minimization Seeks First-Order Flatness and Improves Generalization".☆84Updated 2 years ago
- The official implementation for paper: Improving Knowledge Distillation via Regularizing Feature Norm and Direction☆22Updated 2 years ago
- ☆28Updated last year
- Training ImageNet / CIFAR models with sota strategies and fancy techniques such as ViT, KD, Rep, etc.☆84Updated last year
- Official implementation of paper "Knowledge Distillation from A Stronger Teacher", NeurIPS 2022