zju-SWJ / RLD
Official implementation for "Knowledge Distillation with Refined Logits".
☆13 · Updated 4 months ago
Alternatives and similar repositories for RLD:
Users who are interested in RLD compare it with the repositories listed below.
- The official implementation for MTLoRA: A Low-Rank Adaptation Approach for Efficient Multi-Task Learning ☆38 · Updated 5 months ago
- [BMVC 2022] Information Theoretic Representation Distillation ☆18 · Updated last year
- The official implementation of LumiNet: The Bright Side of Perceptual Knowledge Distillation https://arxiv.org/abs/2310.03669 ☆20 · Updated 10 months ago
- Learning Efficient Vision Transformers via Fine-Grained Manifold Distillation. NeurIPS 2022. ☆32 · Updated 2 years ago
- [NeurIPS'22] Projector Ensemble Feature Distillation ☆29 · Updated last year
- The official project website of "NORM: Knowledge Distillation via N-to-One Representation Matching" (The paper of NORM is published in IC… ☆19 · Updated last year
- GIFT: Generative Interpretable Fine-Tuning ☆19 · Updated 3 months ago
- Code for 'Multi-level Logit Distillation' (CVPR2023) ☆58 · Updated 3 months ago
- This is the official code for paper: Token Summarisation for Efficient Vision Transformers via Graph-based Token Propagation ☆26 · Updated last year
- [Preprint] Why is the State of Neural Network Pruning so Confusing? On the Fairness, Comparison Setup, and Trainability in Network Prunin… ☆40 · Updated last year
- Official implementation of NeurIPS 2024 "Visual Fourier Prompt Tuning" ☆23 · Updated 2 weeks ago
- Official code for Scale Decoupled Distillation ☆37 · Updated 9 months ago
- [ACL 2023] PuMer: Pruning and Merging Tokens for Efficient Vision Language Models ☆28 · Updated 3 months ago
- CVPR 2023, Class Attention Transfer Based Knowledge Distillation ☆37 · Updated last year
- BESA is a differentiable weight pruning technique for large language models. ☆14 · Updated 10 months ago
- The codebase for paper "PPT: Token Pruning and Pooling for Efficient Vision Transformer" ☆20 · Updated 2 months ago
- [CVPR 2024] VkD: Improving Knowledge Distillation using Orthogonal Projections ☆50 · Updated 2 months ago
- [NeurIPS 2024] Search for Efficient LLMs ☆12 · Updated this week
- Official implementation of ParCNetV2 ☆10 · Updated 10 months ago
- 🔥MixPro: Data Augmentation with MaskMix and Progressive Attention Labeling for Vision Transformer [Official, ICLR 2023] ☆20 · Updated last year
- [CVPR 2023] Castling-ViT: Compressing Self-Attention via Switching Towards Linear-Angular Attention During Vision Transformer Inference ☆26 · Updated 10 months ago
- iLLaVA: An Image is Worth Fewer Than 1/3 Input Tokens in Large Multimodal Models ☆15 · Updated this week
- The official implementation of ImbSAM (Imbalanced-SAM) ☆23 · Updated 10 months ago
- [ICCV 2023 oral] This is the official repository for our paper: "Sensitivity-Aware Visual Parameter-Efficient Fine-Tuning". ☆65 · Updated last year
- The official project website of "Small Scale Data-Free Knowledge Distillation" (SSD-KD for short, accepted to CVPR 2024). ☆15 · Updated 7 months ago