jinqixiao / ComCAT
☆16 Updated 7 months ago
Related projects:
- This repository is the implementation of the paper Training Free Pretrained Model Merging (CVPR2024). ☆22 Updated 6 months ago
- ☆39 Updated 6 months ago
- Towards Meta-Pruning via Optimal Transport, ICLR 2024 (Spotlight) ☆10 Updated 5 months ago
- Official PyTorch implementation of "Meta-prediction Model for Distillation-Aware NAS on Unseen Datasets" (ICLR 2023 notable top 25%) ☆21 Updated 6 months ago
- ☆19 Updated 3 years ago
- [NeurIPS'22] What Makes a "Good" Data Augmentation in Knowledge Distillation -- A Statistical Perspective ☆35 Updated last year
- DropIT: Dropping Intermediate Tensors for Memory-Efficient DNN Training (ICLR 2023) ☆29 Updated last year
- [ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones". ☆22 Updated 7 months ago
- ☆16 Updated last year
- PyTorch code for Q-DiT: Accurate Post-Training Quantization for Diffusion Transformers ☆27 Updated 2 weeks ago
- [NeurIPS 2022 Spotlight] This is the official PyTorch implementation of "EcoFormer: Energy-Saving Attention with Linear Complexity" ☆66 Updated last year
- (CVPR 2022) Automated Progressive Learning for Efficient Training of Vision Transformers ☆24 Updated 2 years ago
- ☆43 Updated 5 months ago
- Curated list of methods that focuses on improving the efficiency of diffusion models ☆26 Updated 2 months ago
- ☆37 Updated 7 months ago
- Implementation for <Orthogonal Over-Parameterized Training> in CVPR'21. ☆19 Updated 3 years ago
- ☆52 Updated last year
- [ICDM 2023] Momentum is All You Need for Data-Driven Adaptive Optimization ☆22 Updated 5 months ago
- Code for paper "Unsegment Anything by Simulating Deformation" (CVPR 2024) ☆21 Updated 3 months ago
- [ICLR 2024 Spotlight] This is the official PyTorch implementation of "EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Di… ☆49 Updated 3 months ago
- A torch-based implementation of K-Means and K-Means++ ☆17 Updated 3 years ago
- [ICLR 2022] "As-ViT: Auto-scaling Vision Transformers without Training" by Wuyang Chen, Wei Huang, Xianzhi Du, Xiaodan Song, Zhangyang Wa… ☆76 Updated 2 years ago
- Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching ☆56 Updated 2 months ago
- ☆11 Updated 3 months ago
- The official implementation of the paper "Reducing Fine-Tuning Memory Overhead by Approximate and Memory-Sharing Backpropagation" ☆15 Updated 2 months ago
- Code implementation for paper "On the Efficacy of Small Self-Supervised Contrastive Models without Distillation Signals". ☆16 Updated 2 years ago
- Minimal multi-gpu implementation of EDM2: "Analyzing and Improving the Training Dynamics of Diffusion Models" ☆24 Updated 6 months ago
- [ICLR 2024] Official code for the paper 'Elucidating the Exposure Bias in Diffusion Models' ☆21 Updated 4 months ago
- [ICLR 2022] "Anti-Oversmoothing in Deep Vision Transformers via the Fourier Domain Analysis: From Theory to Practice" by Peihao Wang, Wen… ☆76 Updated 8 months ago