Official code for Cumulative Spatial Knowledge Distillation for Vision Transformers (ICCV-2023) https://openaccess.thecvf.com/content/ICCV2023/html/Zhao_Cumulative_Spatial_Knowledge_Distillation_for_Vision_Transformers_ICCV_2023_paper.html
☆15Nov 5, 2023Updated 2 years ago
Alternatives and similar repositories for CSKD
Users that are interested in CSKD are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PyTorch code and checkpoints release for OFA-KD: https://arxiv.org/abs/2310.19444☆138Apr 19, 2024Updated 2 years ago
- ☆12Oct 2, 2020Updated 5 years ago
- EfficientVLM: Fast and Accurate Vision-Language Models via Knowledge Distillation and Modal-adaptive Pruning (ACL 2023)☆34Jul 18, 2023Updated 2 years ago
- YOLOv8 Knowledge Distillation☆10Dec 28, 2024Updated last year
- Knowledge Extraction with No Observable Data (NeurIPS 2019)☆46Jan 9, 2020Updated 6 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Implementation of DeepMind's "Sobolev Training for Neural Networks"☆11Apr 2, 2018Updated 8 years ago
- ☆13Mar 10, 2023Updated 3 years ago
- A small demo for training cnn with pytorch.☆11Dec 15, 2018Updated 7 years ago
- Official implementation of "SViT: Revisiting Token Pruning for Object Detection and Instance Segmentation"☆36Dec 5, 2023Updated 2 years ago
- An open-source implementaion for fine-tuning DINOv2 by Meta.☆14Jul 21, 2025Updated 11 months ago
- Resources for paper: "NeAT: Neural Artistic Tracing for Beautiful Style Transfer"☆13Apr 11, 2023Updated 3 years ago
- ☆10Aug 8, 2021Updated 4 years ago
- Uses C-GAN for feature hallucination of missing modalities for hyperspectral data. TensorFlow implementation of ICCV '19 paper☆11Sep 9, 2020Updated 5 years ago
- ☆13Jul 19, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆16Sep 14, 2023Updated 2 years ago
- [ACL'22] Training-free Neural Architecture Search for RNNs and Transformers☆14May 26, 2024Updated 2 years ago
- Soulstyler: Using Large Language Model to Guide Image Style Transfer for Target Object☆19Dec 1, 2024Updated last year
- ☆15Oct 6, 2020Updated 5 years ago
- Incremental Object Detection with Feature Pyramid Network(FPN) and Knowledge Distillation.☆12Jan 16, 2025Updated last year
- ☆10Dec 9, 2021Updated 4 years ago
- [ICDAR 2024] (Best Student Paper🏆) Exploring Knowledge Distillation Towards Document Object Detection with Structured Graph Creation☆16Sep 6, 2024Updated last year
- Running Large Language Model easily.☆13Jun 20, 2026Updated last week
- An updated PyTorch implementation of hengyuan-hu's version for 'Bottom-Up and Top-Down Attention for Image Captioning and Visual Question…☆34Mar 13, 2026Updated 3 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Code for the CVPR'23 paper: "STMT: A Spatial-Temporal Mesh Transformer for MoCap-Based Action Recognition"☆21Dec 9, 2024Updated last year
- RAST 1.0: Restorable Arbitrary Style Transfer via Multi-restoration☆13Jun 18, 2024Updated 2 years ago
- ☆15Dec 11, 2021Updated 4 years ago
- This repository implements the paper "Effective Training of Convolutional Neural Networks with Low-bitwidth Weights and Activations"☆20Aug 30, 2021Updated 4 years ago
- Code for the paper: Graph Jigsaw Learning for Cartoon Face Recognition☆10Jul 1, 2022Updated 4 years ago
- Code of Data-Free Knowledge Distillation via Feature Exchange and Activation Region Constraint☆21Oct 23, 2023Updated 2 years ago
- Code for CVPR24 Paper - Resource-Efficient Transformer Pruning for Finetuning of Large Models☆12Oct 31, 2025Updated 8 months ago
- Code for reproducing meta-learning for cross-lingual transfer learning in NLU and QA☆13Aug 17, 2021Updated 4 years ago
- Pytorch reproduction of Peer Collaborative Learning for Online Knowledge Distillation, AAAI2021☆21May 28, 2022Updated 4 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Implementation of: Hydra Attention: Efficient Attention with Many Heads (https://arxiv.org/abs/2209.07484)☆14Jan 8, 2023Updated 3 years ago
- Official Pytorch code for "AesUST: Towards Aesthetic-Enhanced Universal Style Transfer" (ACM MM 2022)☆15Dec 31, 2022Updated 3 years ago
- Official Pytorch implementation for Multimodality-guided Image Style Transfer using Cross-modal GAN Inversion (WACV 2024).☆13Dec 24, 2024Updated last year
- [AAAI-2025 Oral] Official implementation of Multi-Teacher Knowledge Distillation with Reinforcement Learning for Visual Recognition☆43Jan 13, 2025Updated last year
- Code for the paper "Cottention: Linear Transformers With Cosine Attention"☆20Nov 15, 2025Updated 7 months ago
- ☆13Sep 24, 2023Updated 2 years ago
- ☆17Oct 7, 2022Updated 3 years ago