Official implementation for paper "Knowledge Diffusion for Distillation", NeurIPS 2023
☆95 · Jan 24, 2024 · Updated 2 years ago
Alternatives and similar repositories for DiffKD
Users who are interested in DiffKD are comparing it to the repositories listed below.
- Official implementation of paper "Knowledge Distillation from A Stronger Teacher", NeurIPS 2022 · ☆156 · Dec 28, 2022 · Updated 3 years ago
- ☆17 · Aug 8, 2024 · Updated last year
- Official implementation of paper "Masked Distillation with Receptive Tokens", ICLR 2023. · ☆10 · Mar 13, 2023 · Updated 3 years ago
- ☆21 · Jan 16, 2026 · Updated 4 months ago
- PyTorch code and checkpoints release for OFA-KD: https://arxiv.org/abs/2310.19444 · ☆138 · Apr 19, 2024 · Updated 2 years ago
- Official PyTorch Code for "Is Synthetic Data From Diffusion Models Ready for Knowledge Distillation?" (https://arxiv.org/abs/2305.12954) · ☆47 · Dec 3, 2023 · Updated 2 years ago
- Source Code for "Dual-Level Knowledge Distillation via Knowledge Alignment and Correlation", TNNLS, https://ieeexplore.ieee.org/abstract/… · ☆12 · Dec 21, 2022 · Updated 3 years ago
- The official project website of "NORM: Knowledge Distillation via N-to-One Representation Matching" (The paper of NORM is published in IC… · ☆20 · Sep 18, 2023 · Updated 2 years ago
- Masked Generative Distillation (ECCV 2022) · ☆242 · Nov 9, 2022 · Updated 3 years ago
- Training ImageNet / CIFAR models with sota strategies and fancy techniques such as ViT, KD, Rep, etc. · ☆88 · Mar 20, 2024 · Updated 2 years ago
- tmp DPI · ☆14 · Dec 18, 2024 · Updated last year
- Code of Data-Free Knowledge Distillation via Feature Exchange and Activation Region Constraint · ☆21 · Oct 23, 2023 · Updated 2 years ago
- ☆32 · May 31, 2024 · Updated 2 years ago
- Implementation of: Hydra Attention: Efficient Attention with Many Heads (https://arxiv.org/abs/2209.07484) · ☆14 · Jan 8, 2023 · Updated 3 years ago
- Official implementation of "SViT: Revisiting Token Pruning for Object Detection and Instance Segmentation" · ☆36 · Dec 5, 2023 · Updated 2 years ago
- ☆22 · Apr 30, 2022 · Updated 4 years ago
- Official Implementation of DOLG (ICCV 2021) · ☆23 · Aug 3, 2021 · Updated 4 years ago
- ☆38 · Sep 3, 2023 · Updated 2 years ago
- This repository is for the paper "Is BERT Blind? Exploring the Effect of Vision-and-Language Pretraining on Visual Language Understanding… · ☆21 · Nov 2, 2023 · Updated 2 years ago
- Switchable Online Knowledge Distillation · ☆19 · Oct 27, 2024 · Updated last year
- The PyTorch implementation of DANN (Domain-Adversarial Training of Neural Networks). · ☆10 · Dec 4, 2023 · Updated 2 years ago
- Code for 'Multi-level Logit Distillation' (CVPR2023) · ☆70 · Sep 23, 2024 · Updated last year
- Official PyTorch implementation of The Linear Attention Resurrection in Vision Transformer · ☆15 · Sep 7, 2024 · Updated last year
- A general binary code optimization method · ☆13 · Oct 9, 2016 · Updated 9 years ago
- CVPR 2023, Class Attention Transfer Based Knowledge Distillation · ☆45 · Jun 13, 2023 · Updated 3 years ago
- Anomaly localization using autoencoder models in the feature space of a ResNet · ☆20 · Feb 8, 2023 · Updated 3 years ago
- ☆11 · Nov 18, 2024 · Updated last year
- Code for the paper "No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations" · ☆12 · Oct 31, 2024 · Updated last year
- Source code for the BMVC-2021 paper "SimReg: Regression as a Simple Yet Effective Tool for Self-supervised Knowledge Distillation". · ☆16 · Jan 20, 2022 · Updated 4 years ago
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment · ☆17 · Dec 19, 2024 · Updated last year
- [NeurIPS'24] Official implementation of paper "Unveiling the Tapestry of Consistency in Large Vision-Language Models". · ☆39 · Oct 23, 2024 · Updated last year
- ☆18 · Nov 15, 2023 · Updated 2 years ago
- Benchmarks for Macro Neural Architecture Search; used and described in the paper "Local Search is a Remarkably Strong Baseline for Neural… · ☆13 · Jul 25, 2024 · Updated last year
- [NeurIPS25] RULE: Reinforcement UnLEarning Achieves Forget-retain Pareto Optimality · ☆21 · Oct 22, 2025 · Updated 7 months ago
- Implementation of a Hierarchical Mamba as described in the paper: "Hierarchical State Space Models for Continuous Sequence-to-Sequence Mo… · ☆15 · Nov 11, 2024 · Updated last year
- [ACL'22] Training-free Neural Architecture Search for RNNs and Transformers · ☆14 · May 26, 2024 · Updated 2 years ago
- ☆27 · Jun 28, 2022 · Updated 3 years ago
- Code for ViTAS_Vision Transformer Architecture Search · ☆50 · Jul 22, 2021 · Updated 4 years ago
- Awesome-4D-Radar · ☆12 · Feb 17, 2024 · Updated 2 years ago