Somedaywilldo / BM-NAS
BM-NAS: Bilevel Multimodal Neural Architecture Search (AAAI 2022 Oral)
☆18 · Updated 2 years ago
Alternatives and similar repositories for BM-NAS
Users interested in BM-NAS are comparing it to the repositories listed below.
- S2-BNN: Bridging the Gap Between Self-Supervised Real and 1-bit Neural Networks via Guided Distribution Calibration (CVPR 2021) ☆64 · Updated 3 years ago
- Learning Efficient Vision Transformers via Fine-Grained Manifold Distillation (NeurIPS 2022) ☆32 · Updated 2 years ago
- ☆24 · Updated 3 years ago
- ☆27 · Updated 2 years ago
- ☆27 · Updated 2 years ago
- Code for the paper "Self-Distillation from the Last Mini-Batch for Consistency Regularization" ☆41 · Updated 2 years ago
- Code for Neural Architecture Ranker and detailed cell-information datasets based on the NAS-Bench series ☆12 · Updated 3 years ago
- The official project website of "NORM: Knowledge Distillation via N-to-One Representation Matching" (The paper of NORM is published in IC… ☆20 · Updated last year
- Code for 'Multi-level Logit Distillation' (CVPR 2023) ☆67 · Updated 9 months ago
- [NeurIPS 2022] Make Sharpness-Aware Minimization Stronger: A Sparsified Perturbation Approach -- Official Implementation ☆45 · Updated 2 years ago
- ☆46 · Updated last year
- AMTML-KD: Adaptive Multi-teacher Multi-level Knowledge Distillation ☆60 · Updated 4 years ago
- [CVPR '23] PA&DA: Jointly Sampling PAth and DAta for Consistent NAS ☆36 · Updated 2 years ago
- [AAAI 2022] The official PyTorch implementation of "Less is More: Pay Less Attention in Vision Transformers" ☆97 · Updated 3 years ago
- [TPAMI 2023] Official implementation of L-MCL: Online Knowledge Distillation via Mutual Contrastive Learning for Visual Recognition ☆24 · Updated 2 years ago
- ☆27 · Updated 2 years ago
- Code for our ICLR 2022 paper "Generalizing Few-Shot NAS with Gradient Matching" ☆22 · Updated 2 years ago
- [NeurIPS '22] What Makes a "Good" Data Augmentation in Knowledge Distillation -- A Statistical Perspective ☆37 · Updated 2 years ago
- [ICLR 2022] The Unreasonable Effectiveness of Random Pruning: Return of the Most Naive Baseline for Sparse Training by Shiwei Liu, Tianlo… ☆73 · Updated 2 years ago
- The codebase for the paper "PPT: Token Pruning and Pooling for Efficient Vision Transformer" ☆24 · Updated 8 months ago
- Official PyTorch implementation of "Meta-prediction Model for Distillation-Aware NAS on Unseen Datasets" (ICLR 2023, notable top 25%) ☆24 · Updated last year
- [ICCV 2021] Official implementation of "Scalable Vision Transformers with Hierarchical Pooling" ☆33 · Updated 3 years ago
- Official Codes ☆19 · Updated last year
- ☆51 · Updated 10 months ago
- ☆22 · Updated 3 years ago
- Denoising Masked Autoencoders Help Robust Classification ☆66 · Updated 2 years ago
- PyTorch implementation of our paper accepted by IEEE TNNLS, 2022 -- Distilling a Powerful Student Model via Online Knowledge Distillation ☆29 · Updated 3 years ago
- Training ImageNet / CIFAR models with SOTA strategies and fancy techniques such as ViT, KD, Rep, etc. ☆83 · Updated last year
- A simple PyTorch reimplementation of Online Knowledge Distillation via Collaborative Learning ☆49 · Updated 2 years ago
- [CVPR 2021] CondenseNet V2: Sparse Feature Reactivation for Deep Networks ☆86 · Updated 2 years ago