Somedaywilldo / BM-NASLinks
BM-NAS: Bilevel Multimodal Neural Architecture Search (AAAI 2022 Oral)
☆19Updated 2 years ago
Alternatives and similar repositories for BM-NAS
Users that are interested in BM-NAS are comparing it to the libraries listed below
Sorting:
- Learning Efficient Vision Transformers via Fine-Grained Manifold Distillation. NeurIPS 2022.☆32Updated 2 years ago
- AMTML-KD: Adaptive Multi-teacher Multi-level Knowledge Distillation☆62Updated 4 years ago
- [TPAMI-2023] Official implementations of L-MCL: Online Knowledge Distillation via Mutual Contrastive Learning for Visual Recognition☆26Updated 2 years ago
- codes for Neural Architecture Ranker and detailed cell information datasets based on NAS-Bench series☆12Updated 3 years ago
- ☆27Updated 2 years ago
- ☆27Updated 3 years ago
- S2-BNN: Bridging the Gap Between Self-Supervised Real and 1-bit Neural Networks via Guided Distribution Calibration (CVPR 2021)☆64Updated 4 years ago
- [AAAI 2022] This is the official PyTorch implementation of "Less is More: Pay Less Attention in Vision Transformers"☆97Updated 3 years ago
- ☆37Updated 3 years ago
- Official implementation of paper "Knowledge Distillation from A Stronger Teacher", NeurIPS 2022☆149Updated 2 years ago
- Code for Paper "Self-Distillation from the Last Mini-Batch for Consistency Regularization"☆43Updated 3 years ago
- Official implementation for "Knowledge Distillation with Refined Logits".☆16Updated last year
- Code for 'Multi-level Logit Distillation' (CVPR2023)☆69Updated last year
- The offical implementation of [NeurIPS2024] Wasserstein Distance Rivals Kullback-Leibler Divergence for Knowledge Distillation https://ar…☆44Updated 9 months ago
- Implementation of AAAI 2022 Paper: Go wider instead of deeper☆32Updated 2 years ago
- Official PyTorch implementation of "Meta-prediction Model for Distillation-Aware NAS on Unseen Datasets" (ICLR 2023 notable top 25%)☆26Updated last year
- This is the official code for paper: Token Summarisation for Efficient Vision Transformers via Graph-based Token Propagation☆31Updated last year
- A pure PyTorch implementation of kmeans and GMM with distributed clustering.☆48Updated 2 years ago
- Code for our ICLR'2022 paper "Generalizing Few-Shot NAS with Gradient Matching"☆22Updated 2 years ago
- ☆47Updated 2 years ago
- Code for paper Trustworthy Multimodal Regression with Mixture of Normal-inverse Gamma Distributions.☆47Updated last year
- Pytorch implementation of our paper accepted by IEEE TNNLS, 2022 -- Distilling a Powerful Student Model via Online Knowledge Distillation☆30Updated 3 years ago
- Information Bottleneck Approach to Spatial Attention Learning, IJCAI2021☆15Updated 4 years ago
- A pytorch implementation of paper 'Be Your Own Teacher: Improve the Performance of Convolutional Neural Networks via Self Distillation', …☆181Updated 3 years ago
- Code for "Dual Focal Loss for Calibration" (ICML 2023)☆32Updated 5 months ago
- PyTorch implementation for Partially View-aligned Representation Learning with Noise-robust Contrastive Loss (CVPR 2021)☆50Updated 3 years ago
- Recent Advances in MLP-based Models (MLP is all you need!)☆116Updated 2 years ago
- [AAAI 2023] Official PyTorch Code for "Curriculum Temperature for Knowledge Distillation"☆179Updated 10 months ago
- Auto-Prox-AAAI24☆13Updated last year
- [ICLR 2022] "Anti-Oversmoothing in Deep Vision Transformers via the Fourier Domain Analysis: From Theory to Practice" by Peihao Wang, Wen…☆81Updated last year