hunto / ReLoss
Official implementation for paper "Relational Surrogate Loss Learning", ICLR 2022
☆37Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for ReLoss
- (CVPR 2022) Automated Progressive Learning for Efficient Training of Vision Transformers☆25Updated 2 years ago
- ☆56Updated 3 years ago
- SMCA replication☆21Updated 3 years ago
- Official Codes and Pretrained Models for RecursiveMix☆22Updated last year
- This repo is the official megengine implementation of the ECCV2022 paper: Efficient One Pass Self-distillation with Zipf's Label Smoothin…☆25Updated 2 years ago
- Lightweight Transformer for Multi-modal Tasks☆15Updated last year
- Official Pytorch Implementation of: "Semantic Diversity Learning for Zero-Shot Multi-label Classification"(ICCV, 2021) paper☆31Updated 2 years ago
- Pytorch implementation of our paper accepted by IEEE TNNLS, 2022 -- Distilling a Powerful Student Model via Online Knowledge Distillation☆28Updated 3 years ago
- ☆19Updated last year
- Official Pytorch implementation of Super Vision Transformer (IJCV)☆42Updated last year
- Benchmarking Attention Mechanism in Vision Transformers.☆16Updated 2 years ago
- Code of our Neurips2020 paper "Auto Learning Attention", coming soon☆21Updated 3 years ago
- Data-Free Neural Architecture Search via Recursive Label Calibration. ECCV 2022.☆32Updated 2 years ago
- Codes for DATA: Differentiable ArchiTecture Approximation.☆11Updated 3 years ago
- Locally Enhanced Self-Attention: Rethinking Self-Attention as Local and Context Terms☆20Updated 2 years ago
- Paper List for In-context Learning 🌷☆20Updated last year
- ☆26Updated last year
- Code for ViTAS_Vision Transformer Architecture Search☆52Updated 3 years ago
- BESA is a differentiable weight pruning technique for large language models.☆14Updated 8 months ago
- [ICLR 2022]: Fast AdvProp☆35Updated 2 years ago
- PyTorch implementation of MLP-Mixer☆36Updated 3 years ago
- [NeurIPS'22] What Makes a "Good" Data Augmentation in Knowledge Distillation -- A Statistical Perspective☆36Updated last year
- [CVPR 2022] "The Principle of Diversity: Training Stronger Vision Transformers Calls for Reducing All Levels of Redundancy" by Tianlong C…☆25Updated 2 years ago
- Test different pooling method used in CNN for Computer Vision Task☆35Updated 3 years ago
- MMdet2-based reposity about lightweight detection model: Nanodet, PicoDet. Also including detection knowledge distillation method☆14Updated 2 years ago
- [NeurIPS 2023] Towards Free Data Selection with General-Purpose Models☆32Updated 7 months ago
- Code and models for the paper Glance-and-Gaze Vision Transformer☆28Updated 3 years ago
- Code implementation for paper "On the Efficacy of Small Self-Supervised Contrastive Models without Distillation Signals".☆16Updated 2 years ago
- Mixture of Attention Heads☆39Updated 2 years ago