samuelstanton / gnosis
Code to reproduce experiments from 'Does Knowledge Distillation Really Work' a paper which appeared in the 2021 NeurIPS proceedings.
☆33Updated last year
Alternatives and similar repositories for gnosis:
Users that are interested in gnosis are comparing it to the libraries listed below
- ☆57Updated 2 years ago
- Code for the paper "Efficient Dataset Distillation using Random Feature Approximation"☆37Updated 2 years ago
- DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization☆30Updated 2 years ago
- Simple data balancing baselines for worst-group-accuracy benchmarks.☆42Updated last year
- Rethinking Bias-Variance Trade-off for Generalization of Neural Networks☆49Updated 4 years ago
- ☆34Updated 2 weeks ago
- Model Zoos for Continual Learning (ICLR 22)☆45Updated last year
- Offical Repo for Firefly Neural Architecture Descent: a General Approach for Growing Neural Networks. Accepted by Neurips 2020.☆32Updated 4 years ago
- Official PyTorch implementation of "Meta-Calibration: Learning of Model Calibration Using Differentiable Expected Calibration Error"☆34Updated last year
- Official PyTorch implementation of the Fishr regularization for out-of-distribution generalization☆86Updated 2 years ago
- Gradient Starvation: A Learning Proclivity in Neural Networks☆61Updated 4 years ago
- Code for CVPR2021 paper: MOOD: Multi-level Out-of-distribution Detection☆38Updated last year
- Predicting Out-of-Distribution Error with the Projection Norm☆17Updated 2 years ago
- Towards Understanding Sharpness-Aware Minimization [ICML 2022]☆35Updated 2 years ago
- [ICLR'22] Self-supervised learning optimally robust representations for domain shift.☆23Updated 3 years ago
- Code for "Just Train Twice: Improving Group Robustness without Training Group Information"☆71Updated 11 months ago
- Paper and Code for "Curriculum Learning by Optimizing Learning Dynamics" (AISTATS 2021)☆19Updated 3 years ago
- Source code of "Hold me tight! Influence of discriminative features on deep network boundaries"☆22Updated 3 years ago
- ☆58Updated 3 years ago
- On the Importance of Gradients for Detecting Distributional Shifts in the Wild☆56Updated 2 years ago
- [CVPR 2021] "The Lottery Tickets Hypothesis for Supervised and Self-supervised Pre-training in Computer Vision Models" Tianlong Chen, Jon…☆69Updated 2 years ago
- Active and Sample-Efficient Model Evaluation☆24Updated 4 years ago
- Github code for the paper Maximum Class Separation as Inductive Bias in One Matrix. Arxiv link: https://arxiv.org/abs/2206.08704☆29Updated 2 years ago
- [Re] Can gradient clipping mitigate label noise? (ML Reproducibility Challenge 2020)☆14Updated 8 months ago
- [ICML 2021] "Efficient Lottery Ticket Finding: Less Data is More" by Zhenyu Zhang*, Xuxi Chen*, Tianlong Chen*, Zhangyang Wang☆25Updated 3 years ago
- PRIME: A Few Primitives Can Boost Robustness to Common Corruptions☆42Updated 2 years ago
- Code for the ICML 2021 paper "Bridging Multi-Task Learning and Meta-Learning: Towards Efficient Training and Effective Adaptation", Haoxi…☆68Updated 3 years ago
- Deep Learning & Information Bottleneck☆60Updated last year
- Encodings for neural architecture search☆29Updated 4 years ago
- [ICLR 2021 Spotlight Oral] "Undistillable: Making A Nasty Teacher That CANNOT teach students", Haoyu Ma, Tianlong Chen, Ting-Kuei Hu, Che…☆81Updated 3 years ago