samuelstanton / gnosis
Code to reproduce experiments from 'Does Knowledge Distillation Really Work' a paper which appeared in the 2021 NeurIPS proceedings.
☆33Updated last year
Alternatives and similar repositories for gnosis:
Users that are interested in gnosis are comparing it to the libraries listed below
- ☆57Updated 2 years ago
- Rethinking Bias-Variance Trade-off for Generalization of Neural Networks☆49Updated 4 years ago
- Code for "Just Train Twice: Improving Group Robustness without Training Group Information"☆70Updated 10 months ago
- Towards Understanding Sharpness-Aware Minimization [ICML 2022]☆35Updated 2 years ago
- Winning Solution of the NeurIPS 2020 Competition on Predicting Generalization in Deep Learning☆38Updated 3 years ago
- On the Importance of Gradients for Detecting Distributional Shifts in the Wild☆55Updated 2 years ago
- ☆19Updated 4 years ago
- ☆58Updated 3 years ago
- ☆23Updated 2 years ago
- Encodings for neural architecture search☆29Updated 3 years ago
- [ICLR 2021 Spotlight Oral] "Undistillable: Making A Nasty Teacher That CANNOT teach students", Haoyu Ma, Tianlong Chen, Ting-Kuei Hu, Che…☆81Updated 3 years ago
- [CVPR 2021] "The Lottery Tickets Hypothesis for Supervised and Self-supervised Pre-training in Computer Vision Models" Tianlong Chen, Jon…☆69Updated 2 years ago
- A Closer Look at Accuracy vs. Robustness☆88Updated 3 years ago
- [JMLR] TRADES + random smoothing for certifiable robustness☆14Updated 4 years ago
- [ICML2022] Training Your Sparse Neural Network Better with Any Mask. Ajay Jaiswal, Haoyu Ma, Tianlong Chen, ying Ding, and Zhangyang Wang☆27Updated 2 years ago
- The official PyTorch implementation - Can Neural Nets Learn the Same Model Twice? Investigating Reproducibility and Double Descent from t…☆78Updated 2 years ago
- Gradient Starvation: A Learning Proclivity in Neural Networks☆61Updated 4 years ago
- Simple data balancing baselines for worst-group-accuracy benchmarks.☆42Updated last year
- Code for our ICLR'2021 paper "DrNAS: Dirichlet Neural Architecture Search"☆44Updated 3 years ago
- [NeurIPS 2020] "Once-for-All Adversarial Training: In-Situ Tradeoff between Robustness and Accuracy for Free" by Haotao Wang*, Tianlong C…☆43Updated 3 years ago
- Official PyTorch implementation of the Fishr regularization for out-of-distribution generalization☆85Updated 2 years ago
- Official PyTorch implementation of “Flexible Dataset Distillation: Learn Labels Instead of Images”☆41Updated 4 years ago
- [ICLR'22] Self-supervised learning optimally robust representations for domain shift.☆23Updated 3 years ago
- Code for the paper "Understanding Generalization through Visualizations"☆60Updated 4 years ago
- Offical Repo for Firefly Neural Architecture Descent: a General Approach for Growing Neural Networks. Accepted by Neurips 2020.☆31Updated 4 years ago
- ☆22Updated 2 years ago
- Official implementation of paper Gradient Matching for Domain Generalization☆119Updated 3 years ago
- ICLR 2021, Fair Mixup: Fairness via Interpolation☆55Updated 3 years ago
- ☆54Updated 4 years ago
- Code for CVPR2021 paper: MOOD: Multi-level Out-of-distribution Detection☆38Updated last year