google-research / wide-sparse-nets
☆19Updated 4 years ago
Alternatives and similar repositories for wide-sparse-nets
Users that are interested in wide-sparse-nets are comparing it to the libraries listed below
Sorting:
- ☆37Updated 3 years ago
- ☆25Updated 4 years ago
- ☆11Updated 2 years ago
- Implementation of Kronecker Attention in Pytorch☆19Updated 4 years ago
- Code base for SRSGD.☆28Updated 5 years ago
- ☆17Updated 2 years ago
- Implementation for ACProp ( Momentum centering and asynchronous update for adaptive gradient methdos, NeurIPS 2021)☆15Updated 3 years ago
- ☆22Updated 6 years ago
- PyTorch implementation of HashedNets☆36Updated 2 years ago
- Code for the CVPR 2021 paper: Understanding Failures of Deep Networks via Robust Feature Extraction☆36Updated 2 years ago
- SGD with large step sizes learns sparse features [ICML 2023]☆32Updated 2 years ago
- Successfully training approximations to full-rank matrices for efficiency in deep learning.☆17Updated 4 years ago
- ☆41Updated 2 years ago
- ☆19Updated 3 years ago
- Prospect Pruning: Finding Trainable Weights at Initialization Using Meta-Gradients☆31Updated 3 years ago
- An adaptive training algorithm for residual network☆15Updated 4 years ago
- ☆23Updated 6 years ago
- Fine-grained ImageNet annotations☆29Updated 4 years ago
- [JMLR] TRADES + random smoothing for certifiable robustness☆14Updated 4 years ago
- Gradient Starvation: A Learning Proclivity in Neural Networks☆61Updated 4 years ago
- ☆35Updated last year
- Public Codebase for Rethinking Parameter Counting: Effective Dimensionality Revisited☆37Updated 2 years ago
- Code for Sanity-Checking Pruning Methods: Random Tickets can Win the Jackpot☆42Updated 4 years ago
- Code release to reproduce ASHA experiments from "Random Search and Reproducibility for NAS."☆22Updated 5 years ago
- ☆13Updated 3 years ago
- ☆22Updated 2 years ago
- Large-batch Training, Neural Network Optimization☆9Updated 5 years ago
- Pruning applied to Facial Recognition.☆15Updated 5 years ago
- SmoothOut: Smoothing Out Sharp Minima to Improve Generalization in Deep Learning☆23Updated 6 years ago
- Energy-Aware Neural Architecture Optimization with Fast Splitting Steepest Descent☆13Updated 5 years ago