JunLi-Galios / Optimization-on-Stiefel-Manifold-via-Cayley-Transform
Efficient Riemannian Optimization on Stiefel Manifold via Cayley Transform
☆37Updated 5 years ago
Alternatives and similar repositories for Optimization-on-Stiefel-Manifold-via-Cayley-Transform:
Users that are interested in Optimization-on-Stiefel-Manifold-via-Cayley-Transform are comparing it to the libraries listed below
- ☆58Updated last year
- Code for the article "What if Neural Networks had SVDs?", to be presented as a spotlight paper at NeurIPS 2020.☆72Updated 6 months ago
- Code base for SRSGD.☆28Updated 4 years ago
- Monotone operator equilibrium networks☆51Updated 4 years ago
- ☆36Updated 3 years ago
- Code accompanying the NeurIPS 2020 paper: WoodFisher (Singh & Alistarh, 2020)☆48Updated 3 years ago
- ☆47Updated 5 years ago
- ☆31Updated 4 years ago
- ☆63Updated 2 months ago
- Code for testing DCT plus Sparse (DCTpS) networks☆14Updated 3 years ago
- Measurements of Three-Level Hierarchical Structure in the Outliers in the Spectrum of Deepnet Hessians (ICML 2019)☆17Updated 5 years ago
- Spectral Tensor Train Parameterization of Deep Learning Layers☆15Updated 3 years ago
- Structured matrices for compressing neural networks☆66Updated last year
- Implementations of orthogonal and semi-orthogonal convolutions in the Fourier domain with applications to adversarial robustness☆43Updated 3 years ago
- Pytorch implementation of KFAC and E-KFAC (Natural Gradient).☆130Updated 5 years ago
- ☆67Updated 5 years ago
- Code to implement the AND-mask and geometric mean to do gradient based optimization, from the paper "Learning explanations that are hard …☆39Updated 4 years ago
- Distributional and Outlier Robust Optimization (ICML 2021)☆26Updated 3 years ago
- Code for the paper: "Tensor Programs II: Neural Tangent Kernel for Any Architecture"☆104Updated 4 years ago
- Visualization of mean field and neural tangent kernel regime☆21Updated 6 months ago
- The codebase for the paper "A Tail-Index Analysis of Stochastic Gradient Noise in Deep Neural Networks"☆23Updated 5 years ago
- Code for Sanity-Checking Pruning Methods: Random Tickets can Win the Jackpot☆42Updated 4 years ago
- Gradient Starvation: A Learning Proclivity in Neural Networks☆61Updated 4 years ago
- Estimating Gradients for Discrete Random Variables by Sampling without Replacement☆40Updated 5 years ago
- The Full Spectrum of Deepnet Hessians at Scale: Dynamics with SGD Training and Sample Size☆17Updated 5 years ago
- Code for the ICML 2021 and ICLR 2022 papers: Skew Orthogonal Convolutions, Improved deterministic l2 robustness on CIFAR-10 and CIFAR-100☆18Updated 3 years ago
- Limitations of the Empirical Fisher Approximation☆47Updated 4 years ago
- [JMLR] TRADES + random smoothing for certifiable robustness☆14Updated 4 years ago
- ☆89Updated 3 years ago
- Code for "The Intrinsic Dimension of Images and Its Impact on Learning" - ICLR 2021 Spotlight https://openreview.net/forum?id=XJk19XzGq2J☆65Updated 10 months ago