toshas / sttp
Spectral Tensor Train Parameterization of Deep Learning Layers
☆16 Updated 4 years ago
Alternatives and similar repositories for sttp
Users who are interested in sttp are comparing it to the libraries listed below.
- Implementations of orthogonal and semi-orthogonal convolutions in the Fourier domain with applications to adversarial robustness ☆48 Updated 4 years ago
- Code for the article "What if Neural Networks had SVDs?", to be presented as a spotlight paper at NeurIPS 2020. ☆77 Updated last year
- Code for the ICML 2021 and ICLR 2022 papers: Skew Orthogonal Convolutions, Improved deterministic l2 robustness on CIFAR-10 and CIFAR-100 ☆18 Updated 3 years ago
- Efficient Riemannian Optimization on Stiefel Manifold via Cayley Transform ☆43 Updated 6 years ago
- Official Release of "Learning the Stein Discrepancy for Training and Evaluating Energy-Based Models without Sampling" ☆49 Updated 5 years ago
- Code base for SRSGD. ☆28 Updated 5 years ago
- ☆59 Updated 2 years ago
- Padé Activation Units: End-to-end Learning of Activation Functions in Deep Neural Network ☆64 Updated 4 years ago
- Monotone operator equilibrium networks ☆54 Updated 5 years ago
- ☆54 Updated last year
- Code accompanying our paper "Finding trainable sparse networks through Neural Tangent Transfer" to be published at ICML-2020. ☆13 Updated 5 years ago
- ☆30 Updated 5 years ago
- ☆64 Updated last year
- PyTorch implementation of FIM and empirical FIM ☆60 Updated 7 years ago
- ☆35 Updated 4 years ago
- ☆47 Updated 6 years ago
- [ICML'21 Oral] Improving Lossless Compression Rates via Monte Carlo Bits-Back Coding ☆14 Updated 4 years ago
- ☆59 Updated 5 years ago
- Differentiable Optimizers with Perturbations in Pytorch ☆69 Updated 4 years ago
- ☆12 Updated 3 years ago
- ☆42 Updated 2 years ago
- Efficient Householder Transformation in PyTorch ☆69 Updated 4 years ago
- Stochastic Gradient Langevin Dynamics for Bayesian learning ☆35 Updated 4 years ago
- [JMLR] TRADES + random smoothing for certifiable robustness ☆14 Updated 5 years ago
- Gradient Starvation: A Learning Proclivity in Neural Networks ☆61 Updated 4 years ago
- Implementation of the models and datasets used in "An Information-theoretic Approach to Distribution Shifts" ☆25 Updated 4 years ago
- Adaptive gradient descent without descent ☆50 Updated 4 years ago
- Code for Understanding and Mitigating Exploding Inverses in Invertible Neural Networks (AISTATS 2021) http://arxiv.org/abs/2006.09347 ☆30 Updated 5 years ago
- Implementation of Methods Proposed in Preventing Gradient Attenuation in Lipschitz Constrained Convolutional Networks (NeurIPS 2019) ☆36 Updated 5 years ago
- NeurIPS 2021, Code for Measuring Generalization with Optimal Transport ☆28 Updated 4 years ago