toshas / sttpLinks
Spectral Tensor Train Parameterization of Deep Learning Layers
☆15Updated 4 years ago
Alternatives and similar repositories for sttp
Users that are interested in sttp are comparing it to the libraries listed below
Sorting:
- Implementations of orthogonal and semi-orthogonal convolutions in the Fourier domain with applications to adversarial robustness☆46Updated 4 years ago
- Code for the article "What if Neural Networks had SVDs?", to be presented as a spotlight paper at NeurIPS 2020.☆75Updated 11 months ago
- Efficient Riemannian Optimization on Stiefel Manifold via Cayley Transform☆41Updated 6 years ago
- ☆30Updated 4 years ago
- Monotone operator equilibrium networks☆53Updated 5 years ago
- ☆60Updated 5 years ago
- ☆47Updated 5 years ago
- Code for the ICML 2021 and ICLR 2022 papers: Skew Orthogonal Convolutions, Improved deterministic l2 robustness on CIFAR-10 and CIFAR-100☆18Updated 3 years ago
- Padé Activation Units: End-to-end Learning of Activation Functions in Deep Neural Network☆63Updated 4 years ago
- ☆12Updated 3 years ago
- Supporting code for the paper "Dangers of Bayesian Model Averaging under Covariate Shift"☆33Updated 2 years ago
- Code for the paper: "Tensor Programs II: Neural Tangent Kernel for Any Architecture"☆105Updated 4 years ago
- [JMLR] TRADES + random smoothing for certifiable robustness☆14Updated 4 years ago
- Stochastic Gradient Langevin Dynamics for Bayesian learning☆32Updated 3 years ago
- ☆54Updated 11 months ago
- ☆59Updated 2 years ago
- Experiments for Meta-Learning Symmetries by Reparameterization☆56Updated 4 years ago
- [AAAI 2020 Oral] Low-variance Black-box Gradient Estimates for the Plackett-Luce Distribution☆38Updated 4 years ago
- Optimization with orthogonal constraints and on general manifolds☆129Updated 5 years ago
- Efficient Householder Transformation in PyTorch☆66Updated 4 years ago
- Code for Understanding and Mitigating Exploding Inverses in Invertible Neural Networks (AISTATS 2021) http://arxiv.org/abs/2006.09347☆30Updated 4 years ago
- Official Release of "Learning the Stein Discrepancy for Training and Evaluating Energy-Based Models without Sampling"☆49Updated 5 years ago
- Code accompanying our paper "Finding trainable sparse networks through Neural Tangent Transfer" to be published at ICML-2020.☆13Updated 5 years ago
- Adaptive gradient descent without descent☆48Updated 3 years ago
- [ICML'21 Oral] Improving Lossless Compression Rates via Monte Carlo Bits-Back Coding☆14Updated 4 years ago
- ☆36Updated 3 years ago
- ☆20Updated 5 years ago
- ☆40Updated 5 years ago
- Implementation of the models and datasets used in "An Information-theoretic Approach to Distribution Shifts"☆25Updated 3 years ago
- The official code for Efficient Learning of Generative Models via Finite-Difference Score Matching☆12Updated 2 years ago