toshas / sttp
Spectral Tensor Train Parameterization of Deep Learning Layers
☆15Updated 3 years ago
Alternatives and similar repositories for sttp:
Users that are interested in sttp are comparing it to the libraries listed below
- ☆30Updated 4 years ago
- Implementations of orthogonal and semi-orthogonal convolutions in the Fourier domain with applications to adversarial robustness☆44Updated 4 years ago
- Monotone operator equilibrium networks☆51Updated 4 years ago
- Code for the article "What if Neural Networks had SVDs?", to be presented as a spotlight paper at NeurIPS 2020.☆75Updated 9 months ago
- Code for the ICML 2021 and ICLR 2022 papers: Skew Orthogonal Convolutions, Improved deterministic l2 robustness on CIFAR-10 and CIFAR-100☆18Updated 3 years ago
- ☆47Updated 5 years ago
- Efficient Householder Transformation in PyTorch☆65Updated 3 years ago
- ☆32Updated 2 years ago
- Efficient Riemannian Optimization on Stiefel Manifold via Cayley Transform☆40Updated 6 years ago
- Supplementary code for the paper "Meta-Solver for Neural Ordinary Differential Equations" https://arxiv.org/abs/2103.08561☆25Updated 4 years ago
- Implicit networks can be trained efficiently and simply by using Jacobian-free Backprop (JFB).☆35Updated 3 years ago
- Supporting code for the paper "Dangers of Bayesian Model Averaging under Covariate Shift"☆33Updated 2 years ago
- Code for "'Hey, that's not an ODE:' Faster ODE Adjoints via Seminorms" (ICML 2021)☆87Updated 2 years ago
- ICLR22 "Fast Differentiable Matrix Square Root" and T-PAMI extension☆61Updated 5 months ago
- ☆53Updated 9 months ago
- ☆60Updated 4 years ago
- ☆11Updated 2 years ago
- ☆64Updated last year
- [ICML 2024] SINGD: KFAC-like Structured Inverse-Free Natural Gradient Descent (http://arxiv.org/abs/2312.05705)☆21Updated 6 months ago
- Code to accompany paper 'Bayesian Deep Ensembles via the Neural Tangent Kernel'☆26Updated 4 years ago
- Refining continuous-in-depth neural networks☆39Updated 3 years ago
- Official Release of "Learning the Stein Discrepancy for Training and Evaluating Energy-Based Models without Sampling"☆49Updated 4 years ago
- ☆28Updated 3 years ago
- Code base for SRSGD.☆28Updated 5 years ago
- Stochastic Gradient Langevin Dynamics for Bayesian learning☆31Updated 3 years ago
- Featurized Density Ratio Estimation☆20Updated 3 years ago
- ☆24Updated 4 years ago
- ☆16Updated 2 years ago
- Models and code for the ICLR 2020 workshop paper "Towards Understanding Normalization in Neural ODEs"☆16Updated 5 years ago
- Implementation of the models and datasets used in "An Information-theoretic Approach to Distribution Shifts"☆25Updated 3 years ago