HazyResearch / structured-nets
Structured matrices for compressing neural networks
☆67 Updated 2 years ago
Alternatives and similar repositories for structured-nets
Users who are interested in structured-nets are comparing it to the libraries listed below.
- CUDA kernels for generalized matrix-multiplication in PyTorch ☆85 Updated 4 years ago
- Estimating Gradients for Discrete Random Variables by Sampling without Replacement ☆40 Updated 5 years ago
- 👩 PyTorch and Jax code for the Madam optimiser. ☆53 Updated 4 years ago
- PyTorch-SSO: Scalable Second-Order methods in PyTorch ☆147 Updated 2 years ago
- Easy-to-use AdaHessian optimizer (PyTorch) ☆79 Updated 5 years ago
- ☆50 Updated 5 years ago
- Code for the article "What if Neural Networks had SVDs?", to be presented as a spotlight paper at NeurIPS 2020. ☆77 Updated last year
- Reparameterize your PyTorch modules ☆71 Updated 4 years ago
- 🧀 PyTorch code for the Fromage optimiser. ☆129 Updated last year
- Monotone operator equilibrium networks ☆53 Updated 5 years ago
- Repository containing PyTorch code for EKFAC and K-FAC preconditioners. ☆148 Updated 2 years ago
- Efficient Householder Transformation in PyTorch ☆66 Updated 4 years ago
- Code for the Thermodynamic Variational Objective ☆26 Updated 3 years ago
- This repository is no longer maintained. Check ☆81 Updated 5 years ago
- PyTorch implementation of the Power Spherical distribution ☆74 Updated last year
- Optimization with orthogonal constraints and on general manifolds ☆131 Updated 5 years ago
- A minimal implementation of a VAE with BinConcrete (relaxed Bernoulli) latent distribution in TensorFlow. ☆22 Updated 5 years ago
- Efficient Riemannian Optimization on Stiefel Manifold via Cayley Transform ☆43 Updated 6 years ago
- [AAAI 2020 Oral] Low-variance Black-box Gradient Estimates for the Plackett-Luce Distribution ☆39 Updated 4 years ago
- Padé Activation Units: End-to-end Learning of Activation Functions in Deep Neural Network ☆63 Updated 4 years ago
- Code for "Stochastic Optimization of Sorting Networks using Continuous Relaxations", ICLR 2019. ☆146 Updated 2 years ago
- Study on the applicability of Direct Feedback Alignment to neural view synthesis, recommender systems, geometric learning, and natural la… ☆89 Updated 3 years ago
- Limitations of the Empirical Fisher Approximation ☆48 Updated 8 months ago
- Hypergradient descent ☆147 Updated last year
- ☆30 Updated 5 years ago
- Paper lists and information on mean-field theory of deep learning ☆78 Updated 6 years ago
- Official code repository of the paper Linear Transformers Are Secretly Fast Weight Programmers. ☆108 Updated 4 years ago
- Code for the paper: "Tensor Programs II: Neural Tangent Kernel for Any Architecture" ☆104 Updated 5 years ago
- TensorFlow implementation and notebooks for Implicit Maximum Likelihood Estimation ☆67 Updated 3 years ago
- A custom PyTorch layer that is capable of implementing extremely wide and sparse linear layers efficiently ☆51 Updated last year