hyeon95y / SparseLinearLinks
A custom PyTorch layer that is capable of implementing extremely wide and sparse linear layers efficiently
☆51Updated last year
Alternatives and similar repositories for SparseLinear
Users that are interested in SparseLinear are comparing it to the libraries listed below
Sorting:
- Structured matrices for compressing neural networks☆67Updated last year
- Tensorflow implementation and notebooks for Implicit Maximum Likelihood Estimation☆67Updated 3 years ago
- Easy-to-use AdaHessian optimizer (PyTorch)☆79Updated 4 years ago
- Study on the applicability of Direct Feedback Alignment to neural view synthesis, recommender systems, geometric learning, and natural la…☆90Updated 3 years ago
- Official code repository of the paper Linear Transformers Are Secretly Fast Weight Programmers.☆105Updated 4 years ago
- Code accompanying our paper "Feature Learning in Infinite-Width Neural Networks" (https://arxiv.org/abs/2011.14522)☆62Updated 4 years ago
- Code for the paper: "Tensor Programs II: Neural Tangent Kernel for Any Architecture"☆105Updated 5 years ago
- Differentiable Algorithms and Algorithmic Supervision.☆116Updated 2 years ago
- PyTorch-SSO: Scalable Second-Order methods in PyTorch☆147Updated last year
- [AAAI 2020 Oral] Low-variance Black-box Gradient Estimates for the Plackett-Luce Distribution☆38Updated 4 years ago
- Distributed K-FAC preconditioner for PyTorch☆89Updated last week
- ☆50Updated 4 years ago
- Efficient Riemannian Optimization on Stiefel Manifold via Cayley Transform☆42Updated 6 years ago
- Stochastic Automatic Differentiation library for PyTorch.☆206Updated 11 months ago
- Transformers with doubly stochastic attention☆47Updated 2 years ago
- 🧀 Pytorch code for the Fromage optimiser.☆126Updated last year
- Code release to accompany paper "Geometry-Aware Gradient Algorithms for Neural Architecture Search."☆25Updated 4 years ago
- TensorLy-Torch: Deep Tensor Learning with TensorLy and PyTorch☆80Updated last year
- Visualizing the the loss landscape of Fully-Connected Neural Networks☆46Updated 2 years ago
- {KFAC,EKFAC,Diagonal,Implicit} Fisher Matrices and finite width NTKs in PyTorch☆215Updated 2 months ago
- Implicit MLE: Backpropagating Through Discrete Exponential Family Distributions☆258Updated last year
- Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition…☆179Updated this week
- Implementation of deep implicit attention in PyTorch☆65Updated 4 years ago
- Fast Discounted Cumulative Sums in PyTorch☆96Updated 4 years ago
- Meta Optimal Transport☆103Updated 2 years ago
- ☆67Updated 6 years ago
- Implementation of "Gradients without backpropagation" paper (https://arxiv.org/abs/2202.08587) using functorch☆111Updated 2 years ago
- Explores the ideas presented in Deep Ensembles: A Loss Landscape Perspective (https://arxiv.org/abs/1912.02757) by Stanislav Fort, Huiyi …☆65Updated 5 years ago
- ☆47Updated last year
- Butterfly matrix multiplication in PyTorch☆174Updated last year