hyeon95y / SparseLinearLinks
A custom PyTorch layer that is capable of implementing extremely wide and sparse linear layers efficiently
☆51Updated last year
Alternatives and similar repositories for SparseLinear
Users that are interested in SparseLinear are comparing it to the libraries listed below
Sorting:
- Structured matrices for compressing neural networks☆67Updated 2 years ago
- Easy-to-use AdaHessian optimizer (PyTorch)☆79Updated 4 years ago
- Study on the applicability of Direct Feedback Alignment to neural view synthesis, recommender systems, geometric learning, and natural la…☆90Updated 3 years ago
- Tensorflow implementation and notebooks for Implicit Maximum Likelihood Estimation☆67Updated 3 years ago
- Official code repository of the paper Linear Transformers Are Secretly Fast Weight Programmers.☆105Updated 4 years ago
- Code for the paper: "Tensor Programs II: Neural Tangent Kernel for Any Architecture"☆104Updated 5 years ago
- PyTorch-SSO: Scalable Second-Order methods in PyTorch☆147Updated 2 years ago
- ☆100Updated 3 years ago
- 🧀 Pytorch code for the Fromage optimiser.☆129Updated last year
- Code accompanying our paper "Feature Learning in Infinite-Width Neural Networks" (https://arxiv.org/abs/2011.14522)☆63Updated 4 years ago
- Implicit MLE: Backpropagating Through Discrete Exponential Family Distributions☆258Updated last year
- Distributed K-FAC preconditioner for PyTorch☆90Updated this week
- Implementation of deep implicit attention in PyTorch☆65Updated 4 years ago
- Euclidean Wasserstein-2 optimal transportation☆47Updated 2 years ago
- TensorLy-Torch: Deep Tensor Learning with TensorLy and PyTorch☆81Updated last year
- Differentiable Algorithms and Algorithmic Supervision.☆116Updated 2 years ago
- ☆164Updated 2 years ago
- Code for the article "What if Neural Networks had SVDs?", to be presented as a spotlight paper at NeurIPS 2020.☆76Updated last year
- Butterfly matrix multiplication in PyTorch☆173Updated 2 years ago
- Transformers with doubly stochastic attention☆48Updated 3 years ago
- {KFAC,EKFAC,Diagonal,Implicit} Fisher Matrices and finite width NTKs in PyTorch☆215Updated last month
- Differentiable Sorting Networks☆117Updated 2 years ago
- Code for: "Neural Rough Differential Equations for Long Time Series", (ICML 2021)☆118Updated 4 years ago
- Sequence Modeling with Structured State Spaces☆66Updated 3 years ago
- Pytorch implementation of VAEs for heterogeneous likelihoods.☆42Updated 2 years ago
- ☆67Updated 6 years ago
- CUDA kernels for generalized matrix-multiplication in PyTorch☆85Updated 3 years ago
- ☆50Updated 4 years ago
- Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition…☆183Updated last week
- Stochastic Automatic Differentiation library for PyTorch.☆208Updated last year