hyeon95y / SparseLinearLinks
A custom PyTorch layer that is capable of implementing extremely wide and sparse linear layers efficiently
☆50Updated last year
Alternatives and similar repositories for SparseLinear
Users that are interested in SparseLinear are comparing it to the libraries listed below
Sorting:
- Structured matrices for compressing neural networks☆67Updated last year
- Easy-to-use AdaHessian optimizer (PyTorch)☆79Updated 4 years ago
- Official code repository of the paper Linear Transformers Are Secretly Fast Weight Programmers.☆105Updated 4 years ago
- Tensorflow implementation and notebooks for Implicit Maximum Likelihood Estimation☆67Updated 3 years ago
- Code for the paper: "Tensor Programs II: Neural Tangent Kernel for Any Architecture"☆105Updated 4 years ago
- 🧀 Pytorch code for the Fromage optimiser.☆125Updated last year
- Study on the applicability of Direct Feedback Alignment to neural view synthesis, recommender systems, geometric learning, and natural la…☆89Updated 3 years ago
- Implementation of deep implicit attention in PyTorch☆65Updated 4 years ago
- PyTorch-SSO: Scalable Second-Order methods in PyTorch☆147Updated last year
- Code accompanying our paper "Feature Learning in Infinite-Width Neural Networks" (https://arxiv.org/abs/2011.14522)☆62Updated 4 years ago
- [ICML'21 Oral] Improving Lossless Compression Rates via Monte Carlo Bits-Back Coding☆14Updated 4 years ago
- Official code for UnICORNN (ICML 2021)☆27Updated 3 years ago
- Fast Discounted Cumulative Sums in PyTorch☆96Updated 3 years ago
- Code for the article "What if Neural Networks had SVDs?", to be presented as a spotlight paper at NeurIPS 2020.☆75Updated last year
- ☆50Updated 4 years ago
- Distributed K-FAC preconditioner for PyTorch☆89Updated this week
- Implicit MLE: Backpropagating Through Discrete Exponential Family Distributions☆258Updated last year
- ☆100Updated 3 years ago
- Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition…☆180Updated last week
- A TensorFlow implementation of the paper 'Set Transformer: A Framework for Attention-based Permutation-Invariant Neural Networks'☆31Updated last year
- Differentiable Sorting Networks☆117Updated last year
- [AAAI 2020 Oral] Low-variance Black-box Gradient Estimates for the Plackett-Luce Distribution☆38Updated 4 years ago
- Butterfly matrix multiplication in PyTorch☆174Updated last year
- ☆54Updated last year
- ☆67Updated 6 years ago
- Differentiable Algorithms and Algorithmic Supervision.☆115Updated 2 years ago
- Transformers with doubly stochastic attention☆46Updated 2 years ago
- Very deep VAEs in JAX/Flax☆46Updated 4 years ago
- Neural Turing Machines in pytorch☆48Updated 3 years ago
- Official repository for the paper "Going Beyond Linear Transformers with Recurrent Fast Weight Programmers" (NeurIPS 2021)☆49Updated last month