hyeon95y / SparseLinear
A custom PyTorch layer that is capable of implementing extremely wide and sparse linear layers efficiently
☆48Updated 11 months ago
Related projects ⓘ
Alternatives and complementary repositories for SparseLinear
- Structured matrices for compressing neural networks☆67Updated last year
- Sequence Modeling with Structured State Spaces☆60Updated 2 years ago
- Tensorflow implementation and notebooks for Implicit Maximum Likelihood Estimation☆68Updated 2 years ago
- Easy-to-use AdaHessian optimizer (PyTorch)☆77Updated 4 years ago
- Official code repository of the paper Linear Transformers Are Secretly Fast Weight Programmers.☆100Updated 3 years ago
- Code for the paper: "Tensor Programs II: Neural Tangent Kernel for Any Architecture"☆97Updated 4 years ago
- Implementation of deep implicit attention in PyTorch☆63Updated 3 years ago
- The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We s…☆66Updated last year
- Pytorch implementation of preconditioned stochastic gradient descent (affine group preconditioner, low-rank approximation preconditioner …☆128Updated last month
- Meta-learning inductive biases in the form of useful conserved quantities.☆37Updated 2 years ago
- A GPT, made only of MLPs, in Jax☆55Updated 3 years ago
- ☆49Updated 4 years ago
- ☆15Updated 4 years ago
- Code base for SRSGD.☆28Updated 4 years ago
- Differentiable Algorithms and Algorithmic Supervision.☆105Updated last year
- TensorLy-Torch: Deep Tensor Learning with TensorLy and PyTorch☆76Updated 5 months ago
- Implementations and checkpoints for ResNet, Wide ResNet, ResNeXt, ResNet-D, and ResNeSt in JAX (Flax).☆104Updated 2 years ago
- Euclidean Wasserstein-2 optimal transportation☆42Updated last year
- Code release to accompany paper "Geometry-Aware Gradient Algorithms for Neural Architecture Search."☆23Updated 4 years ago
- Fast Discounted Cumulative Sums in PyTorch☆95Updated 3 years ago
- Pytorch library for factorized L0-based pruning.☆43Updated last year
- ☆46Updated last month
- [ICML'21 Oral] Improving Lossless Compression Rates via Monte Carlo Bits-Back Coding☆14Updated 3 years ago
- Official code for UnICORNN (ICML 2021)☆27Updated 3 years ago
- CUDA kernels for generalized matrix-multiplication in PyTorch☆79Updated 3 years ago
- Block-sparse primitives for PyTorch☆148Updated 3 years ago
- Sequence Modeling with Multiresolution Convolutional Memory (ICML 2023)☆120Updated last year
- 👩 Pytorch and Jax code for the Madam optimiser.☆51Updated 3 years ago
- Estimating Gradients for Discrete Random Variables by Sampling without Replacement☆39Updated 4 years ago
- ☆97Updated 2 years ago