harvardnlp / genbmm
CUDA kernels for generalized matrix-multiplication in PyTorch
☆79Updated 3 years ago
Alternatives and similar repositories for genbmm:
Users that are interested in genbmm are comparing it to the libraries listed below
- Structured matrices for compressing neural networks☆66Updated last year
- Estimating Gradients for Discrete Random Variables by Sampling without Replacement☆40Updated 5 years ago
- Code for the article "What if Neural Networks had SVDs?", to be presented as a spotlight paper at NeurIPS 2020.☆74Updated 7 months ago
- Repository containing Pytorch code for EKFAC and K-FAC perconditioners.☆141Updated last year
- Official code repository of the paper Linear Transformers Are Secretly Fast Weight Programmers.☆103Updated 3 years ago
- Fast Discounted Cumulative Sums in PyTorch☆95Updated 3 years ago
- Silly twitter torch implementations.☆46Updated 2 years ago
- ☆49Updated 4 years ago
- Monotone operator equilibrium networks☆51Updated 4 years ago
- Code for the ICML'20 paper "Improving Transformer Optimization Through Better Initialization"☆88Updated 4 years ago
- The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We s…☆67Updated 2 years ago
- ☆42Updated 3 years ago
- Pytorch Implemetation for our NAACL2019 Paper "Riemannian Normalizing Flow on Variational Wasserstein Autoencoder for Text Modeling" http…☆62Updated 4 years ago
- LP-SparseMAP: Differentiable sparse structured prediction in coarse factor graphs☆41Updated last year
- MAE: Mutual Posterior-Divergence Regularization for Variational AutoEncoders☆23Updated 5 years ago
- 👩 Pytorch and Jax code for the Madam optimiser.☆51Updated 4 years ago
- Easy-to-use AdaHessian optimizer (PyTorch)☆77Updated 4 years ago
- Generic PyTorch implementation of einsum that supports different semirings☆47Updated 8 months ago
- Experiment code for "Randomized Automatic Differentiation"☆66Updated 4 years ago
- [NeurIPS'19] Deep Equilibrium Models Jax Implementation☆39Updated 4 years ago
- Reparameterize your PyTorch modules☆70Updated 4 years ago
- Block-sparse primitives for PyTorch☆154Updated 3 years ago
- A public repository for our paper, Rao-Blackwellized Stochastic Gradients for Discrete Distributions☆22Updated 5 years ago
- Masked Convolutional Flow☆59Updated 4 years ago
- Code for the Thermodynamic Variational Objective☆26Updated 2 years ago
- Optimization with orthogonal constraints and on general manifolds☆127Updated 4 years ago
- Pytorch library for factorized L0-based pruning.☆44Updated last year
- Efficient reservoir sampling implementation for PyTorch☆107Updated 3 years ago
- TensorFlow implementation of "noisy K-FAC" and "noisy EK-FAC".☆60Updated 6 years ago
- Implementation of Stochastic Beam Search using Fairseq☆99Updated 5 years ago