HazyResearch / butterfly
Butterfly matrix multiplication in PyTorch
☆169Updated last year
Alternatives and similar repositories for butterfly:
Users that are interested in butterfly are comparing it to the libraries listed below
- Block-sparse primitives for PyTorch☆154Updated 4 years ago
- ☆202Updated 2 years ago
- Distributed K-FAC Preconditioner for PyTorch☆85Updated this week
- Structured matrices for compressing neural networks☆66Updated last year
- Code for paper "SWALP: Stochastic Weight Averaging forLow-Precision Training".☆62Updated 5 years ago
- Customized matrix multiplication kernels☆54Updated 3 years ago
- Low Precision Arithmetic Simulation in PyTorch☆274Updated 10 months ago
- ASDL: Automatic Second-order Differentiation Library for PyTorch☆185Updated 4 months ago
- ☆83Updated 5 years ago
- ☆224Updated 2 months ago
- ☆163Updated 10 months ago
- Easy-to-use AdaHessian optimizer (PyTorch)☆78Updated 4 years ago
- End-to-end training of sparse deep neural networks with little-to-no performance loss.☆321Updated 2 years ago
- ☆36Updated 4 months ago
- Code for the article "What if Neural Networks had SVDs?", to be presented as a spotlight paper at NeurIPS 2020.☆75Updated 8 months ago
- ADAHESSIAN: An Adaptive Second Order Optimizer for Machine Learning☆274Updated 2 years ago
- DeepOBS: A Deep Learning Optimizer Benchmark Suite☆106Updated last year
- Training neural networks in TensorFlow 2.0 with 5x less memory☆130Updated 3 years ago
- A library for unit scaling in PyTorch☆125Updated 4 months ago
- Code accompanying the NeurIPS 2020 paper: WoodFisher (Singh & Alistarh, 2020)☆49Updated 4 years ago
- Implementation for the paper "Latent Weights Do Not Exist: Rethinking Binarized Neural Network Optimization"☆74Updated 5 years ago
- JMP is a Mixed Precision library for JAX.☆194Updated 2 months ago
- A research library for pytorch-based neural network pruning, compression, and more.☆160Updated 2 years ago
- ☆295Updated this week
- Bibtex for Sparsity in Deep Learning paper (https://arxiv.org/abs/2102.00554) - open for pull requests☆45Updated 2 years ago
- CUDA templates for tile-sparse matrix multiplication based on CUTLASS.☆51Updated 7 years ago
- Deep learning with a multiplication budget☆47Updated 6 years ago
- Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition…☆171Updated last week
- ☆10Updated 3 years ago
- Code for "Picking Winning Tickets Before Training by Preserving Gradient Flow" https://openreview.net/pdf?id=SkgsACVKPH☆103Updated 5 years ago