HazyResearch / butterfly
Butterfly matrix multiplication in PyTorch
☆164 Updated last year
Related projects
Alternatives and complementary repositories for butterfly
- Block-sparse primitives for PyTorch ☆148 Updated 3 years ago
- Structured matrices for compressing neural networks ☆67 Updated last year
- ☆194 Updated last year
- Customized matrix multiplication kernels ☆53 Updated 2 years ago
- ☆267 Updated last week
- ☆148 Updated 5 months ago
- JMP is a Mixed Precision library for JAX. ☆187 Updated 6 months ago
- Low Precision Arithmetic Simulation in PyTorch ☆265 Updated 6 months ago
- ASDL: Automatic Second-order Differentiation Library for PyTorch ☆179 Updated 3 months ago
- ☆207 Updated 6 months ago
- A library of GPU kernels for sparse matrix operations. ☆249 Updated 3 years ago
- Distributed K-FAC Preconditioner for PyTorch ☆80 Updated this week
- PyTorch implementation of preconditioned stochastic gradient descent (affine group preconditioner, low-rank approximation preconditioner … ☆127 Updated last month
- A library for unit scaling in PyTorch ☆105 Updated 2 weeks ago
- Implementations and checkpoints for ResNet, Wide ResNet, ResNeXt, ResNet-D, and ResNeSt in JAX (Flax). ☆104 Updated 2 years ago
- Programmable Neural Network Compression ☆147 Updated 2 years ago
- Code for paper "SWALP: Stochastic Weight Averaging for Low-Precision Training". ☆62 Updated 5 years ago
- ☆11 Updated 2 years ago
- A research library for PyTorch-based neural network pruning, compression, and more. ☆160 Updated last year
- jax-triton contains integrations between JAX and OpenAI Triton ☆343 Updated 3 weeks ago
- ☆143 Updated last year
- Experiment of using Tangent to autodiff triton ☆72 Updated 9 months ago
- Training neural networks in TensorFlow 2.0 with 5x less memory ☆129 Updated 2 years ago
- Memory Optimizations for Deep Learning (ICML 2023) ☆60 Updated 8 months ago
- JAX-Toolbox ☆245 Updated this week
- ☆33 Updated last year
- PyTorch-SSO: Scalable Second-Order methods in PyTorch ☆142 Updated last year
- Accelerated First Order Parallel Associative Scan ☆163 Updated 3 months ago
- CUDA templates for tile-sparse matrix multiplication based on CUTLASS. ☆49 Updated 6 years ago
- Research and development for optimizing transformers ☆125 Updated 3 years ago