RUSH-LAB / SLIDELinks
☆470Updated 3 years ago
Alternatives and similar repositories for SLIDE
Users that are interested in SLIDE are comparing it to the libraries listed below
Sorting:
- Codebase for "SLIDE : In Defense of Smart Algorithms over Hardware Acceleration for Large-Scale Deep Learning Systems"☆1,101Updated 4 years ago
- ☆74Updated last year
- ☆278Updated 2 years ago
- Fast Block Sparse Matrices for Pytorch☆547Updated 4 years ago
- Nod.ai 🦈 version of 👻 . You probably want to start at https://github.com/nod-ai/shark for the product and the upstream IREE repository …☆106Updated 9 months ago
- 10x faster matrix and vector operations☆2,501Updated 3 years ago
- Tensors and Dynamic neural networks in Python with strong GPU acceleration☆244Updated last week
- A performant and modular runtime for TensorFlow☆760Updated last month
- A thin, highly portable toolkit for efficiently compiling dense loop-based computation.☆148Updated 2 years ago
- Library for 8-bit optimizers and quantization routines.☆779Updated 3 years ago
- ☆772Updated last year
- Accelerate your Neural Architecture Search (NAS) through fast, reproducible and modular research.☆481Updated 11 months ago
- Lightweight machine learning library based on OpenCL 1.2☆75Updated 4 years ago
- Bagua Speeds up PyTorch☆883Updated last year
- PyTorch, TensorFlow, JAX and NumPy — all of them natively using the same code☆697Updated 2 years ago
- A library for distributed ML training with PyTorch☆367Updated 2 years ago
- MADGRAD Optimization Method☆804Updated 8 months ago
- PatrickStar enables Larger, Faster, Greener Pretrained Models for NLP and democratizes AI for everyone.☆766Updated 2 years ago
- functorch is JAX-like composable function transforms for PyTorch.☆1,436Updated 2 months ago
- Mesh TensorFlow: Model Parallelism Made Easier☆1,617Updated last year
- PyTorch elastic training☆730Updated 3 years ago
- PyTorch interface for the IPU☆181Updated 2 years ago
- [Prototype] Tools for the concurrent manipulation of variably sized Tensors.☆251Updated 2 years ago
- FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/☆1,453Updated this week
- Reinforcement learning environments for compiler and program optimization tasks☆975Updated last year
- The official page of ROCm/PyTorch will contain information that is always confusing. On this page we will endeavor to describe accurate i…☆87Updated 4 years ago
- GPU implementation of a fast generalized ANS (asymmetric numeral system) entropy encoder and decoder, with extensions for lossless compre…☆353Updated 4 months ago
- Haste: a fast, simple, and open RNN library☆333Updated 2 years ago
- A Python-level JIT compiler designed to make unmodified PyTorch programs faster.☆1,062Updated last year
- tree is a library for working with nested data structures☆1,009Updated 8 months ago