RUSH-LAB / SLIDE
☆471 Updated 3 years ago
Alternatives and similar repositories for SLIDE
Users that are interested in SLIDE are comparing it to the libraries listed below
- Codebase for "SLIDE : In Defense of Smart Algorithms over Hardware Acceleration for Large-Scale Deep Learning Systems" ☆1,100 Updated 4 years ago
- ☆74 Updated last year
- Tensors and Dynamic neural networks in Python with strong GPU acceleration ☆237 Updated this week
- Nod.ai 🦈 version of 👻 . You probably want to start at https://github.com/nod-ai/shark for the product and the upstream IREE repository … ☆106 Updated 7 months ago
- Fast Block Sparse Matrices for Pytorch ☆548 Updated 4 years ago
- ☆278 Updated 2 years ago
- GPU implementation of a fast generalized ANS (asymmetric numeral system) entropy encoder and decoder, with extensions for lossless compre… ☆346 Updated last month
- ☆771 Updated last year
- PyTorch elastic training ☆729 Updated 3 years ago
- Bagua Speeds up PyTorch ☆882 Updated last year
- A library for distributed ML training with PyTorch ☆366 Updated 2 years ago
- 10x faster matrix and vector operations ☆2,498 Updated 2 years ago
- PyTorch interface for the IPU ☆180 Updated last year
- Benchmark Suite for Deep Learning ☆272 Updated 5 months ago
- A performant and modular runtime for TensorFlow ☆758 Updated last week
- TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and sup… ☆381 Updated this week
- MADGRAD Optimization Method ☆801 Updated 6 months ago
- Mesh TensorFlow: Model Parallelism Made Easier ☆1,613 Updated last year
- Highly optimized inference engine for Binarized Neural Networks ☆251 Updated this week
- Fork of TensorFlow accelerated by DirectML ☆469 Updated 10 months ago
- Accelerate your Neural Architecture Search (NAS) through fast, reproducible and modular research. ☆480 Updated 9 months ago
- A uniform interface to run deep learning models from multiple frameworks ☆939 Updated last year
- A Python-level JIT compiler designed to make unmodified PyTorch programs faster. ☆1,056 Updated last year
- functorch is JAX-like composable function transforms for PyTorch. ☆1,436 Updated this week
- The official page of ROCm/PyTorch will contain information that is always confusing. On this page we will endeavor to describe accurate i… ☆87 Updated 4 years ago
- Library for 8-bit optimizers and quantization routines. ☆772 Updated 2 years ago
- TensorFlow ROCm port ☆694 Updated this week
- Haste: a fast, simple, and open RNN library ☆332 Updated 2 years ago
- implement AlexNet with C / convolutional neural network / machine learning / computer vision ☆191 Updated 3 years ago
- Example code and applications for machine learning on Graphcore IPUs ☆325 Updated last year