RUSH-LAB / SLIDELinks
☆471Updated 4 years ago
Alternatives and similar repositories for SLIDE
Users that are interested in SLIDE are comparing it to the libraries listed below
Sorting:
- Codebase for "SLIDE : In Defense of Smart Algorithms over Hardware Acceleration for Large-Scale Deep Learning Systems"☆1,106Updated 4 years ago
- Tensors and Dynamic neural networks in Python with strong GPU acceleration☆247Updated last week
- ☆279Updated 2 years ago
- Nod.ai 🦈 version of 👻 . You probably want to start at https://github.com/nod-ai/shark for the product and the upstream IREE repository …☆107Updated last month
- Fast Block Sparse Matrices for Pytorch☆550Updated 5 years ago
- A performant and modular runtime for TensorFlow☆753Updated 5 months ago
- 10x faster matrix and vector operations☆2,516Updated 3 years ago
- A uniform interface to run deep learning models from multiple frameworks☆940Updated 2 years ago
- Haste: a fast, simple, and open RNN library☆337Updated 2 years ago
- Bagua Speeds up PyTorch☆884Updated last year
- GPU implementation of a fast generalized ANS (asymmetric numeral system) entropy encoder and decoder, with extensions for lossless compre…☆370Updated 3 weeks ago
- [Prototype] Tools for the concurrent manipulation of variably sized Tensors.☆250Updated 3 years ago
- ☆775Updated 2 years ago
- FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/☆1,525Updated this week
- PyTorch elastic training☆728Updated 3 years ago
- Accelerate your Neural Architecture Search (NAS) through fast, reproducible and modular research.☆484Updated 2 months ago
- TensorFlow ROCm port☆699Updated this week
- Fork of TensorFlow accelerated by DirectML☆472Updated last year
- Benchmark Suite for Deep Learning☆281Updated last month
- The official page of ROCm/PyTorch will contain information that is always confusing. On this page we will endeavor to describe accurate i…☆87Updated 4 years ago
- This is a Tensor Train based compression library to compress sparse embedding tables used in large-scale machine learning models such as …☆194Updated 3 years ago
- Swarm training framework using Haiku + JAX + Ray for layer parallel transformer language models on unreliable, heterogeneous nodes☆242Updated 2 years ago
- Lightweight machine learning library based on OpenCL 1.2☆75Updated 5 years ago
- common in-memory tensor structure☆1,161Updated last week
- PyTorch interface for the IPU☆181Updated 2 years ago
- High-efficiency floating-point neural network inference operators for mobile, server, and Web☆2,245Updated this week
- Continuous builder and binary build scripts for pytorch☆356Updated 5 months ago
- A tensor-aware point-to-point communication primitive for machine learning☆283Updated last month
- Example code and applications for machine learning on Graphcore IPUs☆333Updated last year
- MADGRAD Optimization Method☆804Updated last year