RUSH-LAB / SLIDELinks
☆471Updated 4 years ago
Alternatives and similar repositories for SLIDE
Users that are interested in SLIDE are comparing it to the libraries listed below
Sorting:
- Codebase for "SLIDE : In Defense of Smart Algorithms over Hardware Acceleration for Large-Scale Deep Learning Systems"☆1,106Updated 4 years ago
- Tensors and Dynamic neural networks in Python with strong GPU acceleration☆247Updated this week
- 10x faster matrix and vector operations☆2,516Updated 3 years ago
- A uniform interface to run deep learning models from multiple frameworks☆940Updated 2 years ago
- ☆279Updated 2 years ago
- Fork of TensorFlow accelerated by DirectML☆472Updated last year
- A performant and modular runtime for TensorFlow☆753Updated 5 months ago
- tree is a library for working with nested data structures☆1,016Updated last week
- Nod.ai 🦈 version of 👻 . You probably want to start at https://github.com/nod-ai/shark for the product and the upstream IREE repository …☆107Updated last month
- Fast Block Sparse Matrices for Pytorch☆550Updated 5 years ago
- ☆775Updated 2 years ago
- PyTorch elastic training☆728Updated 3 years ago
- The official page of ROCm/PyTorch will contain information that is always confusing. On this page we will endeavor to describe accurate i…☆87Updated 4 years ago
- MADGRAD Optimization Method☆804Updated last year
- PyTorch, TensorFlow, JAX and NumPy — all of them natively using the same code☆700Updated 2 years ago
- A library for distributed ML training with PyTorch☆366Updated 3 years ago
- Continuous builder and binary build scripts for pytorch☆356Updated 5 months ago
- Swarm training framework using Haiku + JAX + Ray for layer parallel transformer language models on unreliable, heterogeneous nodes☆242Updated 2 years ago
- Haste: a fast, simple, and open RNN library☆337Updated 2 years ago
- Bagua Speeds up PyTorch☆884Updated last year
- GPU implementation of a fast generalized ANS (asymmetric numeral system) entropy encoder and decoder, with extensions for lossless compre…☆370Updated 3 weeks ago
- Example code and applications for machine learning on Graphcore IPUs☆333Updated last year
- Tuplex is a parallel big data processing framework that runs data science pipelines written in Python at the speed of compiled code. Tupl…☆816Updated 5 months ago
- Large Model Support in Tensorflow☆202Updated 5 years ago
- common in-memory tensor structure☆1,161Updated last week
- A thin, highly portable toolkit for efficiently compiling dense loop-based computation.☆149Updated 3 years ago
- Accelerate PyTorch models with ONNX Runtime☆367Updated this week
- A tensor-aware point-to-point communication primitive for machine learning☆283Updated last month
- The Foundation for All Legate Libraries☆233Updated this week
- Mesh TensorFlow: Model Parallelism Made Easier☆1,625Updated 2 years ago