RUSH-LAB / SLIDELinks
☆471Updated 4 years ago
Alternatives and similar repositories for SLIDE
Users that are interested in SLIDE are comparing it to the libraries listed below
Sorting:
- Codebase for "SLIDE : In Defense of Smart Algorithms over Hardware Acceleration for Large-Scale Deep Learning Systems"☆1,106Updated 4 years ago
- Tensors and Dynamic neural networks in Python with strong GPU acceleration☆247Updated this week
- 10x faster matrix and vector operations☆2,516Updated 3 years ago
- Nod.ai 🦈 version of 👻 . You probably want to start at https://github.com/nod-ai/shark for the product and the upstream IREE repository …☆107Updated last month
- ☆279Updated 2 years ago
- Example code and applications for machine learning on Graphcore IPUs☆333Updated last year
- The official page of ROCm/PyTorch will contain information that is always confusing. On this page we will endeavor to describe accurate i…☆87Updated 4 years ago
- A uniform interface to run deep learning models from multiple frameworks☆941Updated 2 years ago
- Fast Block Sparse Matrices for Pytorch☆550Updated 5 years ago
- Accelerate your Neural Architecture Search (NAS) through fast, reproducible and modular research.☆484Updated 2 months ago
- PyTorch, TensorFlow, JAX and NumPy — all of them natively using the same code☆699Updated 2 years ago
- PyTorch elastic training☆728Updated 3 years ago
- MADGRAD Optimization Method☆804Updated last year
- A library for distributed ML training with PyTorch☆366Updated 3 years ago
- GPU fan control for headless Linux☆349Updated 2 years ago
- Bagua Speeds up PyTorch☆884Updated last year
- Swarm training framework using Haiku + JAX + Ray for layer parallel transformer language models on unreliable, heterogeneous nodes☆242Updated 2 years ago
- PyTorch interface for the IPU☆181Updated 2 years ago
- Library for faster pinned CPU <-> GPU transfer in Pytorch☆683Updated 5 years ago
- GPU implementation of a fast generalized ANS (asymmetric numeral system) entropy encoder and decoder, with extensions for lossless compre…☆370Updated 3 weeks ago
- A performant and modular runtime for TensorFlow☆754Updated 4 months ago
- Large Model Support in Tensorflow☆202Updated 5 years ago
- ☆775Updated 2 years ago
- TensorFlow ROCm port☆699Updated this week
- Large Model Support in PyTorch☆136Updated 3 years ago
- Library for 8-bit optimizers and quantization routines.☆780Updated 3 years ago
- A thin, highly portable toolkit for efficiently compiling dense loop-based computation.☆149Updated 3 years ago
- The Foundation for All Legate Libraries☆233Updated this week
- Code for Parameter Prediction for Unseen Deep Architectures (NeurIPS 2021)☆490Updated 2 years ago
- End-to-end training of sparse deep neural networks with little-to-no performance loss.☆334Updated 3 years ago