RUSH-LAB / SLIDELinks
☆471Updated 3 years ago
Alternatives and similar repositories for SLIDE
Users that are interested in SLIDE are comparing it to the libraries listed below
Sorting:
- Codebase for "SLIDE : In Defense of Smart Algorithms over Hardware Acceleration for Large-Scale Deep Learning Systems"☆1,095Updated 4 years ago
- ☆74Updated last year
- ☆771Updated last year
- Fast Block Sparse Matrices for Pytorch☆548Updated 4 years ago
- PatrickStar enables Larger, Faster, Greener Pretrained Models for NLP and democratizes AI for everyone.☆761Updated 2 years ago
- Swarm training framework using Haiku + JAX + Ray for layer parallel transformer language models on unreliable, heterogeneous nodes☆239Updated 2 years ago
- A performant and modular runtime for TensorFlow☆759Updated 2 months ago
- Tensors and Dynamic neural networks in Python with strong GPU acceleration☆230Updated this week
- GPU implementation of a fast generalized ANS (asymmetric numeral system) entropy encoder and decoder, with extensions for lossless compre…☆338Updated last week
- PIX is an image processing library in JAX, for JAX.☆418Updated 3 months ago
- The Tensor Algebra SuperOptimizer for Deep Learning☆716Updated 2 years ago
- The Foundation for All Legate Libraries☆218Updated last week
- Nod.ai 🦈 version of 👻 . You probably want to start at https://github.com/nod-ai/shark for the product and the upstream IREE repository …☆106Updated 5 months ago
- Haste: a fast, simple, and open RNN library☆330Updated last year
- Accelerate your Neural Architecture Search (NAS) through fast, reproducible and modular research.☆477Updated 8 months ago
- common in-memory tensor structure☆1,019Updated 2 weeks ago
- A Python-level JIT compiler designed to make unmodified PyTorch programs faster.☆1,053Updated last year
- This is a Tensor Train based compression library to compress sparse embedding tables used in large-scale machine learning models such as …☆194Updated 2 years ago
- TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and sup…☆368Updated last week
- A High Level API for Deep Learning in JAX☆475Updated 2 years ago
- An open-source efficient deep learning framework/compiler, written in python.☆704Updated last week
- A platform for managing machine learning experiments☆858Updated last month
- Efficient GPU kernels for block-sparse matrix multiplication and convolution☆1,042Updated 2 years ago
- A library for distributed ML training with PyTorch☆366Updated 2 years ago
- 10x faster matrix and vector operations☆2,490Updated 2 years ago
- Library for 8-bit optimizers and quantization routines.☆715Updated 2 years ago
- Bagua Speeds up PyTorch☆883Updated 10 months ago
- Myia prototyping☆457Updated last year
- [Prototype] Tools for the concurrent manipulation of variably sized Tensors.☆251Updated 2 years ago
- A thin, highly portable toolkit for efficiently compiling dense loop-based computation.☆148Updated 2 years ago