RUSH-LAB / SLIDELinks

☆471

Alternatives and similar repositories for SLIDE

Users that are interested in SLIDE are comparing it to the libraries listed below

Sorting:

keroro824 / HashingDeepLearning
Codebase for "SLIDE : In Defense of Smart Algorithms over Hardware Acceleration for Large-Scale Deep Learning Systems"
☆1,095Updated 4 years ago
IntelLabs / SLIDE_opt_ia
☆74Updated last year
google / objax
☆771Updated last year
huggingface / pytorch_block_sparse
Fast Block Sparse Matrices for Pytorch
☆548Updated 4 years ago
Tencent / PatrickStar
PatrickStar enables Larger, Faster, Greener Pretrained Models for NLP and democratizes AI for everyone.
☆761Updated 2 years ago
kingoflolz / swarm-jax
Swarm training framework using Haiku + JAX + Ray for layer parallel transformer language models on unreliable, heterogeneous nodes
☆239Updated 2 years ago
tensorflow / runtime
A performant and modular runtime for TensorFlow
☆759Updated 2 months ago
ROCm / pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
☆230Updated this week
facebookresearch / dietgpu
GPU implementation of a fast generalized ANS (asymmetric numeral system) entropy encoder and decoder, with extensions for lossless compre…
☆338Updated last week
google-deepmind / dm_pix
PIX is an image processing library in JAX, for JAX.
☆418Updated 3 months ago
jiazhihao / TASO
The Tensor Algebra SuperOptimizer for Deep Learning
☆716Updated 2 years ago
nv-legate / legate
The Foundation for All Legate Libraries
☆218Updated last week
nod-ai / SRT
Nod.ai 🦈 version of 👻 . You probably want to start at https://github.com/nod-ai/shark for the product and the upstream IREE repository …
☆106Updated 5 months ago
lmnt-com / haste
Haste: a fast, simple, and open RNN library
☆330Updated last year
microsoft / archai
Accelerate your Neural Architecture Search (NAS) through fast, reproducible and modular research.
☆477Updated 8 months ago
dmlc / dlpack
common in-memory tensor structure
☆1,019Updated 2 weeks ago
pytorch / torchdynamo
A Python-level JIT compiler designed to make unmodified PyTorch programs faster.
☆1,053Updated last year
facebookresearch / FBTT-Embedding
This is a Tensor Train based compression library to compress sparse embedding tables used in large-scale machine learning models such as …
☆194Updated 2 years ago
pytorch / torchx
TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and sup…
☆368Updated last week
poets-ai / elegy
A High Level API for Deep Learning in JAX
☆475Updated 2 years ago
hidet-org / hidet
An open-source efficient deep learning framework/compiler, written in python.
☆704Updated last week
google-deepmind / xmanager
A platform for managing machine learning experiments
☆858Updated last month
openai / blocksparse
Efficient GPU kernels for block-sparse matrix multiplication and convolution
☆1,042Updated 2 years ago
facebookresearch / moolib
A library for distributed ML training with PyTorch
☆366Updated 2 years ago
dblalock / bolt
10x faster matrix and vector operations
☆2,490Updated 2 years ago
facebookresearch / bitsandbytes
Library for 8-bit optimizers and quantization routines.
☆715Updated 2 years ago
BaguaSys / bagua
Bagua Speeds up PyTorch
☆883Updated 10 months ago
mila-iqia / myia
Myia prototyping
☆457Updated last year
pytorch / nestedtensor
[Prototype] Tools for the concurrent manipulation of variably sized Tensors.
☆251Updated 2 years ago
facebookresearch / loop_tool
A thin, highly portable toolkit for efficiently compiling dense loop-based computation.
☆148Updated 2 years ago