RUSH-LAB / SLIDE
☆471 Updated 3 years ago
Alternatives and similar repositories for SLIDE
Users who are interested in SLIDE compare it to the libraries listed below.
- Codebase for "SLIDE : In Defense of Smart Algorithms over Hardware Acceleration for Large-Scale Deep Learning Systems" ☆1,096 Updated 4 years ago
- ☆74 Updated last year
- ☆770 Updated last year
- 10x faster matrix and vector operations ☆2,485 Updated 2 years ago
- Fast Block Sparse Matrices for Pytorch ☆545 Updated 4 years ago
- A performant and modular runtime for TensorFlow ☆761 Updated last month
- Tensors and Dynamic neural networks in Python with strong GPU acceleration ☆227 Updated this week
- End-to-end training of sparse deep neural networks with little-to-no performance loss. ☆320 Updated 2 years ago
- GPU implementation of a fast generalized ANS (asymmetric numeral system) entropy encoder and decoder, with extensions for lossless compre… ☆335 Updated last month
- FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/ ☆1,324 Updated this week
- A High Level API for Deep Learning in JAX ☆475 Updated 2 years ago
- Swarm training framework using Haiku + JAX + Ray for layer parallel transformer language models on unreliable, heterogeneous nodes ☆239 Updated 2 years ago
- A uniform interface to run deep learning models from multiple frameworks ☆935 Updated last year
- Lightweight machine learning library based on OpenCL 1.2 ☆73 Updated 4 years ago
- High performance model preprocessing library on PyTorch ☆650 Updated last year
- Accelerate PyTorch models with ONNX Runtime ☆360 Updated 2 months ago
- This is a Tensor Train based compression library to compress sparse embedding tables used in large-scale machine learning models such as … ☆194 Updated 2 years ago
- Library for 8-bit optimizers and quantization routines. ☆716 Updated 2 years ago
- [Prototype] Tools for the concurrent manipulation of variably sized Tensors. ☆251 Updated 2 years ago
- The NNEF Tools repository contains tools to generate and consume NNEF documents ☆224 Updated this week
- ☆323 Updated last year
- ☆278 Updated 2 years ago
- TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and sup… ☆362 Updated last week
- PyTorch elastic training ☆729 Updated 2 years ago
- The Tensor Algebra SuperOptimizer for Deep Learning ☆711 Updated 2 years ago
- High-efficiency floating-point neural network inference operators for mobile, server, and Web ☆2,016 Updated this week
- Accelerate your Neural Architecture Search (NAS) through fast, reproducible and modular research. ☆477 Updated 6 months ago
- An open-source efficient deep learning framework/compiler, written in python. ☆696 Updated this week
- Large Model Support in Tensorflow ☆202 Updated 4 years ago
- A Python-level JIT compiler designed to make unmodified PyTorch programs faster. ☆1,044 Updated last year