neuralmagic / sparsifyLinks
ML model optimization product to accelerate inference.
☆325Updated 8 months ago
Alternatives and similar repositories for sparsify
Users that are interested in sparsify are comparing it to the libraries listed below
Sorting:
- Neural network model repository for highly sparse and sparse-quantized models with matching sparsification recipes☆387Updated 8 months ago
- Top-level directory for documentation and general content☆120Updated 8 months ago
- Accelerate PyTorch models with ONNX Runtime☆367Updated this week
- FasterAI: Prune and Distill your models with FastAI and PyTorch☆252Updated this week
- Lite Inference Toolkit (LIT) for PyTorch☆160Updated 4 years ago
- Recipes are a standard, well supported set of blueprints for machine learning engineers to rapidly train models using the latest research…☆340Updated this week
- DiffQ performs differentiable quantization using pseudo quantization noise. It can automatically tune the number of bits used per weight …☆237Updated 2 years ago
- Library for 8-bit optimizers and quantization routines.☆780Updated 3 years ago
- Implementation of a Transformer, but completely in Triton☆279Updated 3 years ago
- IDE for PyTorch and its ecosystem☆393Updated last year
- Memory Efficient Attention (O(sqrt(n)) for Jax and PyTorch☆184Updated 3 years ago
- Official code for "Distributed Deep Learning in Open Collaborations" (NeurIPS 2021)☆117Updated 4 years ago
- A Python-level JIT compiler designed to make unmodified PyTorch programs faster.☆1,072Updated last year
- The merlin dataloader lets you rapidly load tabular data for training deep leaning models with TensorFlow, PyTorch or JAX☆423Updated last year
- An open-source efficient deep learning framework/compiler, written in python.☆740Updated 5 months ago
- Prune a model while finetuning or training.☆406Updated 3 years ago
- Blazing fast training of 🤗 Transformers on Graphcore IPUs☆87Updated last year
- A library to inspect and extract intermediate layers of PyTorch models.☆476Updated 3 years ago
- Swarm training framework using Haiku + JAX + Ray for layer parallel transformer language models on unreliable, heterogeneous nodes☆242Updated 2 years ago
- Curated list of awesome material on optimization techniques to make artificial intelligence faster and more efficient 🚀☆119Updated 2 years ago
- ☆374Updated this week
- ONNX Script enables developers to naturally author ONNX functions and models using a subset of Python.☆420Updated this week
- Memory mapped numpy arrays of varying shapes☆308Updated this week
- Effortless plugin and play Optimizer to cut model training costs by 50%. New optimizer that is 2x faster than Adam on LLMs.☆382Updated last year
- TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and sup…☆412Updated this week
- Open Source Photos Platform Powered by PyTorch☆136Updated 3 years ago
- Language Modeling with the H3 State Space Model☆522Updated 2 years ago
- An alternative to convolution in neural networks☆259Updated last year
- A repository for log-time feedforward networks☆224Updated last year
- Aloception is a set of package for computer vision: aloscene, alodataset, alonet.☆93Updated 8 months ago