neuralmagic / sparsify
ML model optimization product to accelerate inference.
☆326Updated 11 months ago
Alternatives and similar repositories for sparsify:
Users that are interested in sparsify are comparing it to the libraries listed below
- Neural network model repository for highly sparse and sparse-quantized models with matching sparsification recipes☆382Updated 8 months ago
- Top-level directory for documentation and general content☆121Updated 3 months ago
- Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models☆2,118Updated 7 months ago
- Sparsity-aware deep learning inference runtime for CPUs☆3,117Updated 8 months ago
- Accelerate PyTorch models with ONNX Runtime☆358Updated 3 weeks ago
- FasterAI: Prune and Distill your models with FastAI and PyTorch☆247Updated this week
- An open-source efficient deep learning framework/compiler, written in python.☆691Updated 3 weeks ago
- Prune a model while finetuning or training.☆400Updated 2 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆262Updated 5 months ago
- DiffQ performs differentiable quantization using pseudo quantization noise. It can automatically tune the number of bits used per weight …☆235Updated last year
- Blazing fast training of 🤗 Transformers on Graphcore IPUs☆84Updated last year
- Implementation of a Transformer, but completely in Triton☆260Updated 2 years ago
- Library for 8-bit optimizers and quantization routines.☆718Updated 2 years ago
- A research library for pytorch-based neural network pruning, compression, and more.☆160Updated 2 years ago
- A Python-level JIT compiler designed to make unmodified PyTorch programs faster.☆1,035Updated 11 months ago
- ☆289Updated last week
- Swarm training framework using Haiku + JAX + Ray for layer parallel transformer language models on unreliable, heterogeneous nodes☆236Updated last year
- TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and sup…☆351Updated this week
- The Triton backend for the ONNX Runtime.☆139Updated last week
- Torch Distributed Experimental☆115Updated 7 months ago
- An Agnostic Computer Vision Framework - Pluggable to any Training Library: Fastai, Pytorch-Lightning with more to come☆856Updated 3 months ago
- Language Modeling with the H3 State Space Model☆516Updated last year
- Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackab…☆1,559Updated last year
- Fast Block Sparse Matrices for Pytorch☆546Updated 4 years ago
- This repository contains the experimental PyTorch native float8 training UX☆222Updated 7 months ago
- Accelerate your Neural Architecture Search (NAS) through fast, reproducible and modular research.☆474Updated 4 months ago
- Helps you write algorithms in PyTorch that adapt to the available (CUDA) memory☆435Updated 6 months ago
- ☆247Updated 7 months ago
- Scailable ONNX python tools☆97Updated 4 months ago
- A library to inspect and extract intermediate layers of PyTorch models.☆472Updated 2 years ago