neuralmagic / sparsezoo
Neural network model repository for highly sparse and sparse-quantized models with matching sparsification recipes
☆386 Updated 3 weeks ago
Alternatives and similar repositories for sparsezoo
Users who are interested in sparsezoo are comparing it to the libraries listed below.
- ML model optimization product to accelerate inference. ☆324 Updated this week
- Top-level directory for documentation and general content ☆121 Updated 2 weeks ago
- Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models ☆2,137 Updated last week
- Sparsity-aware deep learning inference runtime for CPUs ☆3,147 Updated this week
- Prune a model while finetuning or training. ☆402 Updated 2 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆263 Updated 7 months ago
- FasterAI: Prune and Distill your models with FastAI and PyTorch ☆248 Updated 2 months ago
- An open-source efficient deep learning framework/compiler, written in Python. ☆700 Updated this week
- Fast sparse deep learning on CPUs ☆53 Updated 2 years ago
- A research library for PyTorch-based neural network pruning, compression, and more. ☆162 Updated 2 years ago
- Implementation of a Transformer, but completely in Triton ☆265 Updated 3 years ago
- Highly optimized inference engine for Binarized Neural Networks ☆250 Updated 2 weeks ago
- Accelerate your Neural Architecture Search (NAS) through fast, reproducible and modular research. ☆477 Updated 7 months ago
- DiffQ performs differentiable quantization using pseudo quantization noise. It can automatically tune the number of bits used per weight … ☆235 Updated 2 years ago
- A library for researching neural networks compression and acceleration methods. ☆139 Updated 9 months ago
- Library for 8-bit optimizers and quantization routines. ☆716 Updated 2 years ago
- A Python-level JIT compiler designed to make unmodified PyTorch programs faster. ☆1,046 Updated last year
- Accelerate PyTorch models with ONNX Runtime ☆361 Updated 3 months ago
- ☆310 Updated last week
- End-to-end training of sparse deep neural networks with little-to-no performance loss. ☆322 Updated 2 years ago
- PyTorch library to facilitate development and standardized evaluation of neural network pruning methods. ☆430 Updated last year
- ☆204 Updated 3 years ago
- Fast Block Sparse Matrices for PyTorch ☆545 Updated 4 years ago
- An Open-Source Library for Training Binarized Neural Networks ☆720 Updated 9 months ago
- This repository contains the experimental PyTorch native float8 training UX ☆223 Updated 10 months ago
- ☆149 Updated 2 years ago
- TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and sup… ☆366 Updated last week
- Reference implementations of popular Binarized Neural Networks ☆107 Updated last month
- A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton. ☆544 Updated this week
- Transform ONNX model to PyTorch representation ☆336 Updated 6 months ago