Neural network model repository for highly sparse and sparse-quantized models with matching sparsification recipes
☆387 · Jun 2, 2025 · Updated 9 months ago
Alternatives and similar repositories for sparsezoo
Users interested in sparsezoo are comparing it to the libraries listed below.
- ML model optimization product to accelerate inference. ☆326 · Jun 2, 2025 · Updated 9 months ago
- Top-level directory for documentation and general content ☆120 · Jun 2, 2025 · Updated 9 months ago
- Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models ☆2,144 · Jun 2, 2025 · Updated 9 months ago
- Sparsity-aware deep learning inference runtime for CPUs ☆3,163 · Jun 2, 2025 · Updated 9 months ago
- A model compression and acceleration toolbox based on PyTorch ☆333 · Jan 12, 2024 · Updated 2 years ago
- [TCAD 2021] Block Convolution: Towards Memory-Efficient Inference of Large-Scale CNNs on FPGA ☆17 · Jul 7, 2022 · Updated 3 years ago
- ☆10 · Jul 27, 2020 · Updated 5 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆267 · Dec 4, 2025 · Updated 3 months ago
- Awesome quantization paper lists with code ☆10 · Feb 24, 2021 · Updated 5 years ago
- Code accompanying the NeurIPS 2020 paper: WoodFisher (Singh & Alistarh, 2020) ☆53 · Mar 8, 2021 · Updated 4 years ago
- Code for the paper "QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models" ☆280 · Nov 3, 2023 · Updated 2 years ago
- McPAT modeling framework ☆12 · Oct 18, 2014 · Updated 11 years ago
- An external memory allocator example for PyTorch ☆16 · Aug 10, 2025 · Updated 6 months ago
- Official implementation of "Searching for Winograd-aware Quantized Networks" (MLSys'20) ☆27 · Oct 3, 2023 · Updated 2 years ago
- A framework that helps developers apply structured pruning to TensorFlow models ☆28 · Nov 7, 2024 · Updated last year
- PyTorch implementation of APoT quantization (ICLR 2020) ☆283 · Dec 11, 2024 · Updated last year
- Benchmark PyTorch custom operators ☆14 · Jul 6, 2023 · Updated 2 years ago
- Refine high-quality datasets and visual AI models ☆10,410 · Feb 28, 2026 · Updated last week
- QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX ☆178 · Feb 19, 2026 · Updated 2 weeks ago
- Code for the ICML 2023 paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot" ☆872 · Aug 20, 2024 · Updated last year
- GPTQLoRA: Efficient Finetuning of Quantized LLMs with GPTQ ☆101 · May 30, 2023 · Updated 2 years ago
- PyTorch distributed backend extension with compression support ☆17 · Mar 24, 2025 · Updated 11 months ago
- NCNN deployment of the lightweight FastestDet detection network ☆17 · Jul 7, 2022 · Updated 3 years ago
- Generating Training Data Made Easy ☆43 · Jul 3, 2020 · Updated 5 years ago
- High-performance Int8 GEMM kernels for SM80 and later GPUs ☆20 · Mar 11, 2025 · Updated 11 months ago
- ☆18 · Sep 25, 2025 · Updated 5 months ago
- MaxEVA: Maximizing the Efficiency of Matrix Multiplication on Versal AI Engine (accepted as a full paper at FPT'23) ☆21 · Apr 17, 2024 · Updated last year
- PyTorch reimplementation of the paper "HyperMixer: An MLP-based Green AI Alternative to Transformers" [arXiv 2022] ☆18 · Mar 28, 2022 · Updated 3 years ago
- ☆23 · Jan 3, 2025 · Updated last year
- segment-anything based MNN implementation ☆36 · Dec 13, 2023 · Updated 2 years ago
- [NeurIPS 2023] ShiftAddViT: Mixture of Multiplication Primitives Towards Efficient Vision Transformer ☆30 · Dec 6, 2023 · Updated 2 years ago
- 🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM, and Sentence Transformers with easy-to-use hardware optimization… ☆3,305 · Feb 9, 2026 · Updated 3 weeks ago
- Libraries, guides, blueprints, and sample code to enable rapidly building 0-1 applications on iOS, Android, and the web ☆11 · May 12, 2023 · Updated 2 years ago
- Tools for simple inference testing of ONNX models using the TensorRT, CUDA, and OpenVINO CPU/GPU providers ☆24 · Sep 7, 2025 · Updated 6 months ago
- Your PyTorch AI Factory: Flash enables you to easily configure and run complex AI recipes for over 15 tasks across 7 data domains ☆1,731 · Oct 8, 2023 · Updated 2 years ago
- Code for the paper "QuIP: 2-Bit Quantization of Large Language Models With Guarantees" ☆397 · Feb 24, 2024 · Updated 2 years ago
- Code for generating the JuICe dataset ☆37 · Oct 27, 2021 · Updated 4 years ago
- [ICLR 2024] Official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Mod… ☆39 · Mar 11, 2024 · Updated last year
- IntLLaMA: A fast and light quantization solution for LLaMA ☆18 · Jul 21, 2023 · Updated 2 years ago