neuralmagic / sparsemlLinks

Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models

☆2,146

Alternatives and similar repositories for sparseml

Users that are interested in sparseml are comparing it to the libraries listed below

Sorting:

neuralmagic / deepsparse
Sparsity-aware deep learning inference runtime for CPUs
☆3,160Updated 2 months ago
neuralmagic / docs
Top-level directory for documentation and general content
☆121Updated 2 months ago
neuralmagic / sparsezoo
Neural network model repository for highly sparse and sparse-quantized models with matching sparsification recipes
☆392Updated 2 months ago
neuralmagic / sparsify
ML model optimization product to accelerate inference.
☆326Updated 2 months ago
ELS-RD / transformer-deploy
Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀
☆1,687Updated 9 months ago
Lightning-Universe / lightning-flash
Your PyTorch AI Factory - Flash enables you to easily configure and run complex AI recipes for over 15 tasks across 7 data domains
☆1,741Updated last year
intel / neural-compressor
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX R…
☆2,465Updated this week
ELS-RD / kernl
Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackab…
☆1,577Updated last year
pytorch / data
A PyTorch repo for data loading and utilities to be shared by the PyTorch domain libraries.
☆1,215Updated this week
libffcv / ffcv
FFCV: Fast Forward Computer Vision (and other ML workloads!)
☆2,955Updated last year
pytorch / torchdynamo
A Python-level JIT compiler designed to make unmodified PyTorch programs faster.
☆1,056Updated last year
facebookresearch / d2go
D2Go is a toolkit for efficient deep learning
☆848Updated 9 months ago
quic / aimet
AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.
☆2,387Updated this week
facebookresearch / fairscale
PyTorch extensions for high performance and large scale training.
☆3,352Updated 3 months ago
facebookresearch / madgrad
MADGRAD Optimization Method
☆801Updated 6 months ago
openvinotoolkit / nncf
Neural Network Compression Framework for enhanced OpenVINO™ inference
☆1,070Updated this week
JonasGeiping / cramming
Cramming the training of a (BERT-type) language model into limited compute.
☆1,341Updated last year
tensorflow / similarity
TensorFlow Similarity is a python package focused on making similarity learning quick and easy.
☆1,023Updated last year
facebookincubator / AITemplate
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (N…
☆4,670Updated 2 weeks ago
huggingface / optimum
🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization…
☆3,005Updated last week
pytorch / TensorRT
PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT
☆2,824Updated this week
TorchStudio / torchstudio
IDE for PyTorch and its ecosystem
☆390Updated last year
huggingface / optimum-quanto
A pytorch quantization backend for optimum
☆979Updated last month
airctic / icevision
An Agnostic Computer Vision Framework - Pluggable to any Training Library: Fastai, Pytorch-Lightning with more to come
☆861Updated 8 months ago
microsoft / archai
Accelerate your Neural Architecture Search (NAS) through fast, reproducible and modular research.
☆480Updated 9 months ago
CalculatedContent / WeightWatcher
The WeightWatcher tool for predicting the accuracy of Deep Neural Networks
☆1,634Updated 2 months ago
huggingface / nn_pruning
Prune a model while finetuning or training.
☆403Updated 3 years ago
triton-inference-server / pytriton
PyTriton is a Flask/FastAPI-like interface that simplifies Triton's deployment in Python environments.
☆814Updated last week
microsoft / mup
maximal update parametrization (µP)
☆1,576Updated last year
dblalock / bolt
10x faster matrix and vector operations
☆2,499Updated 2 years ago