neuralmagic / sparsify
ML model optimization product to accelerate inference.
☆322Updated 9 months ago
Alternatives and similar repositories for sparsify:
Users who are interested in sparsify are comparing it to the libraries listed below:
- Neural network model repository for highly sparse and sparse-quantized models with matching sparsification recipes☆376Updated 5 months ago
- Top-level directory for documentation and general content☆120Updated last month
- Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models☆2,089Updated 5 months ago
- Sparsity-aware deep learning inference runtime for CPUs☆3,077Updated 5 months ago
- Accelerate PyTorch models with ONNX Runtime☆358Updated 4 months ago
- Library for 8-bit optimizers and quantization routines.☆717Updated 2 years ago
- A Python-level JIT compiler designed to make unmodified PyTorch programs faster.☆1,016Updated 9 months ago
- Recipes are a standard, well supported set of blueprints for machine learning engineers to rapidly train models using the latest research…☆301Updated this week
- A library to inspect and extract intermediate layers of PyTorch models.☆470Updated 2 years ago
- The merlin dataloader lets you rapidly load tabular data for training deep learning models with TensorFlow, PyTorch or JAX☆413Updated 9 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆257Updated 3 months ago
- FasterAI: Prune and Distill your models with FastAI and PyTorch☆246Updated last month
- DiffQ performs differentiable quantization using pseudo quantization noise. It can automatically tune the number of bits used per weight …☆235Updated last year
- TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and sup…☆336Updated last week
- Memory mapped numpy arrays of varying shapes☆291Updated 6 months ago
- Swarm training framework using Haiku + JAX + Ray for layer parallel transformer language models on unreliable, heterogeneous nodes☆237Updated last year
- Prune a model while finetuning or training.☆397Updated 2 years ago
- A library for distributed ML training with PyTorch☆366Updated 2 years ago
- GPU implementation of a fast generalized ANS (asymmetric numeral system) entropy encoder and decoder, with extensions for lossless compre…☆321Updated 2 months ago
- Implementation of a Transformer, but completely in Triton☆251Updated 2 years ago
- Fast Block Sparse Matrices for PyTorch☆547Updated 3 years ago
- Named tensors with first-class dimensions for PyTorch☆322Updated last year
- Productionize machine learning predictions, with ONNX or without☆66Updated last year
- PyTorch Lightning Distributed Accelerators using Ray☆211Updated last year
- onnxruntime-extensions: A specialized pre- and post- processing library for ONNX Runtime☆349Updated this week
- An open-source efficient deep learning framework/compiler, written in Python.☆668Updated this week
- MLCommons Algorithmic Efficiency is a benchmark and competition measuring neural network training speedups due to algorithmic improvement…☆345Updated this week
- A library that contains a rich collection of performant PyTorch model metrics, a simple interface to create new metrics, a toolkit to fac…☆224Updated last month
- Implementation of Flash Attention in Jax☆204Updated 10 months ago
- ONNX Script enables developers to naturally author ONNX functions and models using a subset of Python.☆304Updated this week