neuralmagic / sparseml
Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models
☆2,118 · Updated 7 months ago
Alternatives and similar repositories for sparseml:
Users who are interested in sparseml compare it to the libraries listed below.
- Sparsity-aware deep learning inference runtime for CPUs ☆3,117 · Updated 8 months ago
- Neural network model repository for highly sparse and sparse-quantized models with matching sparsification recipes ☆382 · Updated 8 months ago
- ML model optimization product to accelerate inference. ☆326 · Updated 11 months ago
- Top-level directory for documentation and general content ☆121 · Updated 3 months ago
- Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackab… ☆1,559 · Updated last year
- A Python-level JIT compiler designed to make unmodified PyTorch programs faster. ☆1,037 · Updated 11 months ago
- Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀 ☆1,680 · Updated 4 months ago
- SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX R… ☆2,355 · Updated this week
- FFCV: Fast Forward Computer Vision (and other ML workloads!) ☆2,906 · Updated 9 months ago
- PyTorch extensions for high performance and large scale training. ☆3,278 · Updated 2 months ago
- PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT ☆2,711 · Updated this week
- AIMET is a library that provides advanced quantization and compression techniques for trained neural network models. ☆2,249 · Updated last week
- Library for 8-bit optimizers and quantization routines. ☆717 · Updated 2 years ago
- ONNX Optimizer ☆681 · Updated this week
- 🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization… ☆2,812 · Updated 2 weeks ago
- Transformer related optimization, including BERT, GPT ☆6,084 · Updated 11 months ago
- A PyTorch repo for data loading and utilities to be shared by the PyTorch domain libraries. ☆1,177 · Updated this week
- AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (N… ☆4,620 · Updated 3 months ago
- Recipes are a standard, well supported set of blueprints for machine learning engineers to rapidly train models using the latest research… ☆310 · Updated this week
- Your PyTorch AI Factory - Flash enables you to easily configure and run complex AI recipes for over 15 tasks across 7 data domains ☆1,745 · Updated last year
- A coding-free framework built on PyTorch for reproducible deep learning studies. PyTorch Ecosystem. 🏆25 knowledge distillation methods p… ☆1,467 · Updated last week
- A data augmentations library for audio, image, text, and video. ☆4,991 · Updated 3 weeks ago
- An open-source efficient deep learning framework/compiler, written in python. ☆691 · Updated 3 weeks ago
- TensorFlow Similarity is a python package focused on making similarity learning quick and easy. ☆1,019 · Updated 10 months ago
- CVNets: A library for training computer vision networks ☆1,841 · Updated last year
- A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs… ☆2,293 · Updated this week
- Neural Network Compression Framework for enhanced OpenVINO™ inference ☆986 · Updated this week
- onnxruntime-extensions: A specialized pre- and post- processing library for ONNX Runtime ☆366 · Updated this week
- Official code Cross-Covariance Image Transformer (XCiT) ☆669 · Updated 3 years ago
- An Agnostic Computer Vision Framework - Pluggable to any Training Library: Fastai, Pytorch-Lightning with more to come ☆856 · Updated 3 months ago