⚡VoltaML is a lightweight library to convert and run your ML/DL deep learning models in high performance inference runtimes like TensorRT, TorchScript, ONNX and TVM.
☆1,182Nov 30, 2022Updated 3 years ago
Alternatives and similar repositories for voltaML
Users that are interested in voltaML are comparing it to the libraries listed below
Sorting:
- Beautiful and Easy to use Stable Diffusion WebUI☆1,001Jun 19, 2024Updated last year
- AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (N…☆4,706Jan 12, 2026Updated last month
- Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackab…☆1,585Jan 28, 2026Updated last month
- Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀☆1,687Oct 23, 2024Updated last year
- Real-time inference for Stable Diffusion - 0.88s latency. Covers AITemplate, nvFuser, TensorRT, FlashAttention. Join our Discord communty…☆561Dec 4, 2023Updated 2 years ago
- 🐶 A tool to package, serve, and deploy any ML model on any platform. Archived to be resurrected one day🤞☆719Sep 13, 2023Updated 2 years ago
- Efficient few-shot learning with Sentence Transformers☆2,688Dec 11, 2025Updated 2 months ago
- 🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization…☆3,305Feb 9, 2026Updated 3 weeks ago
- Sparsity-aware deep learning inference runtime for CPUs☆3,159Jun 2, 2025Updated 9 months ago
- Fast finetuning using a booster model that puts the initial state to a local minimum☆113Aug 29, 2023Updated 2 years ago
- Official Implementation of Paella https://arxiv.org/abs/2211.07292v2☆748Oct 4, 2023Updated 2 years ago
- Containers for machine learning☆9,252Updated this week
- A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)☆4,741Jan 8, 2024Updated 2 years ago
- Foundation Architecture for (M)LLMs☆3,135Apr 11, 2024Updated last year
- PyTriton is a Flask/FastAPI-like interface that simplifies Triton's deployment in Python environments.☆835Aug 13, 2025Updated 6 months ago
- SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, …☆2,590Updated this week
- Python-based research interface for blackbox and hyperparameter optimization, based on the internal Google Vizier Service.☆1,630Feb 17, 2026Updated last week
- https://wavespeed.ai/ Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.☆1,301Mar 27, 2025Updated 11 months ago
- The simplest way to serve AI/ML models in production☆1,122Updated this week
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets☆4,875Feb 23, 2026Updated last week
- Large-scale model inference.☆627Sep 12, 2023Updated 2 years ago
- A collection of libraries to optimise AI model performances☆8,353Jul 22, 2024Updated last year
- The WeightWatcher tool for predicting the accuracy of Deep Neural Networks☆1,724Updated this week
- An implementation of a server for the Stability AI Stable Diffusion API☆172Jan 6, 2023Updated 3 years ago
- General technology for enabling AI capabilities w/ LLMs and MLLMs☆4,289Dec 22, 2025Updated 2 months ago
- 🦘 Explore multimedia datasets at scale☆1,062Dec 7, 2024Updated last year
- An open-source AutoML Library based on PyTorch☆309Jan 5, 2026Updated last month
- An open-source ML pipeline development platform☆997Jan 9, 2025Updated last year
- Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML va…☆3,982Dec 28, 2025Updated 2 months ago
- Move fast from data science prototype to pipeline. Capture, analyze, and transform messy notebooks into data pipelines with just two line…☆669Feb 22, 2025Updated last year
- An open-source efficient deep learning framework/compiler, written in python.☆737Sep 4, 2025Updated 5 months ago
- Running large language models on a single GPU for throughput-oriented scenarios.☆9,382Oct 28, 2024Updated last year
- The merlin dataloader lets you rapidly load tabular data for training deep leaning models with TensorFlow, PyTorch or JAX☆423Apr 16, 2024Updated last year
- Algorithms for explaining machine learning models☆2,612Oct 17, 2025Updated 4 months ago
- Aim 💫 — An easy-to-use & supercharged open-source experiment tracker.☆6,017Updated this week
- Accessible large language models via k-bit quantization for PyTorch.☆7,997Updated this week
- Active Learning for Text Classification in Python☆639Feb 1, 2026Updated last month
- The balance python package offers a simple workflow and methods for dealing with biased data samples when looking to infer from them to s…☆739Updated this week
- Transformer related optimization, including BERT, GPT☆6,394Mar 27, 2024Updated last year