⚡VoltaML is a lightweight library to convert and run your ML/DL deep learning models in high performance inference runtimes like TensorRT, TorchScript, ONNX and TVM.
☆1,180Nov 30, 2022Updated 3 years ago
Alternatives and similar repositories for voltaML
Users that are interested in voltaML are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Beautiful and Easy to use Stable Diffusion WebUI☆1,000Jun 19, 2024Updated last year
- Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackab…☆1,585Jan 28, 2026Updated 2 months ago
- AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (N…☆4,716Updated this week
- Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀☆1,688Oct 23, 2024Updated last year
- 🐶 A tool to package, serve, and deploy any ML model on any platform. Archived to be resurrected one day🤞☆717Sep 13, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Real-time inference for Stable Diffusion - 0.88s latency. Covers AITemplate, nvFuser, TensorRT, FlashAttention. Join our Discord communty…☆560Dec 4, 2023Updated 2 years ago
- Fast finetuning using a booster model that puts the initial state to a local minimum☆113Aug 29, 2023Updated 2 years ago
- 🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization…☆3,354Apr 2, 2026Updated last week
- PyTriton is a Flask/FastAPI-like interface that simplifies Triton's deployment in Python environments.☆842Aug 13, 2025Updated 7 months ago
- PyQt6 GUI to queue and render images and videos using ComfyUI Workflows☆18Mar 5, 2025Updated last year
- Efficient few-shot learning with Sentence Transformers☆2,710Apr 2, 2026Updated last week
- Official Implementation of Paella https://arxiv.org/abs/2211.07292v2☆748Oct 4, 2023Updated 2 years ago
- Desktop AI Generator☆102Apr 14, 2023Updated 2 years ago
- Containers for machine learning☆9,375Updated this week
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Large-scale model inference.☆628Sep 12, 2023Updated 2 years ago
- A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)☆4,743Jan 8, 2024Updated 2 years ago
- The simplest way to serve AI/ML models in production☆1,134Apr 3, 2026Updated last week
- Sparsity-aware deep learning inference runtime for CPUs☆3,163Jun 2, 2025Updated 10 months ago
- The WeightWatcher tool for predicting the accuracy of Deep Neural Networks☆1,736Apr 4, 2026Updated last week
- 🛠️ Tools for Transformers compression using PyTorch Lightning ⚡☆85Feb 1, 2026Updated 2 months ago
- https://wavespeed.ai/ Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.☆1,305Mar 27, 2025Updated last year
- An implementation of a server for the Stability AI Stable Diffusion API☆172Jan 6, 2023Updated 3 years ago
- MegCC是一个运行时超轻量,高效,移植简单的深度学习模型编译器☆484Oct 23, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Foundation Architecture for (M)LLMs☆3,135Apr 11, 2024Updated 2 years ago
- SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, …☆2,612Updated this week
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets☆4,925Updated this week
- Python-based research interface for blackbox and hyperparameter optimization, based on the internal Google Vizier Service.☆1,639Feb 17, 2026Updated last month
- General technology for enabling AI capabilities w/ LLMs and MLLMs☆4,329Apr 5, 2026Updated last week
- 🦘 Explore multimedia datasets at scale☆1,065Dec 7, 2024Updated last year
- An open-source efficient deep learning framework/compiler, written in python.☆740Sep 4, 2025Updated 7 months ago
- ☆35Jan 12, 2024Updated 2 years ago
- An open-source ML pipeline development platform☆995Jan 9, 2025Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- General fine tuning for Stable Diffusion☆509Apr 30, 2023Updated 2 years ago
- Command Line Interface for Hugging Face Inference Endpoints☆65Apr 10, 2024Updated 2 years ago
- An open-source AutoML Library based on PyTorch☆308Updated this week
- Official implementation of "Composer: Creative and Controllable Image Synthesis with Composable Conditions"☆1,558Dec 26, 2023Updated 2 years ago
- Active Learning for Text Classification in Python☆637Apr 1, 2026Updated last week
- A collection of libraries to optimise AI model performances☆8,349Jul 22, 2024Updated last year
- Accessible large language models via k-bit quantization for PyTorch.☆8,107Updated this week