vllm-project / aibrixLinks

Cost-efficient and pluggable Infrastructure components for GenAI inference

☆4,414

Alternatives and similar repositories for aibrix

Users that are interested in aibrix are comparing it to the libraries listed below

Sorting:

LMCache / LMCache
Supercharge Your LLM with the Fastest KV Cache Layer
☆6,149Updated this week
vllm-project / production-stack
vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization
☆1,956Updated last week
ai-dynamo / dynamo
A Datacenter Scale Distributed Inference Serving Framework
☆5,490Updated this week
llm-d / llm-d
Achieve state of the art inference performance with modern accelerators on Kubernetes
☆2,067Updated this week
vllm-project / llm-compressor
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
☆2,238Updated last week
kvcache-ai / Mooncake
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
☆4,283Updated this week
predibase / lorax
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
☆3,533Updated 6 months ago
deepseek-ai / smallpond
A lightweight data processing framework built on DuckDB and 3FS.
☆4,838Updated 8 months ago
ray-project / kuberay
A toolkit to run Ray applications on Kubernetes
☆2,142Updated this week
GeeeekExplorer / nano-vllm
Nano vLLM
☆9,095Updated 2 weeks ago
flashinfer-ai / flashinfer
FlashInfer: Kernel Library for LLM Serving
☆4,099Updated this week
ray-project / llmperf
LLMPerf is a library for validating and benchmarking LLMs
☆1,046Updated 11 months ago
Lightning-AI / LitServe
Build custom inference engines for models, agents, multi-modal systems, RAG, pipelines and more.
☆3,711Updated last week
vllm-project / guidellm
Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs
☆708Updated this week
SylphAI-Inc / AdalFlow
AdalFlow: The library to build & auto-optimize LLM applications.
☆3,873Updated last month
sgl-project / sglang
SGLang is a fast serving framework for large language models and vision language models.
☆20,253Updated this week
kubeai-project / kubeai
AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports VLMs, LLMs, embeddings, and speech-to-te…
☆1,092Updated last week
openlit / openlit
Open source platform for AI Engineering: OpenTelemetry-native LLM Observability, GPU Monitoring, Guardrails, Evaluations, Prompt Manageme…
☆2,028Updated last week
NVIDIA / NeMo-Agent-Toolkit
The NVIDIA NeMo Agent toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents.
☆1,514Updated this week
OpenPipe / ART
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement…
☆7,848Updated this week
mlc-ai / xgrammar
Fast, Flexible and Portable Structured Generation
☆1,391Updated this week
ModelTC / LightLLM
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalabili…
☆3,730Updated this week
NVIDIA / KAI-Scheduler
KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale
☆933Updated this week
ServerlessLLM / ServerlessLLM
Serverless LLM Serving for Everyone.
☆603Updated this week
deepseek-ai / open-infra-index
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
☆7,926Updated 6 months ago
rllm-org / rllm
Democratizing Reinforcement Learning for LLMs
☆4,737Updated this week
volcengine / verl
verl: Volcano Engine Reinforcement Learning for LLMs
☆16,159Updated this week
huggingface / smollm
Everything about the SmolLM and SmolVLM family of models
☆3,408Updated 2 months ago
thinking-machines-lab / tinker-cookbook
Post-training with Tinker
☆2,148Updated this week
deepseek-ai / 3FS
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
☆9,467Updated 3 weeks ago