vllm-project / aibrix
Cost-efficient and pluggable Infrastructure components for GenAI inference
☆3,561 Updated this week
Alternatives and similar repositories for aibrix
Users who are interested in aibrix are comparing it to the libraries listed below.
- A Datacenter Scale Distributed Inference Serving Framework ☆4,011 Updated this week
- vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization ☆1,216 Updated this week
- NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other ent… ☆2,665 Updated this week
- ☆3,323 Updated last month
- Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation ☆7,765 Updated this week
- Redis for LLMs ☆1,061 Updated this week
- SGLang is a fast serving framework for large language models and vision language models. ☆14,392 Updated this week
- Sky-T1: Train your own O1 preview model within $450 ☆3,245 Updated this week
- Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM ☆1,348 Updated this week
- Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI. ☆3,276 Updated this week
- Agent S: an open agentic framework that uses computers like a human ☆4,859 Updated this week
- The python library for real-time communication ☆3,891 Updated this week
- Parlant is the open-source engine for controlled, compliant, and purposeful generative AI conversations. It gives you the power of LLMs w… ☆2,728 Updated this week
- MoBA: Mixture of Block Attention for Long-Context LLMs ☆1,776 Updated last month
- The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention ☆2,659 Updated this week
- ☆3,312 Updated last week
- A visual playground for agentic workflows: Iterate over your agents 10x faster ☆4,909 Updated this week
- A lightweight data processing framework built on DuckDB and 3FS. ☆4,624 Updated 2 months ago
- Open Source Application for Advanced LLM Engineering: interact, train, fine-tune, and evaluate large language models on your own computer… ☆3,295 Updated this week
- verl: Volcano Engine Reinforcement Learning for LLMs ☆8,084 Updated this week
- Flexible and powerful framework for managing multiple AI agents and handling complex conversations ☆5,672 Updated this week
- Task-Aware Agent-driven Prompt Optimization Framework ☆3,242 Updated last month
- FlashInfer: Kernel Library for LLM Serving ☆2,966 Updated this week
- The AI-native proxy server for agents. Arch handles the pesky low-level work in building agentic apps like calling specific tools, routin… ☆2,570 Updated this week
- Everything about the SmolLM2 and SmolVLM family of models ☆2,361 Updated last month
- Everything you need to build state-of-the-art foundation models, end-to-end. ☆8,101 Updated this week
- Vision agent ☆4,731 Updated last week
- Building AI agents, atomically ☆3,615 Updated last week
- A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training. ☆2,763 Updated 2 months ago
- Democratizing Reinforcement Learning for LLMs ☆3,236 Updated this week