vllm-project / aibrix
Cost-efficient and pluggable Infrastructure components for GenAI inference
☆3,341Updated this week
Alternatives and similar repositories for aibrix:
Users that are interested in aibrix are comparing it to the libraries listed below
- vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization☆926Updated this week
- A Datacenter Scale Distributed Inference Serving Framework☆3,377Updated this week
- The fast, Pythonic way to build Model Context Protocol servers 🚀☆1,993Updated last week
- Flexible and powerful framework for managing multiple AI agents and handling complex conversations☆4,593Updated this week
- A suite of tools to develop RAG, semantic search, and other AI applications more easily with PostgreSQL☆4,596Updated this week
- Sky-T1: Train your own O1 preview model within $450☆3,167Updated last week
- A lightweight data processing framework built on DuckDB and 3FS.☆4,444Updated 3 weeks ago
- Agent Framework / shim to use Pydantic with LLMs☆7,819Updated this week
- pingcap/autoflow is a Graph RAG based and conversational knowledge base tool built with TiDB Serverless Vector Storage. Demo: https://tid…☆2,474Updated last week
- Task-Aware Agent-driven Prompt Optimization Framework☆3,073Updated last week
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆2,886Updated 3 weeks ago
- The official Python SDK for Model Context Protocol servers and clients☆6,639Updated this week
- Python SDK for AI agent monitoring, LLM cost tracking, benchmarking, and more. Integrates with most LLMs and agent frameworks including O…☆4,160Updated this week
- Build effective agents using Model Context Protocol and simple workflow patterns☆2,233Updated last week
- Building AI agents, atomically☆3,199Updated this week
- Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspect…☆3,893Updated this week
- Keep searching, reading webpages, reasoning until it finds the answer (or exceeding the token budget)☆3,725Updated this week
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality☆3,763Updated 7 months ago
- NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other ent…☆2,629Updated this week
- verl: Volcano Engine Reinforcement Learning for LLMs☆5,994Updated this week
- Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations,…☆5,964Updated this week
- ☆2,574Updated last week
- ☆3,610Updated last month
- KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning a…☆6,198Updated this week
- MoBA: Mixture of Block Attention for Long-Context LLMs☆1,696Updated 3 weeks ago
- A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.☆2,676Updated 3 weeks ago
- A visual playground for agentic workflows: Iterate over your agents 10x faster☆4,160Updated this week
- Pocket Flow: 100-line LLM framework. Let Agents build Agents!☆1,676Updated this week
- 📃 A better UX for chat, writing content, and coding with LLMs.☆4,255Updated last week
- Cache-Augmented Generation: A Simple, Efficient Alternative to RAG☆1,179Updated last month