vllm-project / semantic-routerLinks
Intelligent Mixture-of-Models Router for Efficient LLM Inference
☆1,485Updated this week
Alternatives and similar repositories for semantic-router
Users that are interested in semantic-router are comparing it to the libraries listed below
Sorting:
- Manages Unified Access to Generative AI Services built on Envoy Gateway☆1,064Updated this week
- [EMNLP'25] s3 - ⚡ Efficient & Effective Search Agent Training via RL for RAG (Verifier-Powered RLVR for Search)☆697Updated last month
- Ultimate Context Engineering Infrastructure, starting from MCPs and Integrations☆760Updated last month
- When Agent Becomes the Scientist – Building Closed-Loop System from Hypothesis to Verification☆520Updated last month
- Klavis AI (YC X25): MCP integration layers that let AI agents use thousands of tools reliably.☆4,412Updated this week
- Mirix is a multi-agent personal assistant designed to track on-screen activities and answer user questions intelligently. By capturing re…☆1,450Updated this week
- RepoMaster: The open-source AI agent that masters GitHub. It turns any code repository into a powerful tool, achieving a new level of aut…☆378Updated this week
- ❓Curie: Automated and Rigorous Scientific Experimentation with AI Agents☆285Updated last month
- ☆480Updated last week
- This repository contains the implementation of AutoSchemaKG, a novel framework for automatic knowledge graph construction that combines s…☆493Updated last week
- R-KV: Redundancy-aware KV Cache Compression for Reasoning Models☆1,122Updated last month
- Easiest and laziest way for building multi-agent LLMs applications.☆2,742Updated last week
- ☆453Updated last week
- ☆867Updated last week
- Build multimodal language agents for fast prototype and production☆2,555Updated 6 months ago
- [arXiv'25] EraRAG: Efficient and Incremental Retrieval-Augmented Generation for Growing Corpora☆154Updated this week
- LightAgent: Lightweight AI agent framework with memory, tools & tree-of-thought. Supports multi-agent collaboration, self-learning, and m…☆297Updated last week
- [NeurIPS 2025 Main] SWE-SQL: Illuminating LLM Pathways to Solve User SQL Issues in Real-World Applications☆769Updated last week
- Open source AI terminal and SSH Client for EC2, Database and Kubernetes.☆1,299Updated this week
- ☆1,114Updated 2 months ago
- Complex Reasoning Rag System☆171Updated this week
- Pytorch Library for Relational Table Learning with LLMs.☆430Updated last week
- "VideoRAG: Chat with Your Videos"☆1,130Updated 2 weeks ago
- A desktop MCP client designed as a tool unitary utility integration, accelerating AI adoption through the Model Context Protocol (MCP) an…☆1,078Updated this week
- AppPlatform 是一个前沿的大模型应用工程,旨在通过集成的声明式编程和低代码 配置工具,简化和优化大模型的训练与推理应用的开发过程。本工程为软件工程师和产品经理提供一个强大的、可扩展的环境,以支持从概念到部署的全流程 AI 应用开发。☆1,273Updated last week
- ScaleCUA is the open-sourced computer use agents that can operate on corss-platform environments (Windows, macOS, Ubuntu, Android).☆279Updated this week
- ☆228Updated last week
- [COLM'25] DeepRetrieval - 🔥 Training Search Agent with Retrieval Outcomes via Reinforcement Learning☆635Updated 3 months ago
- In-depth study of the graphrag☆1,418Updated 2 months ago
- Extending eBPF Programmability and Observability to GPUs (merged into https://github.com/eunomia-bpf/bpftime)☆232Updated last week