vcache-project / vCacheLinks
Reliable and Efficient Semantic Prompt Caching with vCache
☆58Updated last month
Alternatives and similar repositories for vCache
Users that are interested in vCache are comparing it to the libraries listed below
Sorting:
- MCP-enabled AI conversation engine with MCTS analysis, FastAPI backend, and async operations for building advanced LLM applications☆46Updated 6 months ago
- SCOPE ICLR 2025☆22Updated 4 months ago
- OSS RL environment + evals toolkit☆290Updated this week
- build your own vector database -- the littlest hnsw☆67Updated last year
- EFFICIENT AND OPTIMIZED TOKENIZER ENGINE FOR LLM INFERENCE SERVING☆492Updated 3 weeks ago
- ASTChunk is a Python toolkit for code chunking using Abstract Syntax Trees (ASTs), designed to create structurally sound and meaningful c…☆144Updated 7 months ago
- GraphBit is the world’s first enterprise-grade Agentic AI framework, built on a Rust core with a Python wrapper for unmatched speed, secu…☆502Updated this week
- DS SERVE: The Largest Open Vector Store over Pretain Data; A Framework for Efficient and Scalable Neural Retrieval☆44Updated last week
- The minimal, ad-hoc way of plug and play NebulaGraph with pip install, even inside Colab Notebook!☆20Updated last year
- Open-source generalized AI agent for everyday task automations.☆454Updated 7 months ago
- Workflow Defined Engine☆24Updated 3 months ago
- RAG based agent with chDB(ClickHouse)☆22Updated 8 months ago
- The driver for LMCache core to run in vLLM☆60Updated last year
- TiDB Vector SDK for Python, including code examples. Join our Discord: https://discord.gg/XzSW23Jg9p☆61Updated 6 months ago
- Implementation of the paper "Lossless Compression of Vector IDs for Approximate Nearest Neighbor Search" by Severo et al.☆89Updated 3 weeks ago
- ☆93Updated last year
- Website with current metrics on the fastest AI models.☆42Updated last year
- Yet another coding assistant powered by LLM.☆16Updated last year
- This is the documentation repository for SGLang. It is auto-generated from https://github.com/sgl-project/sglang☆100Updated this week
- AgentTrace is a lightweight observability library to trace and evaluate agentic systems.☆41Updated 10 months ago
- Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The infere…☆22Updated 8 months ago
- Auto Thinking Mode switch for Qwen3 in Open webui☆70Updated 8 months ago
- Blazing-fast CLI tool for developers to find documentation, code snippets, and answers instantly, online or offline with or without LLM a…☆77Updated this week
- Rust implementation of Surya☆65Updated 11 months ago
- ☆64Updated 8 months ago
- [DAI 2025] Beyond GPT-5: Making LLMs Cheaper and Better via Performance–Efficiency Optimized Routing☆198Updated last month
- ThalamusDB: semantic query processing on multimodal data☆112Updated 5 months ago
- A simple, easy-to-hack Vector Database☆183Updated 3 weeks ago
- Built for demanding AI workflows, this gateway offers low-latency, provider-agnostic access, ensuring your AI applications run smoothly a…☆89Updated 8 months ago
- Turn PostgreSQL into your search engine in a Pythonic way.☆51Updated 5 months ago