The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text.
☆4,400Feb 24, 2026Updated this week
Alternatives and similar repositories for infinity
Users that are interested in infinity are comparing it to the libraries listed below
Sorting:
- RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to creat…☆73,900Updated this week
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆31,031Feb 20, 2026Updated last week
- Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search☆43,056Updated this week
- Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-p…☆9,072Updated this week
- Retrieval and Retrieval-augmented LLMs☆11,329Dec 15, 2025Updated 2 months ago
- Developer-friendly OSS embedded retrieval library for multimodal AI. Search More; Manage Less.☆9,141Updated this week
- AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents☆18,161Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆71,234Updated this week
- LlamaIndex is the leading document agent and OCR platform☆47,210Updated this week
- Universal memory layer for AI Agents☆47,994Feb 23, 2026Updated last week
- SGLang is a high-performance serving framework for large language models and multimodal models.☆23,905Updated this week
- Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cl…☆29,102Updated this week
- Question and Answer based on Anything.☆13,859Mar 24, 2025Updated 11 months ago
- LMDeploy is a toolkit for compressing, deploying, and serving LLMs.☆7,618Updated this week
- Production-ready platform for agentic workflow development.☆130,750Updated this week
- Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean…☆14,074Updated this week
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing a…☆37,083Updated this week
- A programming framework for agentic AI☆54,956Jan 22, 2026Updated last month
- Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and a…☆24,295Updated this week
- KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning a…☆8,574Jan 28, 2026Updated last month
- Netease Youdao's open-source embedding and reranker models for RAG products.☆1,860Sep 9, 2025Updated 5 months ago
- 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)☆6,777Jul 4, 2025Updated 7 months ago
- FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data process…☆27,170Updated this week
- DSPy: The framework for programming—not prompting—language models☆32,381Updated this week
- Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with struc…☆15,690Updated this week
- Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time.☆21,340Updated this week
- SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.☆7,693Nov 7, 2025Updated 3 months ago
- 💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows☆12,210Feb 22, 2026Updated last week
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.☆13,451Feb 16, 2026Updated 2 weeks ago
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry☆4,321Feb 23, 2026Updated last week
- Build, run, manage agentic software at scale.☆38,276Updated this week
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali☆2,688Feb 5, 2026Updated 3 weeks ago
- Supercharge Your LLM Application Evaluations 🚀☆12,736Updated this week
- 🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with Open…☆22,415Updated this week
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)☆67,659Updated this week
- Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.☆7,645Nov 19, 2025Updated 3 months ago
- 🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using Agentic Retrieval 🔄.☆22,726Feb 2, 2026Updated last month
- Open-source search and retrieval database for AI applications.☆26,269Updated this week
- Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.☆54,870Updated this week