The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text.
☆4,590Jun 29, 2026Updated this week
Alternatives and similar repositories for infinity
Users that are interested in infinity are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to creat…☆83,679Updated this week
- Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search☆44,975Updated this week
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆34,026Jun 22, 2026Updated last week
- Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-p…☆9,398Updated this week
- Retrieval and Retrieval-augmented LLMs☆11,887Apr 22, 2026Updated 2 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Developer-friendly OSS embedded retrieval library for multimodal AI. Search More; Manage Less.☆10,768Updated this week
- open-source agentic AI data assistant for the next generation of AI + Data products.☆19,310Updated this week
- LlamaIndex is the leading document agent and OCR platform☆50,533Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆84,877Updated this week
- Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cl…☆32,812Updated this week
- Universal memory layer for AI Agents☆59,728Updated this week
- SGLang is a high-performance serving framework for large language models and multimodal models.☆29,694Updated this week
- Question and Answer based on Anything.☆14,019Mar 24, 2025Updated last year
- Production-ready platform for agentic workflow development.☆146,705Updated this week
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Netease Youdao's open-source embedding and reranker models for RAG products.☆1,881Sep 9, 2025Updated 9 months ago
- LMDeploy is a toolkit for compressing, deploying, and serving LLMs.☆7,928Updated this week
- Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean…☆15,002Jun 24, 2026Updated last week
- A composable and fully extensible C++ execution engine library for data management systems.☆4,162Updated this week
- A programming framework for agentic AI☆59,261Apr 15, 2026Updated 2 months ago
- Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and a…☆25,764Updated this week
- FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data process…☆28,711Updated this week
- KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning a…☆8,844Jan 28, 2026Updated 5 months ago
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing a…☆51,475Jun 25, 2026Updated last week
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- vsag is a vector indexing library used for similarity search.☆485Updated this week
- Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with struc…☆16,429Jun 25, 2026Updated last week
- 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)☆6,878Jul 4, 2025Updated 11 months ago
- DSPy: The framework for programming—not prompting—language models☆35,605Updated this week
- Data Agent Ready Warehouse : One for Analytics, Search, AI, Python Sandbox. — rebuilt from scratch. Unified architecture on your S3.☆9,358Updated this week
- A library for efficient similarity search and clustering of dense vectors.☆40,426Updated this week
- OpenSPG is a Knowledge Graph Engine developed by Ant Group in collaboration with OpenKG, based on the SPG (Semantic-enhanced Programmable…☆2,139Jul 5, 2025Updated 11 months ago
- Supercharge Your LLM Application Evaluations 🚀☆14,523Feb 24, 2026Updated 4 months ago
- 💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows☆12,683Jun 22, 2026Updated last week
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- The agent engineering platform.☆140,319Updated this week
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali☆2,857Mar 24, 2026Updated 3 months ago
- Platform for stateful agents: AI with advanced memory that can learn and self-improve over time.☆23,543Updated this week
- SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.☆7,906Nov 7, 2025Updated 7 months ago
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)☆72,482Jun 24, 2026Updated last week
- Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.☆7,905Nov 19, 2025Updated 7 months ago
- Search infrastructure for AI☆28,614Updated this week