infiniflow / infinity
The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text
☆3,967 · Updated this week
Alternatives and similar repositories for infinity
Users that are interested in infinity are comparing it to the libraries listed below
- A blazing fast inference solution for text embeddings models ☆3,871 · Updated this week
- Retrieval and Retrieval-augmented LLMs ☆10,302 · Updated 3 weeks ago
- A @ClickHouse fork that supports high-performance vector search and full-text search. ☆980 · Updated 6 months ago
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc. ☆10,686 · Updated 2 weeks ago
- Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you ne… ☆8,352 · Updated this week
- Developer-friendly, embedded retrieval engine for multimodal AI. Search More; Manage Less. ☆7,270 · Updated this week
- OpenSPG is a Knowledge Graph Engine developed by Ant Group in collaboration with OpenKG, based on the SPG (Semantic-enhanced Programmable… ☆1,803 · Updated last month
- SGLang is a fast serving framework for large language models and vision language models. ☆16,576 · Updated this week
- Web UI for Milvus Vector Database ☆2,141 · Updated this week
- Netease Youdao's open-source embedding and reranker models for RAG products. ☆1,818 · Updated last month
- Benchmark designed to evaluate the performance and cost-effectiveness of vector databases. ☆836 · Updated this week
- LMDeploy is a toolkit for compressing, deploying, and serving LLMs. ☆6,832 · Updated this week
- SoTA LLM for converting natural language questions to SQL queries ☆3,850 · Updated last year
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali ☆2,347 · Updated 2 weeks ago
- A repository that contains models, datasets, and fine-tuning techniques for DB-GPT, with the purpose of enhancing model performance in T… ☆1,865 · Updated last month
- Fast, Accurate, Lightweight Python library to make State of the Art Embedding ☆2,268 · Updated last week
- Simple, scalable AI model deployment on GPU clusters ☆3,230 · Updated this week
- [NeurIPS'24] HippoRAG is a novel RAG framework inspired by human long-term memory that enables LLMs to continuously integrate knowledge a… ☆2,664 · Updated last month
- KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning a… ☆7,601 · Updated this week
- ⚡FlashRAG: A Python Toolkit for Efficient RAG Research (WWW2025 Resource) ☆2,598 · Updated this week
- High-speed Large Language Model Serving for Local Deployment ☆8,301 · Updated last week
- pingcap/autoflow is a Graph RAG based and conversational knowledge base tool built with TiDB Serverless Vector Storage. Demo: https://tid… ☆2,623 · Updated 3 weeks ago
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry ☆4,178 · Updated 5 months ago
- A simple, easy-to-hack GraphRAG implementation ☆3,192 · Updated 2 weeks ago
- SuperSonic is the next-generation AI+BI platform that unifies Chat BI (powered by LLM) and Headless BI (powered by semantic layer) paradi… ☆4,020 · Updated this week
- C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V) ☆2,979 · Updated last year
- Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with struc… ☆14,144 · Updated this week
- Curated tutorials and resources for Large Language Models, Text2SQL, Text2DSL, Text2API, Text2Vis and more. ☆3,006 · Updated 2 months ago
- TuGraph: A High Performance Graph Database. ☆1,613 · Updated 3 weeks ago
- Supercharge Your LLM Application Evaluations 🚀 ☆10,261 · Updated this week