infiniflow / infinity
The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text
☆2,669Updated this week
Related projects ⓘ
Alternatives and complementary repositories for infinity
- A @ClickHouse fork that supports high-performance vector search and full-text search.☆868Updated last week
- Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.☆3,505Updated last month
- A blazing fast inference solution for text embeddings models☆2,852Updated 2 weeks ago
- Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you ne…☆5,442Updated this week
- Retrieval and Retrieval-augmented LLMs☆7,655Updated this week
- Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps!☆4,775Updated this week
- SGLang is a fast serving framework for large language models and vision language models.☆6,127Updated this week
- Build resilient language agents as graphs.☆6,754Updated this week
- Modeling, training, eval, and inference code for OLMo☆4,645Updated this week
- LMDeploy is a toolkit for compressing, deploying, and serving LLMs.☆4,678Updated this week
- Supercharge Your LLM Application Evaluations 🚀☆7,261Updated this week
- Building a quick conversation-based search demo with Lepton AI.☆7,842Updated last week
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆19,247Updated this week
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry☆3,334Updated this week
- A generalized information-seeking agent system with Large Language Models (LLMs).☆1,104Updated 5 months ago
- High-speed Large Language Model Serving on PCs with Consumer-grade GPUs☆7,966Updated 2 months ago
- [COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild☆3,996Updated this week
- Netease Youdao's open-source embedding and reranker models for RAG products.☆1,490Updated 2 months ago
- RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.☆23,277Updated this week
- A unified evaluation framework for large language models☆2,469Updated 3 weeks ago
- One-click deploy of a Knowledge Graph powered RAG (GraphRAG) in Azure☆1,870Updated 2 weeks ago
- The Memory layer for your AI apps☆22,907Updated this week
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,063Updated 2 months ago
- Superfast AI decision making and intelligent processing of multi-modal data.☆2,122Updated this week
- Tools for merging pretrained large language models.☆4,830Updated this week
- Go ahead and axolotl questions☆7,950Updated this week
- An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)☆3,983Updated last week
- Parse files for optimal RAG☆3,199Updated last week
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.☆9,195Updated 4 months ago
- Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.☆16,216Updated last month