milvus-io / milvus-model
A library integrating embedding and reranker models from OpenAI, SentenceTransformers, etc. for semantic search in vector databases.
☆42Updated 2 months ago
Alternatives and similar repositories for milvus-model
Users who are interested in milvus-model are comparing it to the libraries listed below.
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]☆83Updated 4 months ago
- A curated list of awesome Milvus projects and resources.☆32Updated 2 years ago
- RAGLight is a lightweight and modular Python library for implementing Retrieval-Augmented Generation (RAG), Agentic RAG and RAT (Retrieva…☆25Updated 2 months ago
- Lighter, cheaper and faster RAG toolkit (Graph RAG) supported by TargetPilot☆45Updated 7 months ago
- An external retriever for GPTs implemented with Zilliz Cloud Pipelines, a more flexible and economic alternative to default GPTs knowledg…☆16Updated last year
- ☆62Updated 10 months ago
- Framework for benchmarking fully-managed vector databases☆79Updated 8 months ago
- Evaluation for AI apps and agents☆41Updated last year
- Byzer-retrieval is a distributed retrieval system designed as a backend for LLM RAG (Retrieval Augmented Generation). The system su…☆48Updated 2 months ago
- ☆24Updated 4 months ago
- Elasticsearch integration into LangChain☆57Updated 3 months ago
- Easy-to-use, high-performance knowledge distillation for LLMs☆85Updated 3 weeks ago
- ☆17Updated last month
- Query Expansion for Better Query Embedding using LLMs☆51Updated 3 months ago
- Evaluation of the BM42 sparse indexing algorithm☆67Updated 10 months ago
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆22Updated 8 months ago
- DSPy in action with open-source LLMs.☆71Updated last year
- ☆34Updated last year
- ☆41Updated 5 months ago
- ☆18Updated 5 months ago
- Python API for https://vespa.ai, the open big data serving engine☆126Updated this week
- xllamacpp - a Python wrapper of llama.cpp☆40Updated last week
- This is an NVIDIA AI Workbench example project that demonstrates an end-to-end model development workflow using Llamafactory.☆56Updated 7 months ago
- ☆60Updated last year
- OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation☆75Updated 2 months ago
- Code and data for the paper "DBCᴏᴘɪʟᴏᴛ: Natural Language Querying over Massive Database via Schema Routing" (EDBT 2025)☆105Updated last month
- ☆24Updated 5 months ago
- Deployment of a light and full OpenAI API for production with vLLM, supporting /v1/embeddings with all embedding models.☆42Updated 10 months ago
- [ICLR'25] ApolloMoE: Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts☆44Updated 6 months ago
- Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The infere…☆20Updated 3 weeks ago