milvus-io / milvus-haystack
★13 · Updated last month
Alternatives and similar repositories for milvus-haystack:
Users who are interested in milvus-haystack are comparing it to the libraries listed below.
- Unstructured Data Connectors for Haystack 2.0 ★16 · Updated last year
- A library integrating embedding and reranker models from OpenAI, SentenceTransformers etc for semantic search in vector database. ★31 · Updated last week
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte… ★56 · Updated 3 months ago
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task… ★145 · Updated 4 months ago
- Additional packages (components, document stores and the likes) to extend the capabilities of Haystack version 2.0 and onwards ★130 · Updated this week
- Ready-to-go containerized RAG service. Implemented with text-embedding-inference + Qdrant/LanceDB. ★55 · Updated last month
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ ★63 · Updated 2 months ago
- A Python library to chunk/group your texts based on semantic similarity. ★92 · Updated 6 months ago
- LLM prompt language based on Jinja. Banks provides tools and functions to build prompts text and chat messages from generic blueprints. I… ★72 · Updated this week
- A list of Haystack Integrations, maintained by the community or deepset. ★77 · Updated this week
- C++ inference wrappers for running blazing fast embedding services on your favourite serverless like AWS Lambda. By Prithivi Da, PRs welc… ★20 · Updated 10 months ago
- An integration of Qdrant ANN vector database backend with Haystack ★44 · Updated 3 weeks ago
- A deep-dive into HyDE for Advanced LLM RAG + Introducing AutoHyDE, a semi-supervised framework to improve the effectiveness, covera… ★30 · Updated 10 months ago
- TextEmbed is a REST API crafted for high-throughput and low-latency embedding inference. It accommodates a wide variety of embedding mode… ★22 · Updated 4 months ago
- Deployment a light and full OpenAI API for production with vLLM to support /v1/embeddings with all embeddings models. ★39 · Updated 6 months ago
- A specification for OpenInference, a semantic mapping of ML inferences ★45 · Updated 9 months ago
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024] ★76 · Updated last week
- ★91 · Updated 4 months ago
- GLiNER model in a FastAPI microservice. ★35 · Updated last month
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API ★44 · Updated 4 months ago
- DSPY on action with OpenSource LLMs. ★63 · Updated 9 months ago
- ★25 · Updated this week
- ★18 · Updated 3 months ago
- Python client library for improving your LLM app accuracy ★96 · Updated this week
- Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K … ★79 · Updated last month
- A framework for evaluating function calls made by LLMs ★36 · Updated 6 months ago
- GPT-4 Level Conversational QA Trained In a Few Hours ★58 · Updated 5 months ago
- Lighter, cheaper and faster RAG toolkit (Graph RAG) supported by TargetPilot ★41 · Updated 3 months ago
- Machine Learning Serving focused on GenAI with simplicity as the top priority. ★58 · Updated 3 weeks ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo) ★68 · Updated last month