deepset-ai / haystack
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
☆17,780Updated this week
Related projects ⓘ
Alternatives and complementary repositories for haystack
- LlamaIndex is a data framework for your LLM applications☆36,820Updated this week
- DSPy: The framework for programming—not prompting—language models☆18,885Updated this week
- 💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows☆9,375Updated this week
- Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cl…☆20,632Updated this week
- the AI-native open-source embedding database☆15,448Updated this week
- Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.☆9,182Updated this week
- Build Conversational AI in minutes ⚡️☆7,224Updated this week
- Structured Text Generation☆9,487Updated this week
- Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with struc…☆11,533Updated this week
- Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!☆34,030Updated this week
- Train transformer language models with reinforcement learning.☆10,086Updated this week
- A guidance language for controlling large language models.☆19,118Updated last week
- Large Language Model Text Generation Inference☆9,122Updated this week
- 🦜🔗 Build context-aware reasoning applications☆95,070Updated this week
- 🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading☆9,248Updated 2 months ago
- Open-source vector similarity search for Postgres☆12,643Updated 3 weeks ago
- Semantic cache for LLMs. Fully integrated with LangChain and llama_index.☆7,238Updated 2 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆30,423Updated this week
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets☆3,986Updated this week
- 🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with Llam…☆6,598Updated this week
- QLoRA: Efficient Finetuning of Quantized LLMs☆10,059Updated 5 months ago
- The Memory layer for your AI apps☆22,875Updated this week
- tiktoken is a fast BPE tokeniser for use with OpenAI's models.☆12,427Updated last month
- Finetune Llama 3.2, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 80% less memory☆18,263Updated this week
- Run any open-source LLMs, such as Llama, Mistral, as OpenAI compatible API endpoint in the cloud.☆10,079Updated this week
- Instruct-tune LLaMA on consumer hardware☆18,653Updated 3 months ago
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.☆36,993Updated this week
- A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training☆20,199Updated 3 months ago
- Letta (formerly MemGPT) is a framework for creating LLM services with memory.☆12,838Updated this week
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag…☆13,971Updated this week