An LLM-powered advanced RAG pipeline built from scratch
☆857Jan 26, 2024Updated 2 years ago
Alternatives and similar repositories for rag-demystified
Users that are interested in rag-demystified are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A comprehensive guide to building RAG-based LLM applications for production.☆1,855Aug 2, 2024Updated last year
- Database system for AI-powered apps☆2,679May 17, 2024Updated last year
- Open Source LLM toolkit to build trustworthy LLM applications. TigerArmor (AI safety), TigerRAG (embedding, RAG), TigerTune (fine-tuning)☆401Dec 2, 2023Updated 2 years ago
- ☆224Nov 15, 2023Updated 2 years ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,904May 17, 2025Updated 11 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A real world full-stack application using LlamaIndex☆2,595Mar 12, 2025Updated last year
- Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.☆867Jan 15, 2024Updated 2 years ago
- Structured Outputs☆13,657Mar 26, 2026Updated 3 weeks ago
- 🤖 Deploy a private ChatGPT alternative hosted within your VPC. 🔮 Connect it to your organization's knowledge base and use it as a corpo…☆1,591Sep 11, 2023Updated 2 years ago
- Ship RAG based LLM web apps in seconds.☆1,005Jan 29, 2024Updated 2 years ago
- 💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows☆12,395Apr 8, 2026Updated last week
- Supercharge Your LLM Application Evaluations 🚀☆13,415Feb 24, 2026Updated last month
- Open-source tool to visualise your RAG 🔮☆1,216Jan 3, 2025Updated last year
- A guidance language for controlling large language models.☆21,397Apr 10, 2026Updated last week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- DSPy: The framework for programming—not prompting—language models☆33,649Apr 13, 2026Updated last week
- Retrieval Augmented Generation (RAG) chatbot powered by Weaviate☆7,646Jul 14, 2025Updated 9 months ago
- structured outputs for llms☆12,749Updated this week
- Developer APIs to Accelerate LLM Projects☆1,750Oct 18, 2024Updated last year
- Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time.☆22,141Apr 12, 2026Updated last week
- RAG-Fusion: multi-query generation + Reciprocal Rank Fusion for better retrieval-augmented generation. Includes evaluation harness with N…☆921Mar 7, 2026Updated last month
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which ach…☆6,030Apr 8, 2026Updated last week
- Efficient Retrieval Augmentation and Generation Framework☆1,776Jan 12, 2026Updated 3 months ago
- Seamlessly integrate LLMs as Python functions☆2,405Mar 11, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Build ChatGPT over your data, all with natural language☆6,532Apr 5, 2024Updated 2 years ago
- Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chro…☆3,032Feb 11, 2026Updated 2 months ago
- SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.☆7,764Nov 7, 2025Updated 5 months ago
- [ICLR 2024] Efficient Streaming Language Models with Attention Sinks☆7,211Jul 11, 2024Updated last year
- Turn expensive prompts into cheap fine-tuned models☆2,791May 25, 2024Updated last year
- AI-to-AI Testing | Simulation framework for LLM-based applications☆136Nov 7, 2023Updated 2 years ago
- A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for vario…☆1,050Feb 27, 2025Updated last year
- LlamaIndex is the leading document agent and OCR platform☆48,601Updated this week
- VectorFlow is a high volume vector embedding pipeline that ingests raw data, transforms it into vectors and writes it to a vector DB of y…☆701May 16, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Semantic cache for LLMs. Fully integrated with LangChain and llama_index.☆7,990Jul 11, 2025Updated 9 months ago
- Querying local documents, powered by LLM☆649Jan 17, 2026Updated 3 months ago
- Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and a…☆24,815Apr 10, 2026Updated last week
- Retrieval Augmented Generation (RAG) framework and context engine powered by Pinecone☆1,030Nov 13, 2024Updated last year
- Robust recipes to align language models with human and AI preferences☆5,558Apr 8, 2026Updated last week
- Harness LLMs with Multi-Agent Programming☆3,967Apr 7, 2026Updated last week
- Zep | Examples, Integrations, & More☆4,454Apr 9, 2026Updated last week