This NVIDIA RAG blueprint serves as a reference solution for a foundational Retrieval Augmented Generation (RAG) pipeline.
☆584Apr 30, 2026Updated this week
Alternatives and similar repositories for rag
Users that are interested in rag are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The AI-Q NVIDIA Blueprint is an open reference example for building intelligent AI agents that connect to your enterprise data, reason us…☆490Apr 24, 2026Updated last week
- Customizable, AI-driven virtual assistant designed to streamline customer service operations, handle common inquiries, and improve overal…☆251Apr 9, 2026Updated 3 weeks ago
- The Retail Shopping Assistant is an AI-powered blueprint that provides a comprehensive interface for an intelligent retail shopping advis…☆49Updated this week
- Route LLM requests to the best model for the task at hand.☆259Updated this week
- ☆39Mar 13, 2026Updated last month
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- The NVIDIA NeMo Agent toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents.☆2,243Updated this week
- Pipecat framework based orchestrator for building real-time, voice-enabled, and multimodal conversational AI agents☆53Mar 3, 2026Updated last month
- NeMo Retriever Library is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extracti…☆2,915Updated this week
- An NVIDIA AI Workbench example project for an Agentic Retrieval Augmented Generation (RAG)☆160Jan 16, 2026Updated 3 months ago
- Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.☆3,969Mar 30, 2026Updated last month
- Financial CrewAI Agents (LangChain, YF Tools, Ai Crew, Groq Inference)☆32Jul 12, 2024Updated last year
- A Model Context Protocol (MCP) server for taking screenshots and reading console logs of web pages using Playwright.☆22Oct 2, 2025Updated 7 months ago
- GitHub Action to calculate and publish lines of code report in GitHub Actions as Checksuite.☆15Mar 7, 2024Updated 2 years ago
- ☆14Apr 22, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- All public LiveKit repos as a common repo to make searching and LLM inference easier.☆29Apr 16, 2026Updated 2 weeks ago
- Real time conversatio co-pilot able to generate suggestions from recorded audio☆13Mar 1, 2024Updated 2 years ago
- ☆14Apr 21, 2024Updated 2 years ago
- Blueprint for Ingesting massive volumes of live or archived videos and extract insights for summarization and interactive Q&A☆622Updated this week
- Voice Agent Framework for Conversational AI☆83May 5, 2025Updated 11 months ago
- ☆199Updated this week
- The NVIDIA NeMo Agent Toolkit UI streamlines interacting with NeMo Agent Toolkit workflows in an easy-to-use web application.☆96Apr 25, 2026Updated last week
- ☆26Mar 1, 2026Updated 2 months ago
- VitalPBX - AI Agent with OpenAI ChatGPT, Whisper and Microsoft Azure AI Speech (TTS)☆20Jan 24, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Evaluate how vLLM and SGLang perform when running a small LLM model on a mid-range NVIDIA GPU☆21Apr 20, 2026Updated last week
- iOS App utilizing the RabbitKit Library to emulate the Rabbit R1 software experience☆16Aug 28, 2024Updated last year
- An NVIDIA AI Workbench example project for Retrieval Augmented Generation (RAG)☆368Aug 12, 2025Updated 8 months ago
- Contains Python Code for using ML/DL Models from HF☆29Oct 9, 2025Updated 6 months ago
- Interactive RAG workbench to demonstrate Redis features and enhancements for improving accuracy, speed, cost, and reliability of LLM appl…☆28Oct 6, 2025Updated 6 months ago
- ☆27Aug 4, 2025Updated 8 months ago
- User interface for the Local Operator on-device agent environment. Local Operator is an AI agents platform that gives you an integrated …☆25Feb 11, 2026Updated 2 months ago
- ☆10Nov 16, 2024Updated last year
- ☆30May 30, 2025Updated 11 months ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- This repository provides the code for applying Contrastive Learning Penalty Loss (CLPL) and Mixture of Experts (MoE) to the BGE-M3 text e…☆11Dec 27, 2024Updated last year
- Queries related to creating an AI Email Assistant with MindsDB and GPT4.☆12Oct 17, 2024Updated last year
- This is a simple example of how to serve a DeepSeek model with Azure ML.☆10Feb 10, 2025Updated last year
- Code-Langchain☆44Feb 20, 2024Updated 2 years ago
- A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM☆3,119Jan 21, 2026Updated 3 months ago
- Examples and Demos using the Cohere APIs☆23Nov 3, 2023Updated 2 years ago
- Sample data conversion pipeline for importing data into Amazon Personalize.☆19Feb 13, 2019Updated 7 years ago