mickymultani / nvidia-NIM-RAG
Project demonstrates the power and simplicity of NVIDIA NIM (NVIDIA Inference Model), a suite of optimized cloud-native microservices, by setting up and running a Retrieval-Augmented Generation (RAG) pipeline.
☆12Updated 11 months ago
Alternatives and similar repositories for nvidia-NIM-RAG:
Users that are interested in nvidia-NIM-RAG are comparing it to the libraries listed below
- LangChain + LiteLLM that works☆37Updated this week
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆25Updated 3 months ago
- never forget anything again! combine AI and intelligent tooling for a local knowledge base to track catalogue, annotate, and plan for you…☆37Updated 9 months ago
- Example implementation of Iteration of Tought - Gives a star if you like the project☆38Updated last month
- ☆22Updated 7 months ago
- 🧠 Mem4AI: A LLM Friendly memory management library.☆18Updated 3 months ago
- A repository to store helpful information and emerging insights in regard to LLMs☆20Updated last year
- Simple Chainlit UI for running llms from Groq and LangChain☆17Updated 11 months ago
- Run language models on consumer hardware.☆25Updated last year
- A RAG powered web search with Tavily, LangChain, Mistral AI ( leveraging groq LPU) . The full stack web app build in Databutton.☆34Updated 11 months ago
- ☆54Updated 3 weeks ago
- HyDE based RAG using NVIDIA NIM.☆15Updated 11 months ago
- A Python library to orchestrate LLMs in a neural network-inspired structure☆46Updated 4 months ago
- ☆49Updated 8 months ago
- This is an AI agent framework that allows the user to manage a team of agents focused on a common goal.☆27Updated 7 months ago
- Embed anything.☆29Updated 8 months ago
- Uses a Gradio interface to stream coding related responses from local and cloud based large language models. Pulls context from GitHub Re…☆18Updated 5 months ago
- Function Calling Mistral 7B. Learn how to make functions call for open source LLMs.☆49Updated last year
- MCP Server implementation for Claude☆19Updated 2 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆89Updated 3 weeks ago
- multi agent team with coding and data analysis capability to structure real estate investment plans and help with decision making.☆10Updated 8 months ago
- Python Server for C3 AI app. A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) with…☆22Updated last year
- Streamlit Web UI for AGiXT☆26Updated 4 months ago
- A collection of pre-build wrappers over common RAG systems like ChromaDB, Weaviate, Pinecone, and othersz!☆30Updated last week
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks.☆31Updated last year
- Example LangGraph flow that does "competitor analysis" on the web.☆23Updated 8 months ago
- Tutorial for DSPy☆23Updated 9 months ago