Running local Large Language Models (LLMs) to perform Retrieval-Augmented Generation (RAG)
☆271 · Jan 2, 2026 · Updated 2 months ago
Alternatives and similar repositories for local-LLM-with-RAG
Users who are interested in local-LLM-with-RAG are comparing it to the repositories listed below
- RAG Python Chat Bot: Gemini, Ollama, Streamlit with LangChain magic! 🤖💬 ☆40 · Feb 14, 2024 · Updated 2 years ago
- Examples for using the SiLLM framework for training and running Large Language Models (LLMs) on Apple Silicon ☆16 · May 8, 2025 · Updated 10 months ago
- Retrieval augmented generation demos with open-source DeepSeek, Llama, Qwen, Mistral, Gemma ☆42 · Aug 18, 2025 · Updated 6 months ago
- An easy-to-use ML pipeline package for Python inspired by scikit-learn pipeline and PyTorch layers. ☆12 · Aug 27, 2023 · Updated 2 years ago
- A local RAG LLM with persistent database to query your PDFs ☆16 · Feb 8, 2024 · Updated 2 years ago
- An offline CPU-first low-resource chat application to perform RAG on your corpus of data. Powered by OpenChat and CTranslate2. ☆14 · May 14, 2025 · Updated 9 months ago
- Simple demo for chatting with a PDF - and optionally point the RAG implementation to a local LLM ☆28 · Nov 29, 2023 · Updated 2 years ago
- Ingest files for retrieval augmented generation (RAG) with open-source Large Language Models (LLMs), all without 3rd parties or sensitive… ☆735 · Aug 12, 2024 · Updated last year
- A collection of AI-related Python scripts for things like training, RAG and agents. ☆29 · Mar 8, 2025 · Updated last year
- Chat effortlessly, execute commands, and interpret code with Llama3, Phi3, and more - your local AI assistant. Enjoy seamless interaction… ☆83 · Jun 30, 2024 · Updated last year
- Build your own RAG and run it locally on your laptop: ColBERT + DSPy + Streamlit ☆60 · Mar 14, 2024 · Updated last year
- Retrieval-Augmented Generation Chat Bot using Ollama, LangChain and Gradio. ☆36 · Mar 7, 2024 · Updated 2 years ago
- A set of scripts to build a RAG from the videos of a YouTube channel ☆22 · Feb 2, 2024 · Updated 2 years ago
- Simple Chat UI as well as chat with documents using LLMs with Ollama (mistral model) locally, LangChain and Chainlit ☆83 · Feb 17, 2024 · Updated 2 years ago
- Local PDF Chat Application with Mistral 7B LLM, LangChain, Ollama, and Streamlit ☆152 · Jul 10, 2024 · Updated last year
- Codes for various problems solved using Finite Difference Method and Finite Volume Method. ☆12 · Apr 6, 2016 · Updated 9 years ago
- LM Studio: RAG (Retrieval-Augmented Generation) Local LLM vs GPT-4 ☆21 · Jan 16, 2024 · Updated 2 years ago
- ☆21 · Jan 25, 2024 · Updated 2 years ago
- Active Response plugin. Osquery to execute wazuh/ossec active response plugins. You can write your own plugins, easy to plug ☆11 · Jun 20, 2020 · Updated 5 years ago
- Detecting emotions by analysing the sound of the person using Python ☆10 · Oct 7, 2019 · Updated 6 years ago
- This code implements a Local LLM Selector from the list of Local Installed Ollama LLMs for your specific user Query ☆105 · Nov 26, 2023 · Updated 2 years ago
- Retrieval Augmented Generation-based Agentic CrewAI ☆25 · Nov 10, 2024 · Updated last year
- BigBertha is an architecture design that demonstrates how automated LLMOps (Large Language Models Operations) can be achieved on any Kube… ☆28 · Oct 27, 2023 · Updated 2 years ago
- An MCP server that provides persistent memory capabilities through a local knowledge graph, enabling AI assistants to maintain context ac… ☆19 · Dec 20, 2025 · Updated 2 months ago
- The application of a Physics Informed Neural Network on modelling the parameters of a Continuously Stirred Tank Reactor, based on the dat… ☆16 · Jun 25, 2024 · Updated last year
- Simple agent framework using Ollama tool calling ☆10 · Aug 27, 2024 · Updated last year
- A Human-in-the-Loop Workflow for Scientific Schema Mining with Large Language Models ☆27 · Oct 2, 2025 · Updated 5 months ago
- Data extraction with LLM on CPU ☆112 · Jan 8, 2024 · Updated 2 years ago
- ☆20 · Jun 16, 2025 · Updated 8 months ago
- Demos for AI assistants using NLUX, Next.js, React, and Node.js ☆17 · Jun 24, 2024 · Updated last year
- Simple GUI to load a PDF/Docx/txt file and have LM Studio Answer based off of it. ☆14 · Jul 31, 2024 · Updated last year
- This repo contains the code for the tutorial for using the CrewAI agent framework to generate Sales Reports based on Salesforce data ☆14 · Mar 16, 2024 · Updated last year
- A local RAG demo ☆33 · Mar 18, 2024 · Updated last year
- Advanced AI functionalities, including tool usage, context aware similarity with Ollama models ☆19 · Aug 7, 2024 · Updated last year
- A simple GitHub Actions script that builds a llamafile and uploads it to Hugging Face ☆17 · Jan 11, 2024 · Updated 2 years ago
- Karpathy's llama2.c transpiled to MLX for Apple Silicon ☆14 · Dec 28, 2023 · Updated 2 years ago
- Google AI Chat is an AI-powered chat application built with Streamlit and Python. This application allows you to interact with Google AI … ☆15 · Dec 23, 2023 · Updated 2 years ago
- Powerful web application that combines Streamlit, LangChain, and Pinecone to simplify document analysis. Powered by OpenAI's GPT-3, RAG e… ☆130 · Jul 4, 2024 · Updated last year
- "Just hoof it!" - A spotlight like interface to Ollama ☆63 · Apr 5, 2024 · Updated last year