A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM
☆3,127 · Jan 21, 2026 · Updated 5 months ago
Alternatives and similar repositories for ChatRTX
Users who are interested in ChatRTX are comparing it to the libraries listed below.
- TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizat… ☆13,941 · Updated this week
- Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture. ☆4,092 · May 29, 2026 · Updated last month
- Open-source desktop app for local LLMs. Text, vision, tool-calling, OpenAI/Anthropic-compatible API. 100% private. ☆47,360 · Jun 2, 2026 · Updated 3 weeks ago
- LlamaIndex is the leading document agent and OCR platform ☆50,340 · Jun 20, 2026 · Updated last week
- Get up and running with Kimi-K2.6, GLM-5.1, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models. ☆174,889 · Updated this week
- Official Code for Stable Cascade ☆6,548 · Jul 25, 2024 · Updated last year
- LLM inference in C/C++ ☆118,422 · Updated this week
- A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Auto… ☆17,456 · Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆83,677 · Updated this week
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als… ☆18,378 · May 19, 2026 · Updated last month
- Stop renting your intelligence. Own it with AnythingLLM. Everything you need for a powerful local-first agent experience ☆62,169 · Updated this week
- Inference code for CodeLlama models ☆16,308 · Aug 12, 2024 · Updated last year
- Jan is an open source alternative to ChatGPT that runs 100% offline on your computer. ☆43,195 · Updated this week
- Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally. ☆67,133 · Updated this week
- TensorRT Extension for Stable Diffusion Web UI ☆1,994 · Jun 14, 2024 · Updated 2 years ago
- Large Language Model Text Generation Inference ☆10,862 · Mar 21, 2026 · Updated 3 months ago
- A programming framework for agentic AI ☆59,261 · Apr 15, 2026 · Updated 2 months ago
- Universal LLM Deployment Engine with ML Compilation ☆22,863 · May 11, 2026 · Updated last month
- A lightweight coding agent for open models like Deepseek, Kimi, and Qwen ☆64,111 · Jun 20, 2026 · Updated last week
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...) ☆142,584 · Jun 19, 2026 · Updated last week
- Official inference library for Mistral models ☆10,823 · Jun 16, 2026 · Updated last week
- Distribute and run LLMs with a single file. ☆25,105 · Updated this week
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena. ☆39,486 · May 1, 2026 · Updated last month
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations. ☆29,534 · Sep 30, 2025 · Updated 8 months ago
- Record voice notes & transcribe, summarize, and get tasks ☆2,143 · May 20, 2026 · Updated last month
- Perplexity Inspired Answer Engine ☆5,026 · Apr 29, 2026 · Updated last month
- Starter-kit to build constrained agents with Nextjs, FastAPI and Langchain ☆1,945 · Mar 18, 2026 · Updated 3 months ago
- The open source codebase powering HuggingChat ☆10,786 · Updated this week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale. ☆13,430 · Jun 18, 2026 · Updated last week
- Large World Model -- Modeling Text and Video with Millions Context ☆7,420 · Oct 19, 2024 · Updated last year
- GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use. ☆77,370 · May 27, 2025 · Updated last year
- The agent engineering platform. ☆140,319 · Updated this week
- 🙌 OpenHands: AI-Driven Development ☆78,051 · Updated this week
- High-speed Large Language Model Serving for Local Deployment ☆9,586 · May 11, 2026 · Updated last month
- Build, run, and manage agent platforms. ☆40,861 · Updated this week
- DSPy: The framework for programming—not prompting—language models ☆35,310 · Jun 18, 2026 · Updated last week
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond. ☆24,881 · Aug 12, 2024 · Updated last year
- Platform for stateful agents: AI with advanced memory that can learn and self-improve over time. ☆23,543 · Updated this week
- ☆1,037 · May 26, 2026 · Updated last month