A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM
☆3,110Jan 21, 2026Updated last month
Alternatives and similar repositories for ChatRTX
Users that are interested in ChatRTX are comparing it to the libraries listed below
Sorting:
- TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizat…☆12,938Updated this week
- Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.☆3,802Feb 5, 2026Updated 3 weeks ago
- The definitive Web UI for local AI, with powerful features and easy setup.☆46,091Feb 3, 2026Updated 3 weeks ago
- LlamaIndex is the leading document agent and OCR platform☆47,210Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆71,234Updated this week
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als…☆18,220Nov 3, 2025Updated 3 months ago
- A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Auto…☆16,807Updated this week
- LLM inference in C/C++☆95,726Updated this week
- Official Code for Stable Cascade☆6,577Jul 25, 2024Updated last year
- Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.☆163,045Updated this week
- The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.☆54,878Feb 21, 2026Updated last week
- Inference code for CodeLlama models☆16,346Aug 12, 2024Updated last year
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.☆52,724Updated this week
- Large Language Model Text Generation Inference☆10,774Jan 8, 2026Updated last month
- Jan is an open source alternative to ChatGPT that runs 100% offline on your computer.☆40,672Updated this week
- Official inference library for Mistral models☆10,683Nov 21, 2025Updated 3 months ago
- Universal LLM Deployment Engine with ML Compilation☆22,061Feb 18, 2026Updated last week
- A programming framework for agentic AI☆54,683Jan 22, 2026Updated last month
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆13,182Updated this week
- High-speed Large Language Model Serving for Local Deployment☆8,729Jan 24, 2026Updated last month
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.☆24,478Aug 12, 2024Updated last year
- Distribute and run LLMs with a single file.☆23,742Updated this week
- Run Mixtral-8x7B models in Colab or consumer desktops☆2,326Apr 8, 2024Updated last year
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.☆27,918Sep 30, 2025Updated 4 months ago
- Go ahead and axolotl questions☆11,335Updated this week
- TensorRT Extension for Stable Diffusion Web UI☆1,996Jun 14, 2024Updated last year
- Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work t…☆44,662Updated this week
- Retrieval Augmented Generation (RAG) chatbot powered by Weaviate☆7,574Jul 14, 2025Updated 7 months ago
- A natural language interface for computers☆62,427Feb 9, 2026Updated 2 weeks ago
- The open source codebase powering HuggingChat☆10,523Updated this week
- DSPy: The framework for programming—not prompting—language models☆32,381Updated this week
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing a…☆37,083Updated this week
- 🙌 OpenHands: AI-Driven Development☆68,154Updated this week
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.☆39,414Jun 2, 2025Updated 8 months ago
- The programming language for agentic software. Build, run, and manage multi-agent systems at scale.☆38,104Updated this week
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...)☆124,763Updated this week
- SGLang is a high-performance serving framework for large language models and multimodal models.☆23,658Updated this week
- Perplexity Inspired Answer Engine☆5,015Jun 27, 2025Updated 8 months ago
- Large World Model -- Modeling Text and Video with Millions Context☆7,399Oct 19, 2024Updated last year