A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM
☆3,125Jan 21, 2026Updated 4 months ago
Alternatives and similar repositories for ChatRTX
Users that are interested in ChatRTX are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizat…☆13,793Updated this week
- Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.☆4,056May 29, 2026Updated last week
- Open-source desktop app for local LLMs. Text, vision, tool-calling, OpenAI/Anthropic-compatible API. 100% private.☆47,265Jun 2, 2026Updated last week
- LlamaIndex is the leading document agent and OCR platform☆49,909Updated this week
- Get up and running with Kimi-K2.6, GLM-5.1, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.☆173,296Updated this week
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Official Code for Stable Cascade☆6,550Jul 25, 2024Updated last year
- LLM inference in C/C++☆114,217Updated this week
- A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Auto…☆17,292Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆81,909Updated this week
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als…☆18,343May 19, 2026Updated 2 weeks ago
- Stop renting your intelligence. Own it with AnythingLLM. Everything you need for a powerful local-first agent experience☆61,223Updated this week
- Inference code for CodeLlama models☆16,318Aug 12, 2024Updated last year
- Jan is an open source alternative to ChatGPT that runs 100% offline on your computer.☆42,869Jun 2, 2026Updated last week
- Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.☆65,620Updated this week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- TensorRT Extension for Stable Diffusion Web UI☆1,994Jun 14, 2024Updated last year
- Large Language Model Text Generation Inference☆10,859Mar 21, 2026Updated 2 months ago
- A programming framework for agentic AI☆58,726Apr 15, 2026Updated last month
- Universal LLM Deployment Engine with ML Compilation☆22,770May 11, 2026Updated 3 weeks ago
- A natural language interface for computers☆63,803Updated this week
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...)☆139,661Jun 2, 2026Updated last week
- Official inference library for Mistral models☆10,813Apr 20, 2026Updated last month
- Distribute and run LLMs with a single file.☆24,611Jun 1, 2026Updated last week
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.☆39,479May 1, 2026Updated last month
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.☆28,333Sep 30, 2025Updated 8 months ago
- Record voice notes & transcribe, summarize, and get tasks☆2,140May 20, 2026Updated 2 weeks ago
- Perplexity Inspired Answer Engine☆5,024Apr 29, 2026Updated last month
- Starter-kit to build constrained agents with Nextjs, FastAPI and Langchain☆1,944Mar 18, 2026Updated 2 months ago
- The open source codebase powering HuggingChat☆10,754Updated this week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆13,397Jun 1, 2026Updated last week
- Large World Model -- Modeling Text and Video with Millions Context☆7,415Oct 19, 2024Updated last year
- 🙌 OpenHands: AI-Driven Development☆75,701Updated this week
- GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.☆77,354May 27, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The agent engineering platform.☆138,777Updated this week
- High-speed Large Language Model Serving for Local Deployment☆9,522May 11, 2026Updated 3 weeks ago
- Build, run, and manage agent platforms.☆40,558Updated this week
- DSPy: The framework for programming—not prompting—language models☆34,811Updated this week
- Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time.☆23,204May 14, 2026Updated 3 weeks ago
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.☆24,848Aug 12, 2024Updated last year
- ☆1,039May 26, 2026Updated last week