NVIDIA / ChatRTXLinks
A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM
☆3,084Updated 8 months ago
Alternatives and similar repositories for ChatRTX
Users that are interested in ChatRTX are comparing it to the libraries listed below
Sorting:
- Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.☆3,665Updated last week
- ☆1,015Updated 10 months ago
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆3,152Updated last week
- ☆1,551Updated last year
- Home of StarCoder2!☆2,008Updated last year
- Local AI API Platform☆2,764Updated 5 months ago
- TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizat…☆12,385Updated last week
- A collection of standardized JSON descriptors for Large Language Model (LLM) files.☆799Updated last year
- ☆1,845Updated last week
- LM Studio CLI☆3,969Updated this week
- Official Code for Stable Cascade☆6,588Updated last year
- A fast inference library for running LLMs locally on modern consumer-class GPUs☆4,388Updated last week
- Official Pytorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.p…☆1,309Updated 4 months ago
- OpenChat: Advancing Open-source Language Models with Imperfect Data☆5,461Updated last year
- ⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Pl…☆2,170Updated last year
- The first open-source Artificial Narrow Intelligence generalist agentic framework Computer-Using-Agent that fully operates graphical-user…☆1,313Updated 10 months ago
- lightweight, standalone C++ inference engine for Google's Gemma models.☆6,643Updated this week
- Benchmark LLMs by fighting in Street Fighter 3! The new way to evaluate the quality of an LLM☆1,456Updated 8 months ago
- Run PyTorch LLMs locally on servers, desktop and mobile☆3,624Updated 3 months ago
- Official inference library for Mistral models☆10,600Updated 3 weeks ago
- Gemma open-weight LLM library, from Google DeepMind☆3,879Updated last month
- The official PyTorch implementation of Google's Gemma models☆5,586Updated 6 months ago
- Accelerate your Hugging Face Transformers 7.6-9x. Native to Hugging Face and PyTorch.☆686Updated last year
- Large World Model -- Modeling Text and Video with Millions Context☆7,383Updated last year
- Llama-3 agents that can browse the web by following instructions and talking to you☆1,407Updated last year
- Yes, it's another chat over documents implementation... but this one is entirely local!☆1,809Updated 2 weeks ago
- Perplexity Inspired Answer Engine☆5,009Updated 5 months ago
- Training LLMs with QLoRA + FSDP☆1,534Updated last year
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆3,568Updated 6 months ago
- Inference Llama 2 in one file of pure 🔥☆2,115Updated 2 weeks ago