NVIDIA / ChatRTXLinks
A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM
☆3,004Updated 2 months ago
Alternatives and similar repositories for ChatRTX
Users that are interested in ChatRTX are comparing it to the libraries listed below
Sorting:
- TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizati…☆10,734Updated this week
- Home of StarCoder2!☆1,921Updated last year
- ⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Pl…☆2,169Updated 8 months ago
- Run Mixtral-8x7B models in Colab or consumer desktops☆2,312Updated last year
- A collection of standardized JSON descriptors for Large Language Model (LLM) files.☆798Updated 10 months ago
- A fast inference library for running LLMs locally on modern consumer-class GPUs☆4,207Updated last week
- ☆3,692Updated last month
- Benchmark LLMs by fighting in Street Fighter 3! The new way to evaluate the quality of an LLM☆1,432Updated 2 months ago
- ☆2,956Updated 9 months ago
- lightweight, standalone C++ inference engine for Google's Gemma models.☆6,466Updated this week
- Tools for merging pretrained large language models.☆5,809Updated this week
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆3,054Updated last week
- ☆1,816Updated last week
- Llama-3 agents that can browse the web by following instructions and talking to you☆1,404Updated 6 months ago
- Large Language Model Text Generation Inference☆10,216Updated this week
- Large-scale LLM inference engine☆1,446Updated this week
- ☆976Updated 4 months ago
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆2,882Updated last year
- An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.☆4,867Updated 2 months ago
- LLM powered development for VSCode☆1,302Updated 11 months ago
- The first open-source Artificial Narrow Intelligence generalist agentic framework Computer-Using-Agent that fully operates graphical-user…☆1,303Updated 4 months ago
- On-device AI across mobile, embedded and edge for PyTorch☆2,953Updated this week
- PyTorch native post-training library☆5,257Updated this week
- OpenChat: Advancing Open-source Language Models with Imperfect Data☆5,358Updated 9 months ago
- LM Studio CLI☆3,184Updated last week
- Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.☆3,183Updated last week
- The official PyTorch implementation of Google's Gemma models☆5,481Updated 2 weeks ago
- Training LLMs with QLoRA + FSDP☆1,484Updated 7 months ago
- Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models☆2,760Updated 5 months ago
- [ICML'24] Magicoder: Empowering Code Generation with OSS-Instruct☆2,019Updated 7 months ago