NVIDIA / ChatRTX
A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM
☆3,077 Updated 7 months ago
Alternatives and similar repositories for ChatRTX
Users who are interested in ChatRTX are comparing it to the libraries listed below.
- ☆1,009 Updated 9 months ago
- ⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Pl… ☆2,165 Updated last year
- lightweight, standalone C++ inference engine for Google's Gemma models. ☆6,609 Updated this week
- OpenChat: Advancing Open-source Language Models with Imperfect Data ☆5,439 Updated last year
- Run Mixtral-8x7B models in Colab or consumer desktops ☆2,326 Updated last year
- ☆1,802 Updated 2 weeks ago
- Granite Code Models: A Family of Open Foundation Models for Code Intelligence ☆1,236 Updated 4 months ago
- Home of StarCoder2! ☆1,981 Updated last year
- TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizat… ☆12,069 Updated this week
- [ICML'24] Magicoder: Empowering Code Generation with OSS-Instruct ☆2,052 Updated last year
- The official PyTorch implementation of Google's Gemma models ☆5,570 Updated 5 months ago
- Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture. ☆3,562 Updated this week
- A collection of standardized JSON descriptors for Large Language Model (LLM) files. ☆800 Updated last year
- The first open-source Artificial Narrow Intelligence generalist agentic framework Computer-Using-Agent that fully operates graphical-user… ☆1,308 Updated 8 months ago
- An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm. ☆4,983 Updated 6 months ago
- Simple and efficient pytorch-native transformer text generation in <1000 LOC of python. ☆6,140 Updated 2 months ago
- ☆3,035 Updated last year
- Build and run containers leveraging NVIDIA GPUs ☆3,806 Updated this week
- ☆1,549 Updated last year
- Python bindings for the Transformer models implemented in C/C++ using GGML library. ☆1,875 Updated last year
- Official Pytorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.p… ☆1,303 Updated 3 months ago
- TensorRT Extension for Stable Diffusion Web UI ☆1,988 Updated last year
- Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models ☆2,836 Updated 10 months ago
- Run any Llama 2 locally with gradio UI on GPU or CPU from anywhere (Linux/Windows/Mac). Use `llama2-wrapper` as your local llama2 backend… ☆1,954 Updated last year
- OpenCodeInterpreter is a suite of open-source code generation systems aimed at bridging the gap between large language models and sophist… ☆1,685 Updated last year
- LLM powered development for VSCode ☆1,304 Updated last year
- Training LLMs with QLoRA + FSDP ☆1,528 Updated last year
- Official Code for Stable Cascade ☆6,583 Updated last year
- Yes, it's another chat over documents implementation... but this one is entirely local! ☆1,808 Updated 7 months ago
- ☆1,028 Updated last year