NVIDIA / ChatRTXLinks
A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM
☆3,097Updated 9 months ago
Alternatives and similar repositories for ChatRTX
Users that are interested in ChatRTX are comparing it to the libraries listed below
Sorting:
- ⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Pl…☆2,172Updated last year
- Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.☆3,695Updated last week
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆3,155Updated last week
- A collection of standardized JSON descriptors for Large Language Model (LLM) files.☆799Updated last year
- Agentic components of the Llama Stack APIs☆4,288Updated 5 months ago
- ☆1,025Updated 11 months ago
- Olive: Simplify ML Model Finetuning, Conversion, Quantization, and Optimization for CPUs, GPUs and NPUs.☆2,219Updated last week
- Local AI API Platform☆2,763Updated 6 months ago
- Run Mixtral-8x7B models in Colab or consumer desktops☆2,328Updated last year
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als…☆18,136Updated 2 months ago
- ☆1,868Updated this week
- TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizat…☆12,588Updated this week
- The first open-source Artificial Narrow Intelligence generalist agentic framework Computer-Using-Agent that fully operates graphical-user…☆1,320Updated 10 months ago
- OpenCodeInterpreter is a suite of open-source code generation systems aimed at bridging the gap between large language models and sophist…☆1,695Updated last year
- 🤗 AutoTrain Advanced☆4,548Updated 11 months ago
- Home of StarCoder2!☆2,024Updated last year
- This is a Phi Family of SLMs book for getting started with Phi Models. Phi a family of open sourced AI models developed by Microsoft. Phi…☆3,641Updated this week
- Build ChatGPT over your data, all with natural language☆6,524Updated last year
- Yes, it's another chat over documents implementation... but this one is entirely local!☆1,808Updated last month
- [ICML'24] Magicoder: Empowering Code Generation with OSS-Instruct☆2,075Updated last year
- Training LLMs with QLoRA + FSDP☆1,537Updated last year
- Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering""☆3,911Updated last year
- A fast inference library for running LLMs locally on modern consumer-class GPUs☆4,403Updated last month
- ☆1,026Updated 2 years ago
- Official Pytorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.p…☆1,313Updated 5 months ago
- Nomic Developer API SDK☆1,860Updated last month
- The #1 open-source voice interface for desktop, mobile, and ESP32 chips.☆5,105Updated last year
- This repository is deprecated and will be archived☆2,227Updated this week
- ☆1,550Updated last year
- Python bindings for the Transformer models implemented in C/C++ using GGML library.☆1,876Updated last year