NVIDIA / ChatRTXLinks
A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM
☆3,048Updated 5 months ago
Alternatives and similar repositories for ChatRTX
Users that are interested in ChatRTX are comparing it to the libraries listed below
Sorting:
- Home of StarCoder2!☆1,967Updated last year
- TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizati…☆11,531Updated this week
- ⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Pl…☆2,169Updated 11 months ago
- A fast inference library for running LLMs locally on modern consumer-class GPUs☆4,303Updated 3 weeks ago
- ☆1,000Updated 7 months ago
- [ICLR 2024] Efficient Streaming Language Models with Attention Sinks☆7,035Updated last year
- ☆1,554Updated last month
- lightweight, standalone C++ inference engine for Google's Gemma models.☆6,556Updated this week
- Run Mixtral-8x7B models in Colab or consumer desktops☆2,316Updated last year
- Official inference library for Mistral models☆10,452Updated 5 months ago
- OpenChat: Advancing Open-source Language Models with Imperfect Data☆5,429Updated 11 months ago
- A collection of standardized JSON descriptors for Large Language Model (LLM) files.☆798Updated last year
- Blazingly fast LLM inference.☆6,049Updated last week
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆2,898Updated last year
- Local AI API Platform☆2,761Updated 2 months ago
- Llama-3 agents that can browse the web by following instructions and talking to you☆1,413Updated 8 months ago
- LM Studio CLI☆3,631Updated this week
- Large Language Model Text Generation Inference☆10,477Updated this week
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆3,088Updated last week
- Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.☆3,386Updated last week
- StableSwarmUI, A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, …☆4,854Updated 6 months ago
- OpenCodeInterpreter is a suite of open-source code generation systems aimed at bridging the gap between large language models and sophist…☆1,676Updated last year
- Gemma open-weight LLM library, from Google DeepMind☆3,681Updated this week
- Official Pytorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.p…☆1,290Updated last month
- Large-scale LLM inference engine☆1,537Updated this week
- LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath☆9,457Updated 3 months ago
- The first open-source Artificial Narrow Intelligence generalist agentic framework Computer-Using-Agent that fully operates graphical-user…☆1,310Updated 6 months ago
- Agentic components of the Llama Stack APIs☆4,272Updated last month
- Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.☆6,071Updated 2 weeks ago
- [ICML'24] Magicoder: Empowering Code Generation with OSS-Instruct☆2,036Updated 10 months ago