NVIDIA / ChatRTXLinks
A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM
☆3,060Updated 5 months ago
Alternatives and similar repositories for ChatRTX
Users that are interested in ChatRTX are comparing it to the libraries listed below
Sorting:
- TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizati…☆11,707Updated this week
- Home of StarCoder2!☆1,975Updated last year
- High-speed Large Language Model Serving for Local Deployment☆8,338Updated last month
- ⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Pl…☆2,168Updated 11 months ago
- ☆1,003Updated 7 months ago
- A fast inference library for running LLMs locally on modern consumer-class GPUs☆4,326Updated last month
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆2,900Updated last year
- ☆1,759Updated last week
- [ICLR 2024] Efficient Streaming Language Models with Attention Sinks☆7,051Updated last year
- Granite Code Models: A Family of Open Foundation Models for Code Intelligence☆1,226Updated 3 months ago
- ☆3,028Updated last year
- OpenCodeInterpreter is a suite of open-source code generation systems aimed at bridging the gap between large language models and sophist…☆1,681Updated last year
- A collection of standardized JSON descriptors for Large Language Model (LLM) files.☆800Updated last year
- Official inference library for Mistral models☆10,479Updated 6 months ago
- Run Mixtral-8x7B models in Colab or consumer desktops☆2,324Updated last year
- Go ahead and axolotl questions☆10,496Updated this week
- Training LLMs with QLoRA + FSDP☆1,529Updated 10 months ago
- lightweight, standalone C++ inference engine for Google's Gemma models.☆6,572Updated this week
- PyTorch native post-training library☆5,508Updated this week
- OpenChat: Advancing Open-source Language Models with Imperfect Data☆5,432Updated last year
- The first open-source Artificial Narrow Intelligence generalist agentic framework Computer-Using-Agent that fully operates graphical-user…☆1,310Updated 7 months ago
- Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.☆3,423Updated last week
- Run GGUF models easily with a KoboldAI UI. One File. Zero Install.☆8,597Updated this week
- Gemma open-weight LLM library, from Google DeepMind☆3,728Updated this week
- Tools for merging pretrained large language models.☆6,323Updated last week
- Foundational model for human-like, expressive TTS☆4,167Updated last year
- Benchmark LLMs by fighting in Street Fighter 3! The new way to evaluate the quality of an LLM☆1,448Updated 6 months ago
- Examples in the MLX framework☆7,881Updated 3 weeks ago
- Official Pytorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.p…☆1,297Updated last month
- [ICML'24] Magicoder: Empowering Code Generation with OSS-Instruct☆2,049Updated 10 months ago