NVIDIA / ChatRTX
A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM
☆2,902 · Updated 6 months ago
Alternatives and similar repositories for ChatRTX:
Users who are interested in ChatRTX are comparing it to the libraries listed below.
- Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture. ☆2,832 · Updated this week
- TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain… ☆9,468 · Updated this week
- ☆1,359 · Updated last week
- A fast inference library for running LLMs locally on modern consumer-class GPUs ☆3,970 · Updated last week
- ⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Pl… ☆2,161 · Updated 4 months ago
- Home of StarCoder2! ☆1,864 · Updated 11 months ago
- Collection of notebook guides created by the Brev.dev team! ☆1,720 · Updated 2 months ago
- ☆930 · Updated 2 weeks ago
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLMs' perception of key information, compress the prompt and KV-Cache, which ach… ☆4,879 · Updated 3 weeks ago
- Knowledge Agents and Management in the Cloud ☆3,707 · Updated this week
- ☆8,572 · Updated 4 months ago
- Training LLMs with QLoRA + FSDP ☆1,451 · Updated 3 months ago
- Examples in the MLX framework ☆6,955 · Updated last week
- 👾 LM Studio TypeScript SDK (pre-release public alpha) ☆788 · Updated this week
- We introduced a new model designed for the Code generation task. Its test accuracy on the HumanEval base dataset surpasses that of GPT-4 … ☆834 · Updated 7 months ago
- 👾 LM Studio CLI ☆2,638 · Updated this week
- Tools for merging pretrained large language models. ☆5,273 · Updated last week
- Yes, it's another chat over documents implementation... but this one is entirely local! ☆1,722 · Updated 4 months ago
- Python bindings for the Transformer models implemented in C/C++ using GGML library. ☆1,842 · Updated last year
- Open source codebase powering the HuggingChat app ☆8,221 · Updated this week
- Go ahead and axolotl questions ☆8,648 · Updated this week
- This is a book for getting started with the Phi family of SLMs. Phi is a family of open-source AI models developed by Microsoft. Phi… ☆2,738 · Updated this week
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality ☆3,635 · Updated 6 months ago
- A collection of standardized JSON descriptors for Large Language Model (LLM) files. ☆797 · Updated 6 months ago
- Llama-3 agents that can browse the web by following instructions and talking to you ☆1,388 · Updated 2 months ago
- DeepSeek-VL: Towards Real-World Vision-Language Understanding ☆3,507 · Updated 9 months ago
- [ICLR 2024] Efficient Streaming Language Models with Attention Sinks ☆6,800 · Updated 7 months ago
- Letta (formerly MemGPT) is a framework for creating LLM services with memory. ☆14,565 · Updated this week
- Python bindings for llama.cpp ☆8,668 · Updated 3 weeks ago
- Provides end-to-end model development pipelines for LLMs and Multimodal models that can be launched on-prem or cloud-native. ☆490 · Updated 3 weeks ago