NVIDIA / ChatRTXLinks
A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM
☆3,107Updated 2 weeks ago
Alternatives and similar repositories for ChatRTX
Users that are interested in ChatRTX are comparing it to the libraries listed below
Sorting:
- ⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Pl…☆2,174Updated last year
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆3,154Updated last week
- Home of StarCoder2!☆2,034Updated last year
- lightweight, standalone C++ inference engine for Google's Gemma models.☆6,721Updated last week
- ☆1,894Updated this week
- Run Mixtral-8x7B models in Colab or consumer desktops☆2,325Updated last year
- A collection of standardized JSON descriptors for Large Language Model (LLM) files.☆799Updated last year
- ☆1,027Updated last year
- Local AI API Platform☆2,762Updated 7 months ago
- [ICML'24] Magicoder: Empowering Code Generation with OSS-Instruct☆2,076Updated last year
- Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models☆2,852Updated last year
- OpenCodeInterpreter is a suite of open-source code generation systems aimed at bridging the gap between large language models and sophist…☆1,700Updated last year
- The first open-source Artificial Narrow Intelligence generalist agentic framework Computer-Using-Agent that fully operates graphical-user…☆1,326Updated 11 months ago
- LM Studio CLI☆4,148Updated last week
- Benchmark LLMs by fighting in Street Fighter 3! The new way to evaluate the quality of an LLM☆1,463Updated 10 months ago
- LLM powered development for VSCode☆1,316Updated last year
- Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.☆6,180Updated 5 months ago
- Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.☆3,756Updated last week
- Training LLMs with QLoRA + FSDP☆1,537Updated last year
- Modeling, training, eval, and inference code for OLMo☆6,299Updated 2 months ago
- Inference Llama 2 in one file of pure 🔥☆2,116Updated 2 months ago
- Official Pytorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.p…☆1,315Updated 5 months ago
- A fast inference library for running LLMs locally on modern consumer-class GPUs☆4,431Updated last month
- Generative AI extensions for onnxruntime☆953Updated this week
- On-device AI across mobile, embedded and edge for PyTorch☆4,226Updated this week
- This is a Phi Family of SLMs book for getting started with Phi Models. Phi a family of open sourced AI models developed by Microsoft. Phi…☆3,660Updated last week
- A framework for Claude Opus to intelligently orchestrate subagents.☆4,316Updated last year
- The official PyTorch implementation of Google's Gemma models☆5,601Updated 8 months ago
- Llama-3 agents that can browse the web by following instructions and talking to you☆1,406Updated last year
- PyTorch native post-training library☆5,660Updated this week