NVIDIA / ChatRTX
A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM
☆2,982Updated 2 months ago
Alternatives and similar repositories for ChatRTX
Users who are interested in ChatRTX are comparing it to the libraries listed below.
- ☆1,458Updated 2 weeks ago
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆3,053Updated last week
- Retrieval Augmented Generation (RAG) chatbot powered by Weaviate☆7,140Updated 2 months ago
- ☆2,952Updated 8 months ago
- Yes, it's another chat over documents implementation... but this one is entirely local!☆1,770Updated 2 months ago
- Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Llama-3, Langchain, OpenAI, Upstash, Brave & Serper☆4,900Updated 8 months ago
- Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering"☆3,830Updated 6 months ago
- Run Mixtral-8x7B models in Colab or consumer desktops☆2,311Updated last year
- Llama-3 agents that can browse the web by following instructions and talking to you☆1,405Updated 5 months ago
- AirLLM 70B inference with single 4GB GPU☆5,778Updated last month
- LM Studio CLI☆3,126Updated last week
- ☆4,130Updated last month
- H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/☆4,316Updated last month
- Large Language Model Text Generation Inference☆10,172Updated last week
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLMs' perception of key information, compress the prompt and KV-Cache, which ach…☆5,127Updated 2 months ago
- Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in PyTorch☆1,834Updated 2 months ago
- Python bindings for the Transformer models implemented in C/C++ using GGML library.☆1,866Updated last year
- Official inference library for Mistral models☆10,275Updated 2 months ago
- A collection of standardized JSON descriptors for Large Language Model (LLM) files.☆798Updated 10 months ago
- A framework for Claude Opus to intelligently orchestrate subagents.☆4,233Updated 11 months ago
- ☆971Updated 4 months ago
- Large Action Model framework to develop AI Web Agents☆6,067Updated 4 months ago
- Training LLMs with QLoRA + FSDP☆1,483Updated 6 months ago
- A fast inference library for running LLMs locally on modern consumer-class GPUs☆4,202Updated this week
- A code-first agent framework for seamlessly planning and executing data analytics tasks.☆5,747Updated 2 weeks ago
- [ICML'24] Magicoder: Empowering Code Generation with OSS-Instruct☆2,017Updated 7 months ago
- ☆1,025Updated last year
- An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.☆4,857Updated last month
- Official PyTorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.p…☆1,259Updated last month
- OpenCodeInterpreter is a suite of open-source code generation systems aimed at bridging the gap between large language models and sophist…☆1,659Updated last year