NVIDIA / RTX-AI-Toolkit
The NVIDIA RTX™ AI Toolkit is a suite of tools and SDKs for Windows developers to customize, optimize, and deploy AI models across RTX PCs and cloud.
☆163 Updated 7 months ago
Alternatives and similar repositories for RTX-AI-Toolkit
Users who are interested in RTX-AI-Toolkit are comparing it to the libraries listed below.
- An NVIDIA AI Workbench example project for customizing an SDXL model ☆52 Updated last month
- An NVIDIA AI Workbench example project for Retrieval Augmented Generation (RAG) ☆326 Updated last month
- This reference can be used with any existing OpenAI integrated apps to run with TRT-LLM inference locally on GeForce GPU on Windows inste… ☆122 Updated last year
- automatically quant GGUF models ☆187 Updated this week
- Accelerate your Gen AI with NVIDIA NIM and NVIDIA AI Workbench ☆174 Updated 2 months ago
- An NVIDIA AI Workbench example project for fine-tuning a Nemotron-3 8B model ☆50 Updated last year
- llama.cpp fork used by GPT4All ☆56 Updated 4 months ago
- This is an NVIDIA AI Workbench example project that demonstrates an end-to-end model development workflow using Llamafactory. ☆61 Updated 9 months ago
- Cortex.Tensorrt-LLM is a C++ inference library that can be loaded by any server at runtime. It submodules NVIDIA’s TensorRT-LLM for GPU a… ☆43 Updated 9 months ago
- ☆95 Updated 6 months ago
- GRadient-INformed MoE ☆263 Updated 9 months ago
- Simple UI for Llama-3.2-11B-Vision & Molmo-7B-D ☆136 Updated 9 months ago
- Unsloth Studio ☆93 Updated 3 months ago
- ☆159 Updated 3 weeks ago
- Chat with Phi 3.5/3 Vision LLMs. Phi-3.5-vision is a lightweight, state-of-the-art open multimodal model built upon datasets which includ… ☆33 Updated 6 months ago
- API Server for Transformer Lab ☆68 Updated this week
- ☆267 Updated this week
- This is the Mixture-of-Agents (MoA) concept, adapted from the original work by TogetherAI. My version is tailored for local model usage a… ☆117 Updated last year
- Gradio based tool to run opensource LLM models directly from Huggingface ☆93 Updated last year
- ☆56 Updated 7 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs (Windows build & kernels) ☆93 Updated 3 weeks ago
- An NVIDIA AI Workbench example project for an Agentic Retrieval Augmented Generation (RAG) ☆87 Updated last month
- ☆259 Updated 3 weeks ago
- Tcurtsni: Reverse Instruction Chat, ever wonder what your LLM wants to ask you? ☆22 Updated last year
- ☆66 Updated last year
- ☆101 Updated 10 months ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre… ☆36 Updated 11 months ago
- A comprehensive platform for managing, testing, and leveraging Ollama AI models with advanced features for customization, workflow automa… ☆48 Updated 4 months ago
- Running Microsoft's BitNet via Electron, React & Astro ☆43 Updated last month
- ☆95 Updated 6 months ago