NVIDIA / RTX-AI-Toolkit
The NVIDIA RTX™ AI Toolkit is a suite of tools and SDKs for Windows developers to customize, optimize, and deploy AI models across RTX PCs and cloud.
☆143Updated 3 months ago
Alternatives and similar repositories for RTX-AI-Toolkit:
Users that are interested in RTX-AI-Toolkit are comparing it to the libraries listed below
- An NVIDIA AI Workbench example project for customizing an SDXL model☆45Updated 4 months ago
- llama.cpp fork used by GPT4All☆53Updated 3 weeks ago
- This reference can be used with any existing OpenAI integrated apps to run with TRT-LLM inference locally on GeForce GPU on Windows inste…☆119Updated last year
- Cortex.Tensorrt-LLM is a C++ inference library that can be loaded by any server at runtime. It submodules NVIDIA’s TensorRT-LLM for GPU a…☆43Updated 5 months ago
- This is an NVIDIA AI Workbench example project that demonstrates an end-to-end model development workflow using Llamafactory.☆50Updated 4 months ago
- ☆140Updated 2 weeks ago
- Accelerate your Gen AI with NVIDIA NIM and NVIDIA AI Workbench☆151Updated last week
- An NVIDIA AI Workbench example project for Retrieval Augmented Generation (RAG)☆306Updated 3 months ago
- An NVIDIA AI Workbench example project for fine-tuning a Mistral 7B model☆53Updated 9 months ago
- automatically quant GGUF models☆160Updated this week
- An NVIDIA AI Workbench example project for an Agentic Retrieval Augmented Generation (RAG)☆61Updated last month
- Simple UI for Llama-3.2-11B-Vision & Molmo-7B-D☆134Updated 5 months ago
- ☆103Updated 4 months ago
- Collection of reference workflows for building intelligent agents with NIMs☆149Updated last month
- ☆55Updated 3 months ago
- Tcurtsni: Reverse Instruction Chat, ever wonder what your LLM wants to ask you?☆21Updated 8 months ago
- Zero-copy multimodal vector DB with CUDA and CLIP/SigLIP☆48Updated 9 months ago
- ☆99Updated 6 months ago
- ☆124Updated this week
- An NVIDIA AI Workbench example project for fine-tuning a Nemotron-3 8B model☆45Updated 10 months ago
- VLM driven tool that processes surveillance videos, extracts frames, and generates insightful annotations using a fine-tuned Florence-2 V…☆103Updated 5 months ago
- Lightweight continuous batching OpenAI compatibility using HuggingFace Transformers include T5 and Whisper.☆21Updated this week
- General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). …☆44Updated 3 weeks ago
- Get up and running with Llama 3, Mistral, Gemma, and other large language models.☆26Updated this week
- ☆91Updated 2 months ago
- AMD related optimizations for transformer models☆69Updated 4 months ago
- Unsloth Studio☆69Updated this week
- GRadient-INformed MoE☆261Updated 5 months ago
- Gradio based tool to run opensource LLM models directly from Huggingface☆91Updated 8 months ago