NVIDIA / RTX-AI-ToolkitLinks
The NVIDIA RTX™ AI Toolkit is a suite of tools and SDKs for Windows developers to customize, optimize, and deploy AI models across RTX PCs and cloud.
☆176Updated last year
Alternatives and similar repositories for RTX-AI-Toolkit
Users that are interested in RTX-AI-Toolkit are comparing it to the libraries listed below
Sorting:
- An NVIDIA AI Workbench example project for customizing an SDXL model☆57Updated 2 weeks ago
- This reference can be used with any existing OpenAI integrated apps to run with TRT-LLM inference locally on GeForce GPU on Windows inste…☆127Updated last year
- This is an NVIDIA AI Workbench example project that demonstrates an end-to-end model development workflow using Llamafactory.☆67Updated last year
- Accelerate your Gen AI with NVIDIA NIM and NVIDIA AI Workbench☆188Updated 6 months ago
- An NVIDIA AI Workbench example project for fine-tuning a Nemotron-3 8B model☆56Updated last year
- Simple UI for Llama-3.2-11B-Vision & Molmo-7B-D☆136Updated last year
- llama.cpp fork used by GPT4All☆56Updated 8 months ago
- An NVIDIA AI Workbench example project for Retrieval Augmented Generation (RAG)☆344Updated 3 months ago
- Customizable, AI-driven virtual assistant designed to streamline customer service operations, handle common inquiries, and improve overal…☆205Updated 4 months ago
- An NVIDIA AI Workbench example project for fine-tuning a Mistral 7B model☆64Updated last year
- ☆174Updated this week
- Gradio based tool to run opensource LLM models directly from Huggingface☆96Updated last year
- Running Microsoft's BitNet via Electron, React & Astro☆46Updated last month
- Cortex.Tensorrt-LLM is a C++ inference library that can be loaded by any server at runtime. It submodules NVIDIA’s TensorRT-LLM for GPU a…☆42Updated last year
- GRadient-INformed MoE☆264Updated last year
- API Server for Transformer Lab☆78Updated this week
- Unsloth Studio☆116Updated 7 months ago
- ☆94Updated 11 months ago
- ☆55Updated 11 months ago
- Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector…☆332Updated last year
- automatically quant GGUF models☆214Updated 3 weeks ago
- Multimodal AI agent with Llama 3.2: A Streamlit app that processes text, images, PDFs, and PPTs, integrating NIM microservices, Milvus, a…☆134Updated last year
- GPT-4 Level Conversational QA Trained In a Few Hours☆65Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆34Updated last year
- Phi4 Multimodal Instruct - OpenAI endpoint and Docker Image for self-hosting☆40Updated 8 months ago
- Run LLMs in the Browser with MLC / WebLLM ✨☆145Updated last year
- Lightweight continuous batching OpenAI compatibility using HuggingFace Transformers include T5 and Whisper.☆29Updated 8 months ago
- ☆102Updated last year
- ☆477Updated this week
- Collection of reference workflows for building intelligent agents with NIMs☆178Updated 10 months ago