NVIDIA / RTX-AI-Toolkit
The NVIDIA RTX™ AI Toolkit is a suite of tools and SDKs for Windows developers to customize, optimize, and deploy AI models across RTX PCs and cloud.
☆158 · Updated 6 months ago
Alternatives and similar repositories for RTX-AI-Toolkit
Users interested in RTX-AI-Toolkit are comparing it to the libraries listed below.
- An NVIDIA AI Workbench example project for customizing an SDXL model ☆52 · Updated 3 weeks ago
- This reference can be used with any existing OpenAI-integrated apps to run with TRT-LLM inference locally on a GeForce GPU on Windows inste… ☆121 · Updated last year
- An NVIDIA AI Workbench example project that demonstrates an end-to-end model development workflow using Llamafactory. ☆56 · Updated 7 months ago
- An NVIDIA AI Workbench example project for Retrieval-Augmented Generation (RAG) ☆317 · Updated 3 weeks ago
- llama.cpp fork used by GPT4All ☆55 · Updated 3 months ago
- Accelerate your Gen AI with NVIDIA NIM and NVIDIA AI Workbench ☆165 · Updated last month
- Customizable, AI-driven virtual assistant designed to streamline customer service operations, handle common inquiries, and improve overal… ☆135 · Updated 3 months ago
- A comprehensive platform for managing, testing, and leveraging Ollama AI models with advanced features for customization, workflow automa… ☆48 · Updated 2 months ago
- A curated list of OpenVINO-based AI projects ☆134 · Updated 5 months ago
- Gradio-based tool to run open-source LLM models directly from Hugging Face ☆91 · Updated 11 months ago
- Blueprint for ingesting massive volumes of live or archived videos and extracting insights for summarization and interactive Q&A ☆97 · Updated last month
- ☆156 · Updated last week
- An NVIDIA AI Workbench example project for fine-tuning a Mistral 7B model ☆57 · Updated last year
- An NVIDIA AI Workbench example project for fine-tuning a Nemotron-3 8B model ☆49 · Updated last year
- Cortex.Tensorrt-LLM is a C++ inference library that can be loaded by any server at runtime. It includes NVIDIA's TensorRT-LLM as a submodule for GPU a… ☆43 · Updated 8 months ago
- Service for testing out the new Qwen2.5 Omni model ☆49 · Updated last month
- Serving LLMs in the HF-Transformers format via a PyFlask API ☆71 · Updated 8 months ago
- ☆101 · Updated 9 months ago
- LM Studio JSON configuration file format and a collection of example config files. ☆199 · Updated 10 months ago
- ☆90 · Updated 5 months ago
- Chat with Phi 3.5/3 Vision LLMs. Phi-3.5-vision is a lightweight, state-of-the-art open multimodal model built upon datasets which includ… ☆33 · Updated 5 months ago
- Automatically quantize GGUF models ☆181 · Updated this week
- Unsloth Studio ☆87 · Updated 2 months ago
- Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector… ☆271 · Updated 7 months ago
- An innovative library for efficient LLM inference via low-bit quantization ☆348 · Updated 9 months ago
- ☆129 · Updated last month
- ☆160 · Updated 3 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆30 · Updated last year
- An NVIDIA AI Workbench example project for Agentic Retrieval-Augmented Generation (RAG) ☆80 · Updated 3 weeks ago
- Tool to download models from the Hugging Face Hub and convert them to GGML/GGUF for llama.cpp ☆148 · Updated last month
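Several projects above (the TRT-LLM reference for OpenAI-integrated apps, the llama.cpp-based servers, the vLLM-style serving engines) rely on the same integration pattern: a local server exposes the OpenAI-compatible HTTP API, so an existing app only needs its base URL redirected away from `api.openai.com`. A minimal sketch of the request an app would send, assuming a local server at `localhost:8000` (the port and the `local-model` name are placeholders, not values from any listed project):

```python
import json

# Assumed address of a local OpenAI-compatible inference server.
LOCAL_BASE_URL = "http://localhost:8000/v1"

def chat_request(model: str, prompt: str) -> tuple[str, bytes]:
    """Build the (url, JSON body) pair for a /chat/completions call,
    identical in shape to a request an OpenAI-integrated app sends."""
    url = f"{LOCAL_BASE_URL}/chat/completions"
    body = json.dumps({
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }).encode("utf-8")
    return url, body

url, body = chat_request("local-model", "Hello")
```

An app configured this way POSTs `body` to `url` instead of to `api.openai.com`; the rest of its OpenAI integration (message format, response parsing) stays unchanged, which is what makes these local backends drop-in replacements.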