NVIDIA / RTX-AI-ToolkitLinks
The NVIDIA RTX™ AI Toolkit is a suite of tools and SDKs for Windows developers to customize, optimize, and deploy AI models across RTX PCs and cloud.
☆161Updated 7 months ago
Alternatives and similar repositories for RTX-AI-Toolkit
Users that are interested in RTX-AI-Toolkit are comparing it to the libraries listed below
Sorting:
- An NVIDIA AI Workbench example project for customizing an SDXL model☆52Updated 2 weeks ago
- This reference can be used with any existing OpenAI integrated apps to run with TRT-LLM inference locally on GeForce GPU on Windows inste…☆122Updated last year
- This is an NVIDIA AI Workbench example project that demonstrates an end-to-end model development workflow using Llamafactory.☆60Updated 8 months ago
- Gradio based tool to run opensource LLM models directly from Huggingface☆93Updated last year
- An NVIDIA AI Workbench example project for Retrieval Augmented Generation (RAG)☆325Updated 3 weeks ago
- ☆158Updated this week
- Cortex.Tensorrt-LLM is a C++ inference library that can be loaded by any server at runtime. It submodules NVIDIA’s TensorRT-LLM for GPU a…☆43Updated 9 months ago
- Zero-copy multimodal vector DB with CUDA and CLIP/SigLIP☆59Updated last month
- The NVIDIA AIQToolkit UI streamlines interacting with AIQToolkit workflows in an easy-to-use web application.☆32Updated this week
- Blueprint for Ingesting massive volumes of live or archived videos and extract insights for summarization and interactive Q&A☆122Updated last month
- GRadient-INformed MoE☆263Updated 9 months ago
- Simple examples using Argilla tools to build AI☆53Updated 7 months ago
- Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector…☆284Updated 8 months ago
- ☆66Updated last year
- A curated list of OpenVINO based AI projects☆138Updated 2 weeks ago
- Collection of reference workflows for building intelligent agents with NIMs☆161Updated 5 months ago
- automatically quant GGUF models☆184Updated last week
- GPT-4 Level Conversational QA Trained In a Few Hours☆62Updated 10 months ago
- Accelerate your Gen AI with NVIDIA NIM and NVIDIA AI Workbench☆171Updated last month
- An NVIDIA AI Workbench example project for fine-tuning a Nemotron-3 8B model☆49Updated last year
- Tcurtsni: Reverse Instruction Chat, ever wonder what your LLM wants to ask you?☆22Updated last year
- ☆95Updated 6 months ago
- Chat with Phi 3.5/3 Vision LLMs. Phi-3.5-vision is a lightweight, state-of-the-art open multimodal model built upon datasets which includ…☆33Updated 5 months ago
- ☆114Updated 6 months ago
- Build your own RAG and run it locally on your laptop: ColBERT + DSPy + Streamlit☆58Updated last year
- llama.cpp fork used by GPT4All☆55Updated 4 months ago
- Small Multimodal Vision Model "Imp-v1-3b" trained using Phi-2 and Siglip.☆17Updated last year
- A sleek and user-friendly interface for interacting with Ollama models, built with Python and Gradio.☆35Updated 2 months ago
- Route LLM requests to the best model for the task at hand.☆67Updated last week
- ☆101Updated 9 months ago