NVIDIA / RTX-AI-Toolkit
The NVIDIA RTX™ AI Toolkit is a suite of tools and SDKs for Windows developers to customize, optimize, and deploy AI models across RTX PCs and cloud.
☆168 Updated 8 months ago
Alternatives and similar repositories for RTX-AI-Toolkit
Users who are interested in RTX-AI-Toolkit are comparing it to the libraries listed below.
- An NVIDIA AI Workbench example project for customizing an SDXL model ☆54 Updated last month
- This reference can be used with any existing OpenAI integrated apps to run with TRT-LLM inference locally on GeForce GPU on Windows inste… ☆126 Updated last year
- An NVIDIA AI Workbench example project for Retrieval Augmented Generation (RAG) ☆335 Updated 2 months ago
- Accelerate your Gen AI with NVIDIA NIM and NVIDIA AI Workbench ☆178 Updated 3 months ago
- llama.cpp fork used by GPT4All ☆56 Updated 5 months ago
- ☆160 Updated this week
- ☆290 Updated this week
- Simple UI for Llama-3.2-11B-Vision & Molmo-7B-D ☆136 Updated 10 months ago
- Cortex.Tensorrt-LLM is a C++ inference library that can be loaded by any server at runtime. It submodules NVIDIA’s TensorRT-LLM for GPU a… ☆43 Updated 10 months ago
- An NVIDIA AI Workbench example project for fine-tuning a Nemotron-3 8B model ☆50 Updated last year
- Automatically quantize GGUF models ☆188 Updated last week
- An NVIDIA AI Workbench example project that demonstrates an end-to-end model development workflow using Llamafactory. ☆63 Updated 9 months ago
- Examples of RAG using LlamaIndex with local LLMs: Gemma, Mixtral 8x7B, Llama 2, Mistral 7B, Orca 2, Phi-2, Neural 7B ☆129 Updated last year
- Unsloth Studio ☆98 Updated 4 months ago
- An NVIDIA AI Workbench example project for fine-tuning a Mistral 7B model ☆60 Updated last year
- ☆95 Updated 7 months ago
- Multimodal AI agent with Llama 3.2: A Streamlit app that processes text, images, PDFs, and PPTs, integrating NIM microservices, Milvus, a… ☆122 Updated 10 months ago
- ☆102 Updated 11 months ago
- Blueprint for ingesting massive volumes of live or archived videos and extracting insights for summarization and interactive Q&A ☆184 Updated last month
- Customizable, AI-driven virtual assistant designed to streamline customer service operations, handle common inquiries, and improve overal… ☆164 Updated 3 weeks ago
- The NVIDIA AIQToolkit UI streamlines interacting with AIQToolkit workflows in an easy-to-use web application. ☆38 Updated this week
- Gradio-based tool to run open-source LLMs directly from Hugging Face ☆94 Updated last year
- GRadient-INformed MoE ☆264 Updated 10 months ago
- Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector… ☆304 Updated 9 months ago
- ☆261 Updated last month
- Help shape the future of Project G-Assist ☆149 Updated last month
- A preprint version of our recent research on the capability of frontier AI systems to do self-replication ☆59 Updated 7 months ago
- ☆117 Updated 8 months ago
- Self-host LLMs with vLLM and BentoML ☆139 Updated last week
- ☆95 Updated 7 months ago