NVIDIA / RTX-AI-Toolkit
The NVIDIA RTX™ AI Toolkit is a suite of tools and SDKs for Windows developers to customize, optimize, and deploy AI models across RTX PCs and cloud.
☆144 · Updated 4 months ago
Alternatives and similar repositories for RTX-AI-Toolkit:
Users who are interested in RTX-AI-Toolkit are comparing it to the libraries listed below:
- This reference can be used with any existing OpenAI integrated apps to run with TRT-LLM inference locally on GeForce GPU on Windows inste… ☆120 · Updated last year
- An NVIDIA AI Workbench example project for customizing an SDXL model ☆45 · Updated last week
- An NVIDIA AI Workbench example project for Retrieval Augmented Generation (RAG) ☆307 · Updated 3 months ago
- Cortex.Tensorrt-LLM is a C++ inference library that can be loaded by any server at runtime. It submodules NVIDIA’s TensorRT-LLM for GPU a… ☆43 · Updated 5 months ago
- Automatically quantize GGUF models ☆163 · Updated this week
- Gradio-based tool to run open-source LLM models directly from Hugging Face ☆91 · Updated 8 months ago
- llama.cpp fork used by GPT4All ☆54 · Updated last month
- This is an NVIDIA AI Workbench example project that demonstrates an end-to-end model development workflow using Llamafactory. ☆51 · Updated 5 months ago
- A pipeline parallel training script for LLMs. ☆132 · Updated this week
- An NVIDIA AI Workbench example project for fine-tuning a Mistral 7B model ☆53 · Updated 9 months ago
- Accelerate your Gen AI with NVIDIA NIM and NVIDIA AI Workbench ☆151 · Updated 2 weeks ago
- ☆142 · Updated 3 weeks ago
- A Windows tool to query various LLM AIs. Supports branched conversations, history, and summaries, among others. ☆29 · Updated this week
- ☆81 · Updated 3 months ago
- An NVIDIA AI Workbench example project for agentic Retrieval Augmented Generation (RAG) ☆63 · Updated last month
- Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector… ☆249 · Updated 5 months ago
- Collection of reference workflows for building intelligent agents with NIMs ☆149 · Updated 2 months ago
- Chat with Phi 3.5/3 Vision LLMs. Phi-3.5-vision is a lightweight, state-of-the-art open multimodal model built upon datasets which includ… ☆33 · Updated 2 months ago
- AMD related optimizations for transformer models ☆70 · Updated 4 months ago
- Distributed inference for MLX LLMs ☆85 · Updated 7 months ago
- ☆125 · Updated this week
- A comprehensive platform for managing, testing, and leveraging Ollama AI models with advanced features for customization, workflow automa… ☆47 · Updated 2 weeks ago
- LM Studio JSON configuration file format and a collection of example config files. ☆194 · Updated 7 months ago
- An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm. ☆37 · Updated last year
- Tcurtsni: Reverse Instruction Chat. Ever wonder what your LLM wants to ask you? ☆21 · Updated 8 months ago
- Rivet plugin for integration with Ollama, the tool for easily running LLMs locally ☆36 · Updated 11 months ago
- GRadient-INformed MoE ☆261 · Updated 5 months ago
- Zero-copy multimodal vector DB with CUDA and CLIP/SigLIP ☆49 · Updated 9 months ago
- This extension enhances the capabilities of textgen-webui by integrating advanced vision models, allowing users to have contextualized co… ☆53 · Updated 5 months ago
- ☆24 · Updated last year