This reference can be used with any existing OpenAI-integrated app to run TRT-LLM inference locally on a GeForce GPU on Windows instead of in the cloud.
☆128 · Feb 29, 2024 · Updated 2 years ago
Alternatives and similar repositories for trt-llm-as-openai-windows
Users that are interested in trt-llm-as-openai-windows are comparing it to the libraries listed below.
- A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM ☆3,125 · Jan 21, 2026 · Updated 4 months ago
- OpenAI compatible API for TensorRT LLM triton backend ☆221 · Aug 1, 2024 · Updated last year
- All-in-one Full-Featured Python/Flet/Flutter Application to make the most of all the latest Open-Source AI Art Generators in an intuitive… ☆16 · May 30, 2025 · Updated last year
- The hackpack is a collection of resources to get started, and continue, your understanding of artificial intelligence. ☆10 · Nov 1, 2018 · Updated 7 years ago
- Run Vision LLMs, TTS and STT APIs. Website and API for https://text-generator.io ☆39 · May 8, 2026 · Updated last month
- An OpenAI API compatible images server to generate or manipulate images. ☆18 · Feb 2, 2025 · Updated last year
- ☆25 · Feb 18, 2024 · Updated 2 years ago
- experiments with inference on llama ☆103 · Jun 6, 2024 · Updated 2 years ago
- ☆13 · Feb 18, 2023 · Updated 3 years ago
- ☆25 · Mar 6, 2024 · Updated 2 years ago
- 2D python IDE built into blender as an add-on! ☆21 · Sep 16, 2024 · Updated last year
- ☆24 · Mar 10, 2025 · Updated last year
- ☆24 · Feb 21, 2025 · Updated last year
- Scriptable interface to a powerful, multi-lingual language server ☆43 · May 27, 2026 · Updated last week
- Useful utilities for huggingface ☆25 · Dec 26, 2025 · Updated 5 months ago
- ☆11 · Sep 28, 2024 · Updated last year
- The NVIDIA RTX™ AI Toolkit is a suite of tools and SDKs for Windows developers to customize, optimize, and deploy AI models across RTX PC… ☆184 · Nov 24, 2025 · Updated 6 months ago
- Linkedin profile analyzer - built using openai assistants api, retrieval and gpt4 vision custom function ☆34 · Feb 12, 2024 · Updated 2 years ago
- ☆75 · Mar 10, 2026 · Updated 3 months ago
- ☆70 · Apr 4, 2025 · Updated last year
- A simple node to download repos from HF specify a repo ID or File create a folder where you want to download the files then rename the fo… ☆25 · Jul 14, 2025 · Updated 10 months ago
- A Suite for Parallel Inference of Diffusion Transformers (DiTs) on multi-GPU Clusters ☆58 · May 3, 2026 · Updated last month
- Codebase for the paper HawkI: HawkI: Homography & Mutual Information Guidance for 3D-free Single Image to Aerial View ☆13 · Jun 5, 2024 · Updated 2 years ago
- ☆11 · Nov 10, 2019 · Updated 6 years ago
- ☆344 · May 8, 2026 · Updated last month
- [IEEE/CVF CVPR'2022] "ST-MFNet: A Spatio-Temporal Multi-Flow Network for Frame Interpolation", Duolikun Danier, Fan Zhang, David Bull ☆13 · Oct 9, 2023 · Updated 2 years ago
- ☆63 · Nov 8, 2023 · Updated 2 years ago
- Implementation for "Correcting Diffusion Generation through Resampling" [CVPR 2024] ☆34 · Dec 12, 2023 · Updated 2 years ago
- Knwler is a lightweight, single-file Python tool that extracts structured knowledge graphs from documents using AI. Feed it a PDF or text… ☆129 · Apr 3, 2026 · Updated 2 months ago
- 🪐 🎛️ User interface to manage your Jupyter platform. ☆17 · Apr 30, 2026 · Updated last month
- implementation of AnimateDiff. ☆32 · Jul 14, 2023 · Updated 2 years ago
- ☆40 · May 10, 2024 · Updated 2 years ago
- ☆19 · Apr 23, 2025 · Updated last year
- ☆19 · Jan 17, 2025 · Updated last year
- Official implementation of Déjà View: Looping Transformers for Multi-View 3D Reconstruction ☆160 · Jun 1, 2026 · Updated last week
- This is the official code for paper: [PiVe: Prompting with Iterative Verification Improving Graph-based Generative Capability of LLMs] ☆36 · Aug 31, 2024 · Updated last year
- ☆58 · Apr 11, 2024 · Updated 2 years ago
- Windows Forms user interface for making lip sync videos with DINet and OpenFace ☆25 · Oct 14, 2023 · Updated 2 years ago
- An unofficial Unity port of the MERF viewer ☆40 · Sep 19, 2023 · Updated 2 years ago