This reference can be used with any existing OpenAI integrated apps to run with TRT-LLM inference locally on GeForce GPU on Windows instead of cloud.
☆128Feb 29, 2024Updated 2 years ago
Alternatives and similar repositories for trt-llm-as-openai-windows
Users that are interested in trt-llm-as-openai-windows are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM☆3,124Jan 21, 2026Updated 3 months ago
- OpenAI compatible API for TensorRT LLM triton backend☆220Aug 1, 2024Updated last year
- An open source implementation of CLIP☆22Nov 6, 2024Updated last year
- Text-Guided Generation of Full-Body Image with Preserved Reference Face for Customized Animation☆24Jun 24, 2024Updated last year
- Run Vision LLMs, TTS and STT APIs. Website and API for https://text-generator.io☆39May 8, 2026Updated last week
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- An OpenAI API compatible images server to generate or manipulate images.☆18Feb 2, 2025Updated last year
- ☆25Feb 18, 2024Updated 2 years ago
- Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO…☆59Updated this week
- ☆14Nov 22, 2024Updated last year
- The Triton TensorRT-LLM Backend☆934May 7, 2026Updated last week
- experiments with inference on llama☆103Jun 6, 2024Updated last year
- A one-stop library to standardize the inference and evaluation of all the conditional video generation models.☆51Feb 13, 2025Updated last year
- This repository is intended as a comprehensive guide to prepare for interviews focused on generative AI. It serves as a one-stop resource…☆11Dec 13, 2024Updated last year
- HunyuanVideo: A Systematic Framework For Large Video Generation Model☆48Dec 14, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- This is the repo with the code to conduct a comparative analysis of different audio representation models.☆12Aug 31, 2023Updated 2 years ago
- Scriptable interface to a powerful, multi-lingual language server☆42Updated this week
- [ICML'24] Creative Text-to-Audio Generation via Synthesizer Programming☆40Sep 26, 2024Updated last year
- STDFormer: Spatio Temporal Disentanglement Learning for 3D Human Mesh Recovery from Monocular Videos with Transformer☆45Mar 14, 2024Updated 2 years ago
- Wyoming protocol server that calls an external program to play audio☆19Apr 28, 2024Updated 2 years ago
- Copier template for creating a Mopidy extension☆17Nov 24, 2025Updated 5 months ago
- Official Repo for MoCha Towards Movie-Grade Talking Character Synthesis☆61Dec 27, 2025Updated 4 months ago
- ☆24Mar 6, 2024Updated 2 years ago
- 2D python IDE built into blender as an add-on!☆21Sep 16, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆24Mar 10, 2025Updated last year
- ☆23Feb 21, 2025Updated last year
- Useful utilities for huggingface☆25Dec 26, 2025Updated 4 months ago
- ☆15Jan 21, 2025Updated last year
- ☆11Sep 28, 2024Updated last year
- The NVIDIA RTX™ AI Toolkit is a suite of tools and SDKs for Windows developers to customize, optimize, and deploy AI models across RTX PC…☆184Nov 24, 2025Updated 5 months ago
- FlexiFilm: Long Video Generation with Flexible Conditions☆31May 1, 2024Updated 2 years ago
- ☆72Mar 10, 2026Updated 2 months ago
- NOVA-3D: Non-overlapped Views for 3D Anime Character Reconstruction☆28Mar 14, 2024Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆13Apr 8, 2021Updated 5 years ago
- A Suite for Parallel Inference of Diffusion Transformers (DiTs) on multi-GPU Clusters☆58May 3, 2026Updated 2 weeks ago
- Generate images from an initial frame and text☆37Jul 28, 2023Updated 2 years ago
- Codebase for the paper HawkI: HawkI: Homography & Mutual Information Guidance for 3D-free Single Image to Aerial View☆13Jun 5, 2024Updated last year
- Code for "Fine-Tuning by Curriculum Learning for Non-Autoregressive Neural Machine Translation"☆13Jul 10, 2020Updated 5 years ago
- [IEEE/CVF CVPR'2022] "ST-MFNet: A Spatio-Temporal Multi-Flow Network for Frame Interpolation", Duolikun Danier, Fan Zhang, David Bull☆13Oct 9, 2023Updated 2 years ago
- Implementation for "Correcting Diffusion Generation through Resampling" [CVPR 2024]☆34Dec 12, 2023Updated 2 years ago