HunyuanDiT with TensorRT and libtorch
☆18May 22, 2024Updated last year
Alternatives and similar repositories for HunyuanDiT-TensorRT-libtorch
Users that are interested in HunyuanDiT-TensorRT-libtorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A tool convert TensorRT engine/plan to a fake onnx☆41Nov 22, 2022Updated 3 years ago
- Stable Diffusion in TensorRT 8.5+☆15Mar 19, 2023Updated 3 years ago
- ffmpeg+cuvid+tensorrt+multicamera☆12Dec 31, 2024Updated last year
- ☆20Dec 29, 2023Updated 2 years ago
- 大模型API性能指标比较 - 深入分析TTFT、TPS等关键指标☆20Sep 12, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- learn TensorRT from scratch🥰☆18Sep 29, 2024Updated last year
- 用于学习GOT/Qwen/OnnxLLm☆55Oct 8, 2024Updated last year
- 搜藏的希望的代码片段☆13Jun 6, 2023Updated 2 years ago
- 使用mnn-llm对GOT-OCR2.0进行推理☆14Oct 2, 2024Updated last year
- ☆23Jan 3, 2024Updated 2 years ago
- A simple neural network inference framework☆25Aug 1, 2023Updated 2 years ago
- llm deploy project based onnx.☆50Oct 9, 2024Updated last year
- YOLOv12 TensorRT 端到端模型加速推理和INT8量化实现☆13Mar 5, 2025Updated last year
- a plugin-oriented framework for video structured. 国产程序员请加微信zhzhi78拉群交流。☆18May 28, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- DETR tensor去除推理过程无用辅助头+fp16部署再次加速+解决转tensorrt 输出全为0问题的新方法。☆11Jan 9, 2024Updated 2 years ago
- ☆22Apr 10, 2024Updated 2 years ago
- ☆30Nov 16, 2024Updated last year
- yolov7-pose end2end TRT实现☆27Sep 8, 2022Updated 3 years ago
- Training LLaMA language model with MMEngine! It supports LoRA fine-tuning!☆40Apr 2, 2023Updated 3 years ago
- Examples of AI model running on the board, such as horizon/rockchip and so on.☆21Jul 10, 2023Updated 2 years ago
- A faster implementation of OpenCV-CUDA that uses OpenCV objects, and more!☆54Mar 28, 2026Updated 2 weeks ago
- NVIDIA TensorRT Hackathon 2023复赛选题:通义千问Qwen-7B用TensorRT-LLM模型搭建及优化☆43Oct 20, 2023Updated 2 years ago
- In our implementation of Qwen-Image-Edit, we employ block causal attention to improve inference speed.☆48Feb 16, 2026Updated 2 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Awesome code, projects, books, etc. related to CUDA☆32Mar 30, 2026Updated 2 weeks ago
- FastSAM 部署rknn C++ 代码☆13May 30, 2024Updated last year
- 使用 cutlass 仓库在 ada 架构上实现 fp8 的 flash attention☆82Aug 12, 2024Updated last year
- an example of segment-anything infer by ncnn☆124May 5, 2023Updated 2 years ago
- ☆14Feb 9, 2026Updated 2 months ago
- ☆85Mar 2, 2023Updated 3 years ago
- ☆48Mar 27, 2023Updated 3 years ago
- ☆42Nov 29, 2022Updated 3 years ago
- c++实现的clip推理,模型有一点点改动,但是不大,改动和导出模型的代码可以在readme里找到,模型文件都在Releases里,包括AX650的模型。新增支持ChineseCLIP☆31Jun 19, 2025Updated 9 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- C++ TensorRT Implementation of NanoSAM☆51Dec 28, 2023Updated 2 years ago
- segmentation algorithm yolact use tensorrt deploy☆14May 7, 2022Updated 3 years ago
- a simple lightweight large language model pipeline framework.☆28Apr 25, 2025Updated 11 months ago
- "FastSAM_Awsome_Openvino" 项目展示了如何通过 OpenVINO 框架高效部署 FastSAM 模型,实现了令人瞩目的实例分割功能。该项目提供了 C++ 版本和 Python 版本两种实现,为开发者提供了在不同语言环境下使用 FastSAM 模型的选…☆38Dec 13, 2023Updated 2 years ago
- opencv调用jetson/rk3588 mpp硬解码,重写了open与read函数,支持h264/h265☆14Nov 27, 2025Updated 4 months ago
- ☆20Jul 20, 2022Updated 3 years ago
- ☆26Aug 15, 2023Updated 2 years ago