triple-Mu / HunyuanDiT-TensorRT-libtorchLinks
HunyuanDiT with TensorRT and libtorch
☆18Updated last year
Alternatives and similar repositories for HunyuanDiT-TensorRT-libtorch
Users that are interested in HunyuanDiT-TensorRT-libtorch are comparing it to the libraries listed below
Sorting:
- NVIDIA TensorRT Hackathon 2023复赛选题:通义千问Qwen-7B用TensorRT-LLM模型搭建及优化☆42Updated last year
- A tool convert TensorRT engine/plan to a fake onnx☆41Updated 2 years ago
- 天池 NVIDIA TensorRT Hackathon 2023 —— 生成式AI模型优化赛 初赛第三名方案☆50Updated 2 years ago
- Stable Diffusion in TensorRT 8.5+☆15Updated 2 years ago
- ffmpeg+cuvid+tensorrt+multicamera☆12Updated 9 months ago
- ☆27Updated 3 months ago
- ☆23Updated last year
- Deploy RT-EDTR with onnx from paddlepaddle framwork and graph cut☆31Updated 2 years ago
- c++实现的clip推理,模型有一点点改动,但是不大,改动和导出模型的代码可以在readme里找到,模型文件都在Releases里,包括AX650的模型。新增支持ChineseCLIP☆30Updated 3 months ago
- ☆20Updated last year
- SAM and lama inpaint,包含QT的GUI交互界面,实现了交互式可实时显示结果的画点、画框进行SAM,然后通过进行Inpaint,具体操作看readme里的视频。☆50Updated last year
- Awesome code, projects, books, etc. related to CUDA☆24Updated last month
- Python scripts performing Open Vocabulary Object Detection using the YOLO-World model in ONNX.☆59Updated last year
- Demo for Qwen2.5-VL-3B-Instruct on Axera device.☆13Updated last month
- 大模型部署实战:TensorRT-LLM, Triton Inference Server, vLLM☆26Updated last year
- ☆79Updated 2 years ago
- ☆17Updated last year
- YOLOv12 TensorRT 端到端模型加速推理和INT8量化实现☆13Updated 7 months ago
- c++ implementation of mmpose inference, for pose estimation based on MNN☆13Updated 4 years ago
- DETR tensor去除推理过程无用辅助头+fp16部署再次加速+解决转tensorrt 输出全为0问题的新方法。☆12Updated last year
- ☆14Updated 3 years ago
- segment-anything based mnn☆35Updated last year
- Whisper in TensorRT-LLM☆16Updated 2 years ago
- llm deploy project based onnx.☆44Updated 11 months ago
- ☆24Updated 2 years ago
- 使用OpenCV部署CoupledTPS,包含了肖像矫正,不规则边界的图像矩形化,旋转图像矫正,三个模型。依然是包含C++和Python两个版本的程序☆20Updated last year
- paper-read-notes☆12Updated last year
- Large Language Model Onnx Inference Framework☆36Updated 8 months ago
- A GLCC Server for MMDeploy☆19Updated 2 years ago
- Accelerate segment anything model inference using Tensorrt 8.6.1.6☆101Updated last year