FeiGeChuanShu/trt2023

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/FeiGeChuanShu/trt2023)

FeiGeChuanShu / trt2023

NVIDIA TensorRT Hackathon 2023复赛选题：通义千问Qwen-7B用TensorRT-LLM模型搭建及优化

☆43

Alternatives and similar repositories for trt2023

Users that are interested in trt2023 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

EdVince / whisper-trtllm
View on GitHub
Whisper in TensorRT-LLM
☆17Sep 21, 2023Updated 2 years ago
triple-mu / HunyuanDiT-TensorRT-libtorch
View on GitHub
HunyuanDiT with TensorRT and libtorch
☆18May 22, 2024Updated 2 years ago
DataXujing / YOLOv12-TensorRT
View on GitHub
YOLOv12 TensorRT 端到端模型加速推理和INT8量化实现
☆14Mar 5, 2025Updated last year
Phoenix8215 / learn-TensorRT-from-scratch
View on GitHub
learn TensorRT from scratch🥰
☆18Sep 29, 2024Updated last year
TRT2022 / ControlNet_TensorRT
View on GitHub
天池 NVIDIA TensorRT Hackathon 2023 —— 生成式AI模型优化赛初赛第三名方案
☆50Aug 16, 2023Updated 2 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
Tlntin / trt2023
View on GitHub
☆26Aug 15, 2023Updated 2 years ago
richjjj / duscratch
View on GitHub
搜藏的希望的代码片段
☆13Jun 6, 2023Updated 3 years ago
ZHEQIUSHUI / SAM-ONNX-AX650-CPP
View on GitHub
SAM and lama inpaint，包含QT的GUI交互界面，实现了交互式可实时显示结果的画点、画框进行SAM，然后通过进行Inpaint，具体操作看readme里的视频。
☆54Jan 30, 2024Updated 2 years ago
Tlntin / Qwen-TensorRT-LLM
View on GitHub
☆619Jul 31, 2024Updated last year
cqu20160901 / DETR_onnx_tensorRT_V2
View on GitHub
DETR tensor去除推理过程无用辅助头+fp16部署再次加速+解决转tensorrt 输出全为0问题的新方法。
☆12Jan 9, 2024Updated 2 years ago
TRT2022 / trtllm-llama
View on GitHub
☢️ TensorRT 2023复赛——基于TensorRT-LLM的Llama模型推断加速优化
☆54Oct 20, 2023Updated 2 years ago
tlc-pack / libflash_attn
View on GitHub
Standalone Flash Attention v2 kernel without libtorch dependency
☆113Sep 10, 2024Updated last year
sohaibali01 / low-light-video-enhancement
View on GitHub
Implementation of our paper published in Springer's Signal, Image and Video Processing
☆12Dec 5, 2020Updated 5 years ago
YdrMaster / cuda-driver
View on GitHub
基于 CUDA Driver API 的 cuda 运行时环境
☆16Jul 30, 2025Updated 11 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
richjjj / cuvid-tensorrt-multi
View on GitHub
ffmpeg+cuvid+tensorrt+multicamera
☆12Dec 31, 2024Updated last year
fabio-sim / DocShadow-ONNX-TensorRT
View on GitHub
ONNX-compatible DocShadow: High-Resolution Document Shadow Removal. Supports TensorRT 🚀
☆25Sep 13, 2023Updated 2 years ago
triple-mu / Qwen-Image-TensorRT
View on GitHub
Qwen-Image's DiT inference with TensorRT-10
☆21Oct 13, 2025Updated 9 months ago
ros-perception / point_cloud_transport_tutorial
View on GitHub
This repository provides tutorial, which discusses running sample publisher and subscriber using multiple transports of point_cloud_trans…
☆11Mar 17, 2026Updated 4 months ago
xiatwhu / trt2023
View on GitHub
☆27Sep 1, 2023Updated 2 years ago
ppogg / ncnn-android-v5lite
View on GitHub
☆18Jan 31, 2022Updated 4 years ago
cqu20160901 / FastSAM_rknn_Cplusplus
View on GitHub
FastSAM 部署rknn C++ 代码
☆13May 30, 2024Updated 2 years ago
FeiGeChuanShu / ncnn_Android_LivePortrait
View on GitHub
a naive example of LivePortrait infer by ncnn
☆47Aug 7, 2024Updated last year
kalfazed / multi-thread-programming
View on GitHub
This is a repository to practice multi-thread programming in C++
☆31Feb 21, 2024Updated 2 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
FeiGeChuanShu / CodeFormer-ncnn
View on GitHub
ncnn version of CodeFormer
☆109Mar 9, 2023Updated 3 years ago
cqu20160901 / centernet3d_onnx_rknn_horizon_tensorRT
View on GitHub
CenterNet3D 部署版本，便于移植不同平台（onnx、tensorRT、rknn、Horizon）。
☆14May 24, 2024Updated 2 years ago
xiatwhu / baidu_topk
View on GitHub
☆15Dec 1, 2023Updated 2 years ago
weishengying / cutlass_flash_atten_fp8
View on GitHub
使用 cutlass 仓库在 ada 架构上实现 fp8 的 flash attention
☆82Aug 12, 2024Updated last year
morsoli / llmbenchmark
View on GitHub
大模型API性能指标比较 - 深入分析TTFT、TPS等关键指标
☆20Sep 12, 2024Updated last year
yhwang-hub / yolov5_QAT
View on GitHub
Quantize yolov5 using pytorch_quantization.🚀🚀🚀
☆15Oct 24, 2023Updated 2 years ago
JieRen98 / SGEMM-SASS-Annotation
View on GitHub
☆21Mar 22, 2021Updated 5 years ago
SeungHwi0613 / ros2_bevfusion_demo
View on GitHub
☆32Aug 25, 2023Updated 2 years ago
Stephenfang51 / YOLOP-TensorRT
View on GitHub
unofficial implementation of YOLOP TensorRT
☆12Dec 11, 2021Updated 4 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
Rachel-liuqr / yolov8s-pose-ncnn
View on GitHub
yolov8s-pose using ncnn inferring!
☆45Apr 27, 2023Updated 3 years ago
mailliw2010 / infer-frame
View on GitHub
a ai infra framework for edge device base on nndeploy
☆18Nov 27, 2025Updated 8 months ago
BaofengZan / mnn-llm-GOT-OCR2.0
View on GitHub
使用mnn-llm对GOT-OCR2.0进行推理
☆14Oct 2, 2024Updated last year
FeiGeChuanShu / ncnn-android-depth_anything
View on GitHub
a Android demo of depth_anything_v1 and depth_anything_v2
☆73Jun 18, 2024Updated 2 years ago
ppogg / Deepstream-Box
View on GitHub
deepstream + cuda，yolo26，yolo-master，yolo11，yolov8，sam，transformer, etc.
☆27Feb 7, 2026Updated 5 months ago
Bruce-Lee-LY / cuda_hgemv
View on GitHub
Several optimization methods of half-precision general matrix vector multiplication (HGEMV) using CUDA core.
☆75Sep 8, 2024Updated last year
casper-hansen / AutoAWQ_kernels
View on GitHub
☆80Nov 26, 2024Updated last year