FeiGeChuanShu / trt2023View external linksLinks
NVIDIA TensorRT Hackathon 2023复赛选题:通义千问Qwen-7B用TensorRT-LLM模型搭建及优化
☆43Oct 20, 2023Updated 2 years ago
Alternatives and similar repositories for trt2023
Users that are interested in trt2023 are comparing it to the libraries listed below
Sorting:
- Whisper in TensorRT-LLM☆17Sep 21, 2023Updated 2 years ago
- YOLOv12 TensorRT 端到端模型加速推理和INT8量化实现☆13Mar 5, 2025Updated 11 months ago
- deepstream + cuda,yolo26,yolo-master,yolo11,yolov8,sam,transformer, etc.☆35Feb 7, 2026Updated last week
- ☢️ TensorRT 2023复赛——基于TensorRT-LLM的Llama模型推断加速优化☆51Oct 20, 2023Updated 2 years ago
- ☆26Aug 15, 2023Updated 2 years ago
- 搜藏的希望的代码片段☆13Jun 6, 2023Updated 2 years ago
- learn TensorRT from scratch🥰☆18Sep 29, 2024Updated last year
- 天池 NVIDIA TensorRT Hackathon 2023 —— 生成式AI模型优化赛 初赛第三名方案☆50Aug 16, 2023Updated 2 years ago
- 基于 CUDA Driver API 的 cuda 运行时环境☆15Jul 30, 2025Updated 6 months ago
- SAM and lama inpaint,包含QT的GUI交互界面,实现了交互式可实时显示结果的画点、画框进行SAM,然后通过进行Inpaint,具体操作看readme里的视频。☆52Jan 30, 2024Updated 2 years ago
- Standalone Flash Attention v2 kernel without libtorch dependency☆114Sep 10, 2024Updated last year
- ONNX-compatible DocShadow: High-Resolution Document Shadow Removal. Supports TensorRT 🚀☆24Sep 13, 2023Updated 2 years ago
- Several optimization methods of half-precision general matrix vector multiplication (HGEMV) using CUDA core.☆72Sep 8, 2024Updated last year
- ☆21Mar 22, 2021Updated 4 years ago
- A standalone GEMM kernel for fp16 activation and quantized weight, extracted from FasterTransformer☆96Sep 13, 2025Updated 5 months ago
- 基于MNN-llm的安卓手机部署大语言模型:Qwen1.5-0.5B-Chat☆89Apr 8, 2024Updated last year
- ☆79Nov 26, 2024Updated last year
- Implementation of our paper published in Springer's Signal, Image and Video Processing☆11Dec 5, 2020Updated 5 years ago
- CenterNet3D 部署版本,便于移植不同平台(onnx、tensorRT、rknn、Horizon)。☆13May 24, 2024Updated last year
- DETR tensor去除推理过程无用辅助头+fp16部署再次加速+解决转tensorrt 输出全为0问题的新方法。☆12Jan 9, 2024Updated 2 years ago
- ☆13Jan 7, 2025Updated last year
- ☆15Dec 1, 2023Updated 2 years ago
- a naive example of LivePortrait infer by ncnn☆46Aug 7, 2024Updated last year
- ncnn 实现一些项目例子☆26Feb 17, 2023Updated 2 years ago
- ☆27Sep 1, 2023Updated 2 years ago
- 使用 cutlass 仓库在 ada 架构上实现 fp8 的 flash attention☆78Aug 12, 2024Updated last year
- A local search system implementation using Elasticsearch for Wikipedia data indexing and retrieval.☆12May 17, 2025Updated 8 months ago
- High performance RMSNorm Implement by using SM Core Storage(Registers and Shared Memory)☆26Jan 22, 2026Updated 3 weeks ago
- ncnn version of CodeFormer☆110Mar 9, 2023Updated 2 years ago
- JAX bindings for the flash-attention3 kernels☆20Jan 2, 2026Updated last month
- This repository provides tutorial, which discusses running sample publisher and subscriber using multiple transports of point_cloud_trans…☆11Jan 20, 2026Updated 3 weeks ago
- ffmpeg+cuvid+tensorrt+multicamera☆12Dec 31, 2024Updated last year
- Triton Documentation in Chinese Simplified / Triton 中文文档☆103Dec 17, 2025Updated last month
- an example of segment-anything infer by ncnn☆124May 5, 2023Updated 2 years ago
- ☆31Aug 25, 2023Updated 2 years ago
- bilibili视频【CUDA 12.x 并行编程入门(C++版)】配套代码☆33Aug 12, 2024Updated last year
- Recording models☆12Sep 19, 2023Updated 2 years ago
- ☆27Jan 7, 2026Updated last month
- A Deeplearn Model to rec table in photo with ncnn. 一个深度学习模型用于检测图片中的表格 画像内のテーブルを検出するためのディープラーニング モデル☆20Mar 2, 2025Updated 11 months ago