NVIDIA TensorRT Hackathon 2023复赛选题:通义千问Qwen-7B用TensorRT-LLM模型搭建及优化
☆43Oct 20, 2023Updated 2 years ago
Alternatives and similar repositories for trt2023
Users that are interested in trt2023 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- HunyuanDiT with TensorRT and libtorch☆18May 22, 2024Updated last year
- YOLOv12 TensorRT 端到端模型加速推理和INT8量化实现☆13Mar 5, 2025Updated last year
- learn TensorRT from scratch🥰☆18Sep 29, 2024Updated last year
- 天池 NVIDIA TensorRT Hackathon 2023 —— 生成式AI模型优化赛 初赛第三名方案☆50Aug 16, 2023Updated 2 years ago
- ☆26Aug 15, 2023Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- 搜藏的希望的代码片段☆13Jun 6, 2023Updated 2 years ago
- SAM and lama inpaint,包含QT的GUI交互界面,实现了交互式可实时显示结果的画点、画框进行SAM,然后通过进行Inpaint,具体操作看readme里的视频。☆54Jan 30, 2024Updated 2 years ago
- ☆15Dec 1, 2023Updated 2 years ago
- DETR tensor去除推理过程无用辅助头+fp16部署再次加速+解决转tensorrt 输出全为0问题的新方法。☆11Jan 9, 2024Updated 2 years ago
- Implementation of our paper published in Springer's Signal, Image and Video Processing☆12Dec 5, 2020Updated 5 years ago
- deepstream + cuda,yolo26,yolo-master,yolo11,yolov8,sam,transformer, etc.☆28Feb 7, 2026Updated 3 months ago
- ☢️ TensorRT 2023复赛——基于TensorRT-LLM的Llama模型推断加速优化☆52Oct 20, 2023Updated 2 years ago
- Standalone Flash Attention v2 kernel without libtorch dependency☆113Sep 10, 2024Updated last year
- ONNX-compatible DocShadow: High-Resolution Document Shadow Removal. Supports TensorRT 🚀☆25Sep 13, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ffmpeg+cuvid+tensorrt+multicamera☆12Dec 31, 2024Updated last year
- 基于 CUDA Driver API 的 cuda 运行时环境☆16Jul 30, 2025Updated 9 months ago
- This repository provides tutorial, which discusses running sample publisher and subscriber using multiple transports of point_cloud_trans…☆11Mar 17, 2026Updated last month
- ☆27Sep 1, 2023Updated 2 years ago
- an example of segment-anything infer by ncnn☆124May 5, 2023Updated 3 years ago
- ☆18Jan 31, 2022Updated 4 years ago
- This is a repository to practice multi-thread programming in C++☆29Feb 21, 2024Updated 2 years ago
- ncnn version of CodeFormer☆109Mar 9, 2023Updated 3 years ago
- CenterNet3D 部署版本,便于移植不同平台(onnx、tensorRT、rknn、Horizon)。☆13May 24, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 使用 cutlass 仓库在 ada 架构上实现 fp8 的 flash attention☆81Aug 12, 2024Updated last year
- 大模型API性能指标比较 - 深入分析TTFT、TPS等关键指标☆20Sep 12, 2024Updated last year
- Quantize yolov5 using pytorch_quantization.🚀🚀🚀☆15Oct 24, 2023Updated 2 years ago
- ☆20Mar 22, 2021Updated 5 years ago
- unofficial implementation of YOLOP TensorRT☆12Dec 11, 2021Updated 4 years ago
- ☆32Aug 25, 2023Updated 2 years ago
- yolov8s-pose using ncnn inferring!☆44Apr 27, 2023Updated 3 years ago
- a Android demo of depth_anything_v1 and depth_anything_v2☆70Jun 18, 2024Updated last year
- 使用mnn-llm对GOT-OCR2.0进行推理☆14Oct 2, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Several optimization methods of half-precision general matrix vector multiplication (HGEMV) using CUDA core.☆74Sep 8, 2024Updated last year
- ☆79Nov 26, 2024Updated last year
- A Deeplearn Model to rec table in photo with ncnn. 一个深度学习模型用于检测图片中的表格 画像内のテーブルを検出するためのディープラーニング モデル☆20Mar 2, 2025Updated last year
- LightNet-TRT is a high-efficiency and real-time implementation of convolutional neural networks (CNNs) using Edge AI.☆75Oct 3, 2023Updated 2 years ago
- ☆72Sep 8, 2022Updated 3 years ago
- 基于MNN-llm的安卓手机部署大语言模型:Qwen1.5-0.5B-Chat☆93Apr 8, 2024Updated 2 years ago
- RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …☆10Nov 3, 2023Updated 2 years ago