Whisper in TensorRT-LLM
☆17Sep 21, 2023Updated 2 years ago
Alternatives and similar repositories for whisper-trtllm
Users that are interested in whisper-trtllm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- NVIDIA TensorRT Hackathon 2023复赛选题:通义千问Qwen-7B用TensorRT-LLM模型搭建及优化☆43Oct 20, 2023Updated 2 years ago
- 天池 NVIDIA TensorRT Hackathon 2023 —— 生成式AI模型优化赛 初赛第三名方案☆50Aug 16, 2023Updated 2 years ago
- mnn asr demo.☆26Mar 24, 2025Updated last year
- opencv调用jetson/rk3588 mpp硬解码,重写了open与read函数,支持h264/h265☆14Nov 27, 2025Updated 4 months ago
- 很好用的tnn classify demo☆11Mar 24, 2021Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Ultra fast head pose estimation on a bare Raspberry Pi 4 at 20 FPS☆10Dec 21, 2021Updated 4 years ago
- 使用NCNN推理框架和ByteTrack目标跟踪框架,对网络、文件流URL进行实时性视频推理,而UI界面则由Qt框架实现☆24Oct 16, 2024Updated last year
- Transformer related optimization, including BERT, GPT☆17Jul 29, 2023Updated 2 years ago
- ☆13Jan 7, 2025Updated last year
- IPASS -- Image Processing Algorithm Simulation Software☆11Jul 25, 2025Updated 8 months ago
- ☆27Sep 1, 2023Updated 2 years ago
- ☆34Feb 3, 2025Updated last year
- ☆10Dec 19, 2023Updated 2 years ago
- ppstructure deploy by ncnn☆36Jul 16, 2024Updated last year
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- ☆20Sep 28, 2024Updated last year
- Standalone Flash Attention v2 kernel without libtorch dependency☆113Sep 10, 2024Updated last year
- ☆90Jun 30, 2023Updated 2 years ago
- Fast Neural Network Super-resolution tool based on TensorRT☆15Sep 6, 2025Updated 6 months ago
- PyTorch implementation of Image Super-Resolution via Deep Recursive Residual Network (CVPR 2017)☆23Jun 5, 2019Updated 6 years ago
- A standalone GEMM kernel for fp16 activation and quantized weight, extracted from FasterTransformer☆96Feb 20, 2026Updated last month
- An easy way to run, test, benchmark and tune OpenCL kernel files☆24Aug 25, 2023Updated 2 years ago
- 使用OpenCV部署CoupledTPS,包含了肖像矫正,不规则边界的图像矩形化,旋转图像矫正,三个模型。依然是包含C++和Python两个版本的程序☆20Jul 4, 2024Updated last year
- vLLM Router☆55Mar 11, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆15Dec 1, 2023Updated 2 years ago
- 基于NCNN框架实现车道线检测(C/C++)☆24Apr 21, 2025Updated 11 months ago
- 该项目实现了图像超分辨率算法ELAN的TensorRT版本。☆30Jul 9, 2022Updated 3 years ago
- 使用opencv部署3D人脸重建3DDFA-V3,包含C++和Python两个版本的程序,只依赖opencv库就能运行☆40Aug 19, 2024Updated last year
- SCRFD face detection based on MNN inference framework☆18Sep 22, 2021Updated 4 years ago
- 基于Qwen2模型进行通用信息抽取【实体/关系/事件抽取】☆42Jul 10, 2024Updated last year
- ☆21Mar 22, 2021Updated 5 years ago
- 从MinerU中提取出来的文本检测识别部分,通过pytorch实现paddleocr的文本检测识别☆17Jun 2, 2025Updated 9 months ago
- NVIDIA® TensorRT™, an SDK for high-performance deep learning inference, includes a deep learning inference optimizer and runtime that del…☆26Jul 21, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Efficient inference of large language models.☆150Sep 28, 2025Updated 6 months ago
- SGEMM optimization with cuda step by step☆22Mar 23, 2024Updated 2 years ago
- ☢️ TensorRT 2023复赛——基于TensorRT-LLM的Llama模型推断加速优化☆51Oct 20, 2023Updated 2 years ago
- ☆14May 22, 2019Updated 6 years ago
- TensorRT☆11Sep 22, 2020Updated 5 years ago
- convert paddleOCR to torchOCR, ppocr-v3,ppocr-v4, onnx, openvino☆33Aug 16, 2023Updated 2 years ago
- Batch video captioning using Qwen3-VL-8B vision-language model☆72Mar 3, 2026Updated 3 weeks ago