Whisper in TensorRT-LLM
☆17Sep 21, 2023Updated 2 years ago
Alternatives and similar repositories for whisper-trtllm
Users that are interested in whisper-trtllm are comparing it to the libraries listed below.
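- NVIDIA TensorRT Hackathon 2023 second-round project: building and optimizing Tongyi Qianwen Qwen-7B with TensorRT-LLM☆43Oct 20, 2023Updated 2 years ago
- Tianchi NVIDIA TensorRT Hackathon 2023, Generative AI Model Optimization Contest: third-place solution in the preliminary round☆50Aug 16, 2023Updated 2 years ago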
- Implementation of our paper published in Springer's Signal, Image and Video Processing☆12Dec 5, 2020Updated 5 years ago
- mnn asr demo.☆26Mar 24, 2025Updated last year
- OpenCV with Jetson/RK3588 MPP hardware decoding; rewrites the open and read functions and supports h264/h265☆14Nov 27, 2025Updated 4 months ago
- A very handy tnn classify demo☆11Mar 24, 2021Updated 5 years ago
- Ultra fast head pose estimation on a bare Raspberry Pi 4 at 20 FPS☆10Dec 21, 2021Updated 4 years ago
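- Real-time video inference on network and file stream URLs using the NCNN inference framework and the ByteTrack object tracking framework, with the UI implemented in Qt☆24Oct 16, 2024Updated last year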
- ☆26Aug 15, 2023Updated 2 years ago
- ☆23Aug 14, 2024Updated last year
- IPASS -- Image Processing Algorithm Simulation Software☆11Jul 25, 2025Updated 8 months ago
- ☆27Sep 1, 2023Updated 2 years ago
- ☆33Feb 3, 2025Updated last year
- Fast Neural Network Super-resolution tool based on TensorRT☆15Sep 6, 2025Updated 7 months ago
- ppstructure deploy by ncnn☆37Jul 16, 2024Updated last year
- ☆125Dec 15, 2023Updated 2 years ago
- ☆10Dec 19, 2023Updated 2 years ago
- Standalone Flash Attention v2 kernel without libtorch dependency☆113Sep 10, 2024Updated last year
- Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale☆28Aug 4, 2023Updated 2 years ago
- Deploy ChatGLM on Modelz☆16Mar 20, 2023Updated 3 years ago
- A standalone GEMM kernel for fp16 activation and quantized weight, extracted from FasterTransformer☆97Feb 20, 2026Updated last month
- An easy way to run, test, benchmark and tune OpenCL kernel files☆24Aug 25, 2023Updated 2 years ago
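- Deploying CoupledTPS with OpenCV, covering three models: portrait correction, rectangling of images with irregular boundaries, and rotated-image correction. As before, includes both C++ and Python versions☆20Jul 4, 2024Updated last year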
- vLLM Router☆55Mar 11, 2024Updated 2 years ago
- ☆15Dec 1, 2023Updated 2 years ago
- This project implements a TensorRT version of the image super-resolution algorithm ELAN.☆30Jul 9, 2022Updated 3 years ago
- 5th place solution in "NIPS 2017: Non-targeted Adversarial Attack" (with solution in targeted attack and defence)☆10Nov 14, 2017Updated 8 years ago
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK☆91Feb 14, 2026Updated 2 months ago
- Bert TensorRT model acceleration and deployment☆10Apr 1, 2022Updated 4 years ago
- SCRFD face detection based on MNN inference framework☆18Sep 22, 2021Updated 4 years ago
- Universal information extraction (entity/relation/event extraction) based on the Qwen2 model☆42Jul 10, 2024Updated last year
- ☆20Feb 14, 2025Updated last year
- ☆20Mar 22, 2021Updated 5 years ago
- Superresolution running on Rockchip NPU (RK3588, etc..)☆21Jul 7, 2024Updated last year
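- Chinese text correction based on seq2edit (Gector).☆29Nov 15, 2022Updated 3 years ago
- The text detection and recognition component extracted from MinerU; implements paddleocr text detection and recognition in pytorch☆17Jun 2, 2025Updated 10 months ago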
- NVIDIA® TensorRT™, an SDK for high-performance deep learning inference, includes a deep learning inference optimizer and runtime that del…☆26Jul 21, 2023Updated 2 years ago
- An implementation of <Group Fisher Pruning for Practical Network Compression> based on pytorch and mmcv☆18Nov 21, 2021Updated 4 years ago
- Efficient inference of large language models.☆150Sep 28, 2025Updated 6 months ago