EdVince/whisper-trtllm

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/EdVince/whisper-trtllm)

EdVince / whisper-trtllm

Whisper in TensorRT-LLM

☆17

Alternatives and similar repositories for whisper-trtllm

Users that are interested in whisper-trtllm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

FeiGeChuanShu / trt2023
View on GitHub
NVIDIA TensorRT Hackathon 2023复赛选题：通义千问Qwen-7B用TensorRT-LLM模型搭建及优化
☆43Oct 20, 2023Updated 2 years ago
TRT2022 / ControlNet_TensorRT
View on GitHub
天池 NVIDIA TensorRT Hackathon 2023 —— 生成式AI模型优化赛初赛第三名方案
☆50Aug 16, 2023Updated 2 years ago
sohaibali01 / low-light-video-enhancement
View on GitHub
Implementation of our paper published in Springer's Signal, Image and Video Processing
☆12Dec 5, 2020Updated 5 years ago
wangzhaode / mnn-asr
View on GitHub
mnn asr demo.
☆27Mar 24, 2025Updated last year
Zolewit / TNNdemo
View on GitHub
很好用的tnn classify demo
☆11Mar 24, 2021Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Rayrtfr / FasterTransformer
View on GitHub
Transformer related optimization, including BERT, GPT
☆17Jul 29, 2023Updated 3 years ago
Tlntin / trt2023
View on GitHub
☆26Aug 15, 2023Updated 2 years ago
JiangLiSJTU / token-ring
View on GitHub
☆13Jan 7, 2025Updated last year
Qengineering / Head-Pose-ncnn-Raspberry-Pi-4
View on GitHub
Ultra fast head pose estimation on a bare Raspberry Pi 4 at 20 FPS
☆10Dec 21, 2021Updated 4 years ago
GIBEREZ / Qt-NCNN-ByteTrack
View on GitHub
使用NCNN推理框架和ByteTrack目标跟踪框架，对网络、文件流URL进行实时性视频推理，而UI界面则由Qt框架实现
☆23Oct 16, 2024Updated last year
tensorchord / modelz-ChatGLM
View on GitHub
Deploy ChatGLM on Modelz
☆16Mar 20, 2023Updated 3 years ago
xiatwhu / trt2023
View on GitHub
☆27Sep 1, 2023Updated 2 years ago
EdPendragon / Six-in-a-row
View on GitHub
大一的软件课程设计I的Qt项目，一部分参考了黑马程序员的教程（十分感谢）。实现了具备背景、bgm的可视化界面六子棋，可以也改成五子棋，实现了人人对战、人机对战、机机对战（观看）功能，可以更改是否开启禁手。
☆10Apr 21, 2022Updated 4 years ago
FeiGeChuanShu / ncnn_ppstructure
View on GitHub
ppstructure deploy by ncnn
☆36Jul 16, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
BBuf / tensorrt-llm-moe
View on GitHub
☆34Feb 3, 2025Updated last year
LeiWang1999 / Stream-k.tvm
View on GitHub
☆20Sep 28, 2024Updated last year
daquexian / faster-rwkv
View on GitHub
☆126Dec 15, 2023Updated 2 years ago
vllm-project / tml-fa4
View on GitHub
FA4-based Relative Attention Kernel developed by TML and Colfax
☆17Jul 17, 2026Updated last week
zongwave / IPASS
View on GitHub
IPASS -- Image Processing Algorithm Simulation Software
☆11Jul 25, 2025Updated last year
Tlntin / ChatGLM2-6B-TensorRT
View on GitHub
☆90Jun 30, 2023Updated 3 years ago
tlc-pack / libflash_attn
View on GitHub
Standalone Flash Attention v2 kernel without libtorch dependency
☆113Sep 10, 2024Updated last year
6xdax / rk3588_yolov5_bytetrack
View on GitHub
☆10Dec 19, 2023Updated 2 years ago
jimmy-evo / opencl_kernels
View on GitHub
An easy way to run, test, benchmark and tune OpenCL kernel files
☆24Aug 25, 2023Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
tlc-pack / cutlass_fpA_intB_gemm
View on GitHub
A standalone GEMM kernel for fp16 activation and quantized weight, extracted from FasterTransformer
☆96Jun 21, 2026Updated last month
tongyuantongyu / TRT-NNScaler
View on GitHub
Fast Neural Network Super-resolution tool based on TensorRT
☆15Jun 6, 2026Updated last month
hpc203 / CoupledTPS-opencv-dnn
View on GitHub
使用OpenCV部署CoupledTPS，包含了肖像矫正，不规则边界的图像矩形化，旋转图像矫正，三个模型。依然是包含C++和Python两个版本的程序
☆21Jul 4, 2024Updated 2 years ago
yuking926 / RKNN-YOLO11
View on GitHub
用于YOLO11模型转化为RKNN模型在RK系列开发板上面部署
☆15Sep 21, 2025Updated 10 months ago
YangLinzhuo / cuda-sgemm-optimization
View on GitHub
CUDA SGEMM optimization note
☆15Oct 31, 2023Updated 2 years ago
k2-fsa / kaldi-decoder
View on GitHub
Decoders from Kaldi using OpenFst
☆35Apr 10, 2026Updated 3 months ago
RobinQu / instinct.cpp
View on GitHub
instinct.cpp provides ready to use alternatives to OpenAI Assistant API and built-in utilities for developing AI Agent applications (RAG,…
☆59Jul 5, 2024Updated 2 years ago
taishan1994 / Qwen2-UIE
View on GitHub
基于Qwen2模型进行通用信息抽取【实体/关系/事件抽取】
☆41Jul 10, 2024Updated 2 years ago
hpc203 / 3DDFA-V3-opencv-dnn
View on GitHub
使用opencv部署3D人脸重建3DDFA-V3，包含C++和Python两个版本的程序，只依赖opencv库就能运行
☆40Aug 19, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
liu-mengyang / trt-elan
View on GitHub
该项目实现了图像超分辨率算法ELAN的TensorRT版本。
☆31Jul 9, 2022Updated 4 years ago
Danielement321 / FM2S
View on GitHub
[MIR] Pytorch Implementation for FM2S, a denoising algorithm for fluorescence microscopy.
☆15Mar 13, 2026Updated 4 months ago
keddyjin / TensorRT_StableDiffusion_ControlNet
View on GitHub
NVIDIA® TensorRT™, an SDK for high-performance deep learning inference, includes a deep learning inference optimizer and runtime that del…
☆26Jul 21, 2023Updated 3 years ago
ling0322 / libllm
View on GitHub
Efficient inference of large language models.
☆151Sep 28, 2025Updated 10 months ago
JieRen98 / SGEMM-SASS-Annotation
View on GitHub
☆21Mar 22, 2021Updated 5 years ago
TRT2022 / trtllm-llama
View on GitHub
☢️ TensorRT 2023复赛——基于TensorRT-LLM的Llama模型推断加速优化
☆54Oct 20, 2023Updated 2 years ago
BADBADBADBOY / baipiaoOCR
View on GitHub
convert paddleOCR to torchOCR, ppocr-v3,ppocr-v4, onnx, openvino
☆33Aug 16, 2023Updated 2 years ago