EdVince / whisper-trtllmView external linksLinks
Whisper in TensorRT-LLM
☆17Sep 21, 2023Updated 2 years ago
Alternatives and similar repositories for whisper-trtllm
Users that are interested in whisper-trtllm are comparing it to the libraries listed below
Sorting:
- NVIDIA TensorRT Hackathon 2023复赛选题:通义千问Qwen-7B用TensorRT-LLM模型搭建及优化☆43Oct 20, 2023Updated 2 years ago
- mnn asr demo.☆25Mar 24, 2025Updated 10 months ago
- 很好用的tnn classify demo☆11Mar 24, 2021Updated 4 years ago
- Implementation of our paper published in Springer's Signal, Image and Video Processing☆11Dec 5, 2020Updated 5 years ago
- ☆13Jan 7, 2025Updated last year
- opencv调用jetson/rk3588 mpp硬解码,重写了open与read函数,支持h264/h265☆14Nov 27, 2025Updated 2 months ago
- 天池 NVIDIA TensorRT Hackathon 2023 —— 生成式AI模型优化赛 初赛第三名方案☆50Aug 16, 2023Updated 2 years ago
- ☆34Feb 3, 2025Updated last year
- ppstructure deploy by ncnn☆35Jul 16, 2024Updated last year
- ☆21Aug 14, 2024Updated last year
- An implementation of <Group Fisher Pruning for Practical Network Compression> based on pytorch and mmcv☆18Nov 21, 2021Updated 4 years ago
- ☆125Dec 15, 2023Updated 2 years ago
- ☆23Oct 24, 2022Updated 3 years ago
- Deep Learning Model Optimization Using by TensorRT API, window☆16Aug 29, 2022Updated 3 years ago
- ☆20Sep 28, 2024Updated last year
- Object detection and instance segmentation on MaskRCNN with torchvision, albumentations, tensorboard and cocoapi. Supports custom coco da…☆18Sep 28, 2020Updated 5 years ago
- Transformer related optimization, including BERT, GPT☆17Jul 29, 2023Updated 2 years ago
- 使用OpenCV部署CoupledTPS,包含了肖像矫正,不规则边界的图像矩形化,旋转图像矫正,三个模型。依然是包含C++和Python两个版本的程序☆20Jul 4, 2024Updated last year
- ☆17Aug 9, 2021Updated 4 years ago
- SCRFD face detection based on MNN inference framework☆17Sep 22, 2021Updated 4 years ago
- An easy way to run, test, benchmark and tune OpenCL kernel files☆24Aug 25, 2023Updated 2 years ago
- 使用NCNN推理框架和ByteTrack目标跟踪框架,对网络、文件流URL进行实时性视频推理,而UI界面则由Qt框架实现☆24Oct 16, 2024Updated last year
- vLLM Router☆55Mar 11, 2024Updated last year
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK☆90Feb 5, 2026Updated last week
- SGEMM optimization with cuda step by step☆21Mar 23, 2024Updated last year
- 基于NCNN框架实现车道线检测(C/C++)☆24Apr 21, 2025Updated 9 months ago
- ☆21Mar 22, 2021Updated 4 years ago
- ☆90Jun 30, 2023Updated 2 years ago
- 基于MNN-llm的安卓手机部署大语言模型:Qwen1.5-0.5B-Chat☆89Apr 8, 2024Updated last year
- A standalone GEMM kernel for fp16 activation and quantized weight, extracted from FasterTransformer☆96Sep 13, 2025Updated 5 months ago
- Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale☆28Aug 4, 2023Updated 2 years ago
- ☢️ TensorRT 2023复赛——基于TensorRT-LLM的Llama模型推断加速优化☆51Oct 20, 2023Updated 2 years ago
- An INTERFACE for darknet, which allow you to use darknet detector in your own program(C, C++, Python, etc...) to do something interesting…☆24Feb 3, 2021Updated 5 years ago
- DeepSparkInference has selected 216 inference models of both small and large sizes. The small models cover fields such as computer vision…☆27Updated this week
- ☆27Sep 1, 2023Updated 2 years ago
- ☆26Aug 15, 2023Updated 2 years ago
- Standalone Flash Attention v2 kernel without libtorch dependency☆114Sep 10, 2024Updated last year
- NVIDIA® TensorRT™, an SDK for high-performance deep learning inference, includes a deep learning inference optimizer and runtime that del…☆26Jul 21, 2023Updated 2 years ago
- Implementation of IceFormer: Accelerated Inference with Long-Sequence Transformers on CPUs (ICLR 2024).☆25Jul 15, 2025Updated 7 months ago