Whisper in TensorRT-LLM
☆17Sep 21, 2023Updated 2 years ago
Alternatives and similar repositories for whisper-trtllm
Users that are interested in whisper-trtllm are comparing it to the libraries listed below
- NVIDIA TensorRT Hackathon 2023 second-round topic: building and optimizing the Tongyi Qianwen Qwen-7B model with TensorRT-LLM☆42Oct 20, 2023Updated 2 years ago
- A very handy TNN classify demo☆11Mar 24, 2021Updated 4 years ago
- ☆13Jan 7, 2025Updated last year
- Implementation of our paper published in Springer's Signal, Image and Video Processing☆12Dec 5, 2020Updated 5 years ago
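- OpenCV calling Jetson/RK3588 MPP hardware decoding, with the open and read functions rewritten; supports h264/h265☆14Nov 27, 2025Updated 3 months ago
- Tianchi NVIDIA TensorRT Hackathon 2023, Generative AI Model Optimization Contest: third-place solution in the preliminary round☆50Aug 16, 2023Updated 2 years ago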
- ☆34Feb 3, 2025Updated last year
- The text detection and recognition component extracted from MinerU, implementing PaddleOCR's text detection and recognition in PyTorch☆17Jun 2, 2025Updated 9 months ago
- ppstructure deploy by ncnn☆36Jul 16, 2024Updated last year
- ☆22Aug 14, 2024Updated last year
- An implementation of <Group Fisher Pruning for Practical Network Compression> based on pytorch and mmcv☆18Nov 21, 2021Updated 4 years ago
- ☆125Dec 15, 2023Updated 2 years ago
- ☆23Oct 24, 2022Updated 3 years ago
- Deep learning model optimization using the TensorRT API on Windows☆16Aug 29, 2022Updated 3 years ago
- SCRFD face detection based on MNN inference framework☆18Sep 22, 2021Updated 4 years ago
- ☆17Aug 9, 2021Updated 4 years ago
- Transformer related optimization, including BERT, GPT☆17Jul 29, 2023Updated 2 years ago
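- Deploys CoupledTPS with OpenCV, covering three models: portrait correction, rectangling of images with irregular boundaries, and rotated-image correction. As before, includes both C++ and Python versions of the program☆20Jul 4, 2024Updated last year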
- ☆20Sep 28, 2024Updated last year
- Object detection and instance segmentation on MaskRCNN with torchvision, albumentations, tensorboard and cocoapi. Supports custom coco da…☆18Sep 28, 2020Updated 5 years ago
- An easy way to run, test, benchmark and tune OpenCL kernel files☆24Aug 25, 2023Updated 2 years ago
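- Real-time video inference on network and file stream URLs using the NCNN inference framework and the ByteTrack object-tracking framework, with the UI built in Qt☆24Oct 16, 2024Updated last year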
- qwen2 and llama3 cpp implementation☆49Jun 7, 2024Updated last year
- SGEMM optimization with cuda step by step☆21Mar 23, 2024Updated last year
- Lane line detection implemented with the NCNN framework (C/C++)☆24Apr 21, 2025Updated 10 months ago
- ☆21Mar 22, 2021Updated 4 years ago
- ☆90Jun 30, 2023Updated 2 years ago
- ☢️ TensorRT 2023 second round: inference acceleration and optimization of the Llama model based on TensorRT-LLM☆51Oct 20, 2023Updated 2 years ago
- An INTERFACE for darknet, which allows you to use the darknet detector in your own program (C, C++, Python, etc...) to do something interesting…☆24Feb 3, 2021Updated 5 years ago
- DeepSparkInference has selected 216 inference models of both small and large sizes. The small models cover fields such as computer vision…☆27Updated this week
- ☆27Sep 1, 2023Updated 2 years ago
- ☆25Aug 15, 2023Updated 2 years ago
- Standalone Flash Attention v2 kernel without libtorch dependency☆114Sep 10, 2024Updated last year
- NVIDIA® TensorRT™, an SDK for high-performance deep learning inference, includes a deep learning inference optimizer and runtime that del…☆25Jul 21, 2023Updated 2 years ago
- ncnn HiFi-GAN☆29Sep 29, 2024Updated last year
- Implementation of IceFormer: Accelerated Inference with Long-Sequence Transformers on CPUs (ICLR 2024).☆25Feb 22, 2026Updated 2 weeks ago
- ncnn version of CodeFormer☆109Mar 9, 2023Updated 3 years ago
- TensorRT for SOLO (using Python)☆27Aug 19, 2022Updated 3 years ago
- convert paddleOCR to torchOCR, ppocr-v3,ppocr-v4, onnx, openvino☆33Aug 16, 2023Updated 2 years ago