NVIDIA-AI-IOT / whisper_trtLinks
A project that optimizes Whisper for low latency inference using NVIDIA TensorRT
☆92Updated last year
Alternatives and similar repositories for whisper_trt
Users that are interested in whisper_trt are comparing it to the libraries listed below
Sorting:
- ONNX and TensorRT implementation of Whisper☆64Updated 2 years ago
- ONNX implementation of Whisper. PyTorch free.☆101Updated 11 months ago
- ASR/NLP/TTS deep learning inference library for NVIDIA Jetson using PyTorch and TensorRT☆217Updated last year
- [EMNLP Main '25] LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation☆132Updated 5 months ago
- A Toolkit to Help Optimize Onnx Model☆228Updated this week
- NVIDIA Riva runnable tutorials☆154Updated last month
- Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector…☆327Updated last year
- Riva Python client API and CLI utils☆111Updated 2 weeks ago
- A very simple tool for situations where optimization with onnx-simplifier would exceed the Protocol Buffers upper file size limit of 2GB,…☆17Updated 3 weeks ago
- ☆114Updated 2 weeks ago
- libvits-ncnn is an ncnn implementation of the VITS library that enables cross-platform GPU-accelerated speech synthesis.🎙️💻☆62Updated 2 years ago
- AI Edge Quantizer: flexible post training quantization for LiteRT models.☆73Updated this week
- ☆15Updated 5 months ago
- Experiments to test different speech recognition systems for SEPIA Framework☆63Updated 2 years ago
- Sample C++ command-line Riva clients.☆35Updated last month
- Python scripts performing optical flow estimation using the NeuFlowV2 model in ONNX.☆50Updated last year
- ☆103Updated last week
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.☆177Updated last year
- Zero-copy multimodal vector DB with CUDA and CLIP/SigLIP☆61Updated 5 months ago
- A package for NeuCodec: a 50hz, 0.8kbps, 24kHz audio codec.☆104Updated 3 weeks ago
- Model compression for ONNX☆97Updated 11 months ago
- This repository provides optical character detection and recognition solution optimized on Nvidia devices.☆81Updated 5 months ago
- A toolkit for processing speech data and creating speech datasets☆180Updated last month
- openvino version of openai/whisper☆14Updated last year
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper☆31Updated last year
- an optimized, production-ready implementation of active speaker detection☆72Updated last year
- A collection of all our phonemeizers for dataset construction and inference☆27Updated 8 months ago
- Converting weights of Pytorch models to ONNX & TensorRT engines☆50Updated 2 years ago
- ncnn HiFi-GAN☆29Updated last year
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models.☆67Updated last year