NVIDIA-AI-IOT / whisper_trtLinks
A project that optimizes Whisper for low latency inference using NVIDIA TensorRT
☆89Updated 11 months ago
Alternatives and similar repositories for whisper_trt
Users that are interested in whisper_trt are comparing it to the libraries listed below
Sorting:
- ONNX and TensorRT implementation of Whisper☆64Updated 2 years ago
- ONNX implementation of Whisper. PyTorch free.☆99Updated 9 months ago
- ASR/NLP/TTS deep learning inference library for NVIDIA Jetson using PyTorch and TensorRT☆217Updated last year
- Riva Python client API and CLI utils☆102Updated 2 weeks ago
- A very simple tool for situations where optimization with onnx-simplifier would exceed the Protocol Buffers upper file size limit of 2GB,…☆17Updated last year
- NVIDIA Riva runnable tutorials☆144Updated last month
- ☆114Updated 3 weeks ago
- A Toolkit to Help Optimize Onnx Model☆214Updated last week
- [EMNLP Main '25] LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation☆126Updated 3 months ago
- libvits-ncnn is an ncnn implementation of the VITS library that enables cross-platform GPU-accelerated speech synthesis.🎙️💻☆62Updated 2 years ago
- Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector…☆316Updated 10 months ago
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper☆31Updated last year
- Zero-copy multimodal vector DB with CUDA and CLIP/SigLIP☆61Updated 4 months ago
- Experiments to test different speech recognition systems for SEPIA Framework☆60Updated 2 years ago
- ☆140Updated 3 weeks ago
- Using OpenVINO to speed up MeloTTS inference☆13Updated 10 months ago
- A collection of all our phonemeizers for dataset construction and inference☆26Updated 6 months ago
- Using FastChat-T5 Large Language Model, Vosk API for automatic speech recognition, and Piper for text-to-speech☆125Updated 2 years ago
- ncnn HiFi-GAN☆29Updated 11 months ago
- Running the F5-TTS by ONNX Runtime☆178Updated last month
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.☆176Updated last year
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models.☆67Updated last year
- ☆100Updated last year
- Onnx compatible styletts2 code☆13Updated 3 months ago
- AI Edge Quantizer: flexible post training quantization for LiteRT models.☆64Updated this week
- Efficient approach to speaker diarization using voice characteristics extraction☆100Updated 2 months ago
- Sample C++ command-line Riva clients.☆34Updated last month
- This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very …