NVIDIA-AI-IOT / whisper_trt
A project that optimizes Whisper for low-latency inference using NVIDIA TensorRT
☆89Updated 10 months ago
Alternatives and similar repositories for whisper_trt
Users who are interested in whisper_trt are comparing it to the libraries listed below.
- ONNX and TensorRT implementation of Whisper☆64Updated 2 years ago
- ONNX implementation of Whisper. PyTorch free.☆101Updated 9 months ago
- ASR/NLP/TTS deep learning inference library for NVIDIA Jetson using PyTorch and TensorRT☆216Updated last year
- NVIDIA Riva runnable tutorials☆142Updated 3 weeks ago
- Riva Python client API and CLI utils☆100Updated last week
- Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector…☆307Updated 10 months ago
- Zero-copy multimodal vector DB with CUDA and CLIP/SigLIP☆61Updated 3 months ago
- A toolkit to help optimize ONNX models☆197Updated 2 weeks ago
- ☆12Updated 2 months ago
- A toolkit for processing speech data and creating speech datasets☆144Updated last week
- A very simple tool for situations where optimization with onnx-simplifier would exceed the Protocol Buffers upper file size limit of 2GB,…☆17Updated last year
- ☆99Updated 11 months ago
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models.☆67Updated last year
- Experiments to test different speech recognition systems for SEPIA Framework☆60Updated 2 years ago
- libvits-ncnn is an ncnn implementation of the VITS library that enables cross-platform GPU-accelerated speech synthesis.🎙️💻☆62Updated 2 years ago
- A collection of all our phonemizers for dataset construction and inference☆25Updated 6 months ago
- [EMNLP Main '25] LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation☆121Updated 3 months ago
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.☆176Updated last year
- ☆111Updated last week
- ☆31Updated last week
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper☆30Updated last year
- This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very …☆41Updated 8 months ago
- Voxtral: Convert Mistral into an end2end SpeechLM. No information bottleneck, preserves prosody, learns interruptions from data. Unlike GP…☆31Updated 5 months ago
- ncnn HiFi-GAN☆28Updated 10 months ago
- Simple tool for partial optimization of ONNX. Further optimize some models that cannot be optimized with onnx-optimizer and onnxsim by se…☆19Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆102Updated 10 months ago
- Python scripts performing Open Vocabulary Object Detection using the YOLO-World model in ONNX.☆57Updated last year
- Using OpenVINO to speed up MeloTTS inference☆13Updated 9 months ago
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM☆38Updated last year
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆66Updated 3 weeks ago