NVIDIA-AI-IOT / whisper_trt
A project that optimizes Whisper for low latency inference using NVIDIA TensorRT
☆71Updated 4 months ago
Alternatives and similar repositories for whisper_trt:
Users that are interested in whisper_trt are comparing it to the libraries listed below
- ONNX and TensorRT implementation of Whisper☆61Updated last year
- ONNX implementation of Whisper. PyTorch free.☆92Updated 3 months ago
- Zero-copy multimodal vector DB with CUDA and CLIP/SigLIP☆46Updated 8 months ago
- EdgeSAM model for use with Autodistill.☆26Updated 8 months ago
- openvino version of openai/whisper☆12Updated 4 months ago
- Riva Python client API and CLI utils☆84Updated this week
- A toolkit for processing speech data and creating speech datasets☆106Updated last week
- Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector…☆239Updated 4 months ago
- A very simple tool for situations where optimization with onnx-simplifier would exceed the Protocol Buffers upper file size limit of 2GB,…☆16Updated 9 months ago
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models.☆61Updated 6 months ago
- NVIDIA Riva runnable tutorials☆123Updated 2 months ago
- libvits-ncnn is an ncnn implementation of the VITS library that enables cross-platform GPU-accelerated speech synthesis.🎙️💻☆60Updated last year
- Experiments to test different speech recognition systems for SEPIA Framework☆58Updated last year
- Running the F5-TTS by ONNX Runtime☆104Updated last week
- Speech To Speech: an effort for an open-sourced and modular GPT4-o☆40Updated 4 months ago
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices.☆46Updated last year
- Dippy Synthetic Speech Subnet☆15Updated this week
- This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very …☆32Updated 2 months ago
- A Toolkit to Help Optimize Onnx Model☆114Updated 3 weeks ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆57Updated last week
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆93Updated 4 months ago
- A collection of reference AI microservices and workflows for Jetson Platform Services☆34Updated 3 weeks ago
- Efficient approach to speaker diarization using voice characteristics extraction☆88Updated 9 months ago
- The demo page of UniAudio☆34Updated last year
- Python scripts performing object detection using the YOLOv9 MIT model in ONNX.☆28Updated 5 months ago
- YoloV9 for a bare Raspberry Pi 4/5☆10Updated 8 months ago
- Using FastChat-T5 Large Language Model, Vosk API for automatic speech recognition, and Piper for text-to-speech☆117Updated last year
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper☆24Updated 6 months ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆58Updated 6 months ago
- Model compression for ONNX☆86Updated 3 months ago