NVIDIA-AI-IOT / whisper_trt
A project that optimizes Whisper for low latency inference using NVIDIA TensorRT
☆81 Updated 6 months ago
Alternatives and similar repositories for whisper_trt:
Users who are interested in whisper_trt are comparing it to the libraries listed below.
- ONNX and TensorRT implementation of Whisper ☆61 Updated last year
- ONNX implementation of Whisper. PyTorch free. ☆95 Updated 5 months ago
- Zero-copy multimodal vector DB with CUDA and CLIP/SigLIP ☆55 Updated 10 months ago
- Using FastChat-T5 Large Language Model, Vosk API for automatic speech recognition, and Piper for text-to-speech ☆118 Updated last year
- NVIDIA Riva runnable tutorials ☆130 Updated last month
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models. ☆63 Updated 8 months ago
- Riva Python client API and CLI utils ☆92 Updated last week
- Using OpenVINO to speed up MeloTTS inference ☆10 Updated 6 months ago
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper ☆25 Updated 9 months ago
- Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector… ☆256 Updated 6 months ago
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices. ☆46 Updated last year
- A lightweight pure C++ Text-to-Speech (TTS) pipeline with OpenVINO, supporting multiple languages. ☆53 Updated 3 weeks ago
- Collection of Open Source Speech Data ☆153 Updated 6 months ago
- A toolkit for processing speech data and creating speech datasets ☆110 Updated this week
- ☆107 Updated last month
- LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation ☆101 Updated 3 weeks ago
- EdgeSAM model for use with Autodistill. ☆26 Updated 10 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆62 Updated 3 weeks ago
- This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very … ☆37 Updated 5 months ago
- Running the F5-TTS by ONNX Runtime ☆148 Updated last week
- Sample C++ command-line Riva clients. ☆33 Updated last week
- libvits-ncnn is an ncnn implementation of the VITS library that enables cross-platform GPU-accelerated speech synthesis.🎙️💻 ☆60 Updated 2 years ago
- A playground for experimenting with acoustic echo cancellation using a microphone, speaker, and ONNX. ☆10 Updated 6 months ago
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023. ☆166 Updated last year
- openvino version of openai/whisper ☆13 Updated 7 months ago
- This repository provides optical character detection and recognition solution optimized on Nvidia devices. ☆74 Updated 3 weeks ago
- ASR/NLP/TTS deep learning inference library for NVIDIA Jetson using PyTorch and TensorRT ☆206 Updated last year
- a cpp ggml port of "VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech." for use in mobile… ☆39 Updated 8 months ago
- ☆94 Updated 7 months ago
- A Toolkit to Help Optimize Onnx Model ☆145 Updated this week