nvidia-riva / python-clients
Riva Python client API and CLI utils
☆91 Updated 2 weeks ago
Alternatives and similar repositories for python-clients:
Users who are interested in python-clients compare it to the libraries listed below.
- NVIDIA Riva runnable tutorials ☆129 Updated 3 weeks ago
- A toolkit for processing speech data and creating speech datasets ☆108 Updated this week
- ONNX and TensorRT implementation of Whisper ☆61 Updated last year
- Sample C++ command-line Riva clients. ☆32 Updated 2 weeks ago
- NeMo text processing for ASR and TTS ☆323 Updated last week
- Collection of Open Source Speech Data ☆153 Updated 5 months ago
- A project that optimizes Whisper for low latency inference using NVIDIA TensorRT ☆79 Updated 6 months ago
- ☆355 Updated 7 months ago
- ☆86 Updated last week
- openvino version of openai/whisper ☆166 Updated last year
- A TTS model that makes a speaker speak new languages ☆76 Updated 10 months ago
- ONNX implementation of Whisper. PyTorch free. ☆92 Updated 4 months ago
- ☆255 Updated last year
- Websockets <-> Riva proxy service. Audiocodes compatible. ☆14 Updated 2 years ago
- ☆281 Updated 10 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆94 Updated 11 months ago
- Efficient approach to speaker diarization using voice characteristics extraction ☆92 Updated 11 months ago
- Implementation of Spear-TTS - multi-speaker text-to-speech attention network, in Pytorch ☆268 Updated last year
- VoiceBench: Benchmarking LLM-Based Voice Assistants ☆174 Updated this week
- ☆129 Updated 4 months ago
- LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation ☆95 Updated 2 weeks ago
- 🐸 - A general purpose model trainer, as flexible as it gets ☆213 Updated last year
- EMNLP 23 - Integrating Whisper Encoder to LLaMA Decoder for Generative ASR Error Correction ☆251 Updated 10 months ago
- LSLM implements full duplex modeling in interactive speech language models, based on research by Ma et al. (2024). This project advances … ☆64 Updated 3 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP… ☆95 Updated 6 months ago
- Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event … ☆369 Updated last year
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion ☆173 Updated 6 months ago
- Google's SoundStorm: Efficient Parallel Audio Generation ☆131 Updated last year
- ☆187 Updated 3 years ago
- [Interspeech 2024] Whisper-Flamingo: Integrating Visual Features into Whisper for Audio-Visual Speech Recognition and Translation ☆150 Updated 2 months ago