nvidia-riva / python-clients
Riva Python client API and CLI utils
☆84Updated this week
Alternatives and similar repositories for python-clients:
Users that are interested in python-clients are comparing it to the libraries listed below
- NVIDIA Riva runnable tutorials☆124Updated this week
- A toolkit for processing speech data and creating speech datasets☆106Updated last week
- A project that optimizes Whisper for low latency inference using NVIDIA TensorRT☆71Updated 4 months ago
- Sample C++ command-line Riva clients.☆31Updated this week
- NeMo text processing for ASR and TTS☆311Updated 3 weeks ago
- Efficient approach to speaker diarization using voice characteristics extraction☆88Updated 9 months ago
- ONNX and TensorRT implementation of Whisper☆61Updated last year
- Create an LJSpeech structured voice dataset on wave input☆26Updated 4 months ago
- A TTS model that makes a speaker speak new languages☆76Updated 8 months ago
- Official Implementation of StyleTTS☆418Updated last month
- SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official code☆199Updated 2 years ago
- Open models for Coqui STT☆129Updated last year
- ☆346Updated 5 months ago
- Websockets <-> Riva proxy service. Audiocodes compatible.☆14Updated last year
- A PyTorch demo of the paper Voice Separation with an Unknown Number of Multiple Speakers using gradio and Nvidia NEMO ASR model.☆35Updated last year
- Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription)☆65Updated 8 months ago
- EMNLP 23 - Integrating Whisper Encoder to LLaMA Decoder for Generative ASR Error Correction☆246Updated 9 months ago
- Google's SoundStorm: Efficient Parallel Audio Generation☆131Updated last year
- 🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.☆235Updated 8 months ago
- CVSS: A Massively Multilingual Speech-to-Speech Translation Corpus☆196Updated 2 years ago
- Official Implementation of StyleTTS-VC☆175Updated last month
- ☆273Updated 8 months ago
- ☆182Updated 2 years ago
- Harness the power of NVIDIA technologies and LangChain to create dynamic avatars from live speech, integrating RIVA ASR and TTS with Audi…☆59Updated 7 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆93Updated 4 months ago
- A live speech recognition using Facebooks wav2vec 2.0 model.☆341Updated last year
- ☆117Updated 2 months ago
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.☆160Updated 11 months ago
- Implementation of Spear-TTS - multi-speaker text-to-speech attention network, in Pytorch☆266Updated last year
- ☆251Updated last year