nvidia-riva / python-clients
Riva Python client API and CLI utils
☆73Updated last week
Related projects ⓘ
Alternatives and complementary repositories for python-clients
- NVIDIA Riva runnable tutorials☆118Updated last week
- A toolkit for processing speech data and creating speech datasets☆88Updated this week
- ONNX and TensorRT implementation of Whisper☆59Updated last year
- 🐸 - A general purpose model trainer, as flexible as it gets☆198Updated 8 months ago
- A project that optimizes Whisper for low latency inference using NVIDIA TensorRT☆63Updated last month
- Sample C++ command-line Riva clients.☆29Updated last week
- NeMo text processing for ASR and TTS☆285Updated this week
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.☆157Updated 8 months ago
- EMNLP 23 - Integrating Whisper Encoder to LLaMA Decoder for Generative ASR Error Correction☆232Updated 6 months ago
- A TTS model that makes a speaker speak new languages☆75Updated 5 months ago
- Official Implementation of StyleTTS☆401Updated 11 months ago
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.☆259Updated last year
- ☆54Updated this week
- Onnx wrapper for espnet infrernce model☆156Updated last month
- ☆307Updated 2 months ago
- Websockets <-> Riva proxy service. Audiocodes compatible.☆13Updated last year
- Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation☆134Updated 10 months ago
- openvino version of openai/whisper☆161Updated last year
- 🐤 Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillation☆241Updated last year
- ☆296Updated 4 months ago
- Open models for Coqui STT☆122Updated last year
- openvino version of openai/whisper☆12Updated last month
- Faster Tortoise inference then Tortoise Fast Fork☆122Updated 7 months ago
- VoiceBox neural network implementation☆96Updated 3 months ago
- Create an LJSpeech structured voice dataset on wave input☆21Updated last month
- Google's SoundStorm: Efficient Parallel Audio Generation☆129Updated last year
- ☆257Updated 5 months ago
- Official Implementation of StyleTTS-VC☆164Updated last year
- Efficient approach to speaker diarization using voice characteristics extraction☆68Updated 6 months ago
- Implementation of Spear-TTS - multi-speaker text-to-speech attention network, in Pytorch☆257Updated last year