nvidia-riva / python-clientsLinks
Riva Python client API and CLI utils
☆113Updated last month
Alternatives and similar repositories for python-clients
Users that are interested in python-clients are comparing it to the libraries listed below
Sorting:
- NVIDIA Riva runnable tutorials☆155Updated last month
- NeMo text processing for ASR and TTS☆386Updated 2 weeks ago
- A toolkit for processing speech data and creating speech datasets☆182Updated last month
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.☆351Updated 2 years ago
- A project that optimizes Whisper for low latency inference using NVIDIA TensorRT☆93Updated last year
- ☆314Updated last year
- 🐸 - A general purpose model trainer, as flexible as it gets☆227Updated last year
- ONNX and TensorRT implementation of Whisper☆65Updated 2 years ago
- Sample C++ command-line Riva clients.☆35Updated this week
- Efficient approach to speaker diarization using voice characteristics extraction☆104Updated 5 months ago
- ☆378Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆97Updated last year
- A live speech recognition using Facebooks wav2vec 2.0 model.☆373Updated last year
- ☆261Updated last year
- openvino version of openai/whisper☆177Updated 2 years ago
- Finetune VITS and MMS using HuggingFace's tools☆176Updated last year
- EMNLP 23 - Integrating Whisper Encoder to LLaMA Decoder for Generative ASR Error Correction☆264Updated last year
- ☆350Updated last year
- ☆151Updated 3 weeks ago
- Websockets <-> Riva proxy service. Audiocodes compatible.☆18Updated 2 years ago
- Speech To Speech: an effort for an open-sourced and modular GPT4-o☆72Updated last year
- LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM☆290Updated 6 months ago
- G2P☆355Updated 3 months ago
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.☆177Updated last year
- ONNX implementation of Whisper. PyTorch free.☆101Updated 11 months ago
- MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text Translation☆396Updated 2 years ago
- Official Implementation of StyleTTS☆454Updated 10 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆121Updated last year
- Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation☆151Updated last year
- [EMNLP Main '25] LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation☆135Updated 6 months ago