nvidia-riva / websocket-bridge
WebSocket <-> Riva proxy service. AudioCodes compatible.
☆20Updated 2 years ago
Alternatives and similar repositories for websocket-bridge
Users that are interested in websocket-bridge are comparing it to the libraries listed below
- NVIDIA Riva runnable tutorials☆160Updated last month
- Riva Python client API and CLI utils☆117Updated 2 weeks ago
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM☆37Updated 2 years ago
- Sample C++ command-line Riva clients.☆38Updated 2 weeks ago
- Open TTS models, built for streaming on the edge☆45Updated 10 months ago
- ☆50Updated 3 years ago
- Using OpenVINO to speed up MeloTTS inference☆15Updated last year
- ONNX and TensorRT implementation of Whisper☆66Updated 2 years ago
- Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection☆104Updated 10 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆106Updated last year
- NeMo text processing for ASR and TTS☆424Updated this week
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆100Updated last year
- Finetune VITS and MMS using HuggingFace's tools☆191Updated last year
- NeMo -> Riva Conversion Tool☆21Updated 2 months ago
- Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva☆91Updated 11 months ago
- A TTS model that makes a speaker speak new languages☆76Updated last year
- Add n-gram and large language model (LLM) support to Whisper models.☆40Updated 9 months ago
- MNN TTS demo.☆19Updated 9 months ago
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition☆14Updated 2 months ago
- An implementation for training the HiFi-GAN part of the XTTSv2 model using Coqui/TTS.☆86Updated last year
- ☆25Updated 3 years ago
- Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice.☆215Updated last year
- Stable timestamps and confidence score for words of OpenAI's Whisper outputs down to word-level.☆24Updated 3 years ago
- ☆17Updated 4 years ago
- EMNLP 23 - Integrating Whisper Encoder to LLaMA Decoder for Generative ASR Error Correction☆267Updated last year
- Prompting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation☆150Updated 2 years ago
- ☆40Updated 4 years ago
- Repository containing code to fine-tune the WhisperASR model☆23Updated 3 years ago
- ☆45Updated 3 years ago
- NPTEL2020: Speech2Text dataset for Indian-English Accent☆81Updated 4 years ago