nvidia-riva / websocket-bridge
Websockets <-> Riva proxy service. Audiocodes compatible.
☆17 Updated 2 years ago
Alternatives and similar repositories for websocket-bridge
Users who are interested in websocket-bridge are comparing it to the libraries listed below.
- NVIDIA Riva runnable tutorials ☆142 Updated 3 weeks ago
- Sample C++ command-line Riva clients. ☆34 Updated 3 weeks ago
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM ☆38 Updated last year
- Riva Python client API and CLI utils ☆100 Updated this week
- Finetune VITS and MMS using HuggingFace's tools ☆162 Updated last year
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface. ☆331 Updated 2 years ago
- ☆43 Updated 2 years ago
- EMNLP 23 - Integrating Whisper Encoder to LLaMA Decoder for Generative ASR Error Correction ☆262 Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP… ☆102 Updated 10 months ago
- Segment an audio file and obtain utterance alignments. (Python package) ☆339 Updated last year
- Convert English text from written expressions into spoken forms ☆26 Updated 3 years ago
- Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva ☆91 Updated 6 months ago
- Various speech datasets made available to the public ☆128 Updated 8 months ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing. ☆101 Updated 2 years ago
- ☆47 Updated 2 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code ☆150 Updated last year
- ONNX wrapper for espnet inference model ☆168 Updated 2 weeks ago
- Add n-gram and large language model (LLM) support to Whisper models. ☆31 Updated 3 months ago
- Live speech recognition using Facebook's wav2vec 2.0 model. ☆363 Updated last year
- Speaker change detection using SincNet and an LSTM/Transformer ☆53 Updated 3 months ago
- ☆199 Updated 3 years ago
- NeMo text processing for ASR and TTS ☆359 Updated last week
- NPTEL2020: Speech2Text dataset for Indian-English Accent ☆78 Updated 3 years ago
- ☆123 Updated this week
- Batch Support for OpenAI Whisper ☆95 Updated last year
- Desktop application for neural speech synthesis written in C++ ☆215 Updated 2 years ago
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition ☆15 Updated last year
- Python server for communicating with Kaldi from the browser using WebRTC ☆69 Updated last year
- ONNX and TensorRT implementation of Whisper ☆64 Updated 2 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper ☆118 Updated 2 years ago