nvidia-riva / cpp-clientsLinks
Sample C++ command-line Riva clients.
☆36Updated 3 weeks ago
Alternatives and similar repositories for cpp-clients
Users that are interested in cpp-clients are comparing it to the libraries listed below
Sorting:
- Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva☆92Updated 9 months ago
- A toolkit for processing speech data and creating speech datasets☆189Updated 2 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆103Updated last year
- A TTS model that makes a speaker speak new languages☆76Updated last year
- NVIDIA Riva runnable tutorials☆160Updated 3 weeks ago
- Onnx wrapper for espnet infrernce model☆169Updated 4 months ago
- Text utilities, including beam search decoding, tokenizing, and more, built for use in Flashlight.☆78Updated last month
- [EMNLP Main '25] LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation☆138Updated 6 months ago
- EMNLP 23 - Integrating Whisper Encoder to LLaMA Decoder for Generative ASR Error Correction☆265Updated last year
- Riva Python client API and CLI utils☆114Updated 3 weeks ago
- ONNX and TensorRT implementation of Whisper☆65Updated 2 years ago
- Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax.☆38Updated 2 years ago
- Transcribing Speech with Multinomial Diffusion, training code and models.☆80Updated 2 years ago
- Putting flows on top of neural transducers for better TTS☆64Updated this week
- NeMo text processing for ASR and TTS☆396Updated last week
- ☆37Updated 2 weeks ago
- ☆157Updated 3 weeks ago
- Speaker change detection using SincNet and an LSTM/Transformer☆56Updated 6 months ago
- Various speech datasets made available to the public☆129Updated 11 months ago
- The demo page of UniAudio☆34Updated last year
- Conversational AI Benchmark.☆68Updated 2 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆107Updated 2 years ago
- asr2k☆52Updated last year
- Neural HMMs are all you need (for high-quality attention-free TTS)☆163Updated this week
- A project that optimizes Whisper for low latency inference using NVIDIA TensorRT☆95Updated last year
- Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup☆77Updated 5 months ago
- Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.☆49Updated 4 years ago
- CVSS: A Massively Multilingual Speech-to-Speech Translation Corpus☆220Updated 3 years ago
- ☆40Updated 3 years ago
- ☆21Updated 2 years ago