nvidia-riva / cpp-clientsLinks
Sample C++ command-line Riva clients.
☆34Updated 2 weeks ago
Alternatives and similar repositories for cpp-clients
Users that are interested in cpp-clients are comparing it to the libraries listed below
Sorting:
- Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva☆92Updated 7 months ago
- NVIDIA Riva runnable tutorials☆149Updated 2 weeks ago
- ONNX and TensorRT implementation of Whisper☆64Updated 2 years ago
- A toolkit for processing speech data and creating speech datasets☆174Updated last week
- Riva Python client API and CLI utils☆108Updated 2 weeks ago
- Text utilities, including beam search decoding, tokenizing, and more, built for use in Flashlight.☆73Updated 3 weeks ago
- Onnx wrapper for espnet infrernce model☆169Updated last month
- Conversational AI Benchmark.☆68Updated 2 years ago
- NeMo text processing for ASR and TTS☆376Updated last week
- A TTS model that makes a speaker speak new languages☆76Updated last year
- ☆133Updated 2 weeks ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆107Updated 2 years ago
- Various speech datasets made available to the public☆131Updated 9 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆102Updated last year
- Applications using the GTN library and code to reproduce experiments in "Differentiable Weighted Finite-State Transducers"☆83Updated 3 years ago
- EMNLP 23 - Integrating Whisper Encoder to LLaMA Decoder for Generative ASR Error Correction☆264Updated last year
- Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax.☆37Updated 2 years ago
- A project that optimizes Whisper for low latency inference using NVIDIA TensorRT☆90Updated 11 months ago
- Neural HMMs are all you need (for high-quality attention-free TTS)☆160Updated 2 weeks ago
- ☆39Updated 3 years ago
- 🐤 Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillation☆257Updated 2 years ago
- Segment an audio file and obtain utterance alignments. (Python package)☆341Updated last year
- asr2k☆52Updated last year
- Advanced data structures for handling temporal segments with attached labels.☆118Updated 3 weeks ago
- Transcribing Speech with Multinomial Diffusion, training code and models.☆80Updated 2 years ago
- [EMNLP Main '25] LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation☆129Updated 4 months ago
- Pytorch Implementation of WaveNODE☆64Updated 5 years ago
- ☆37Updated 5 months ago
- NPTEL2020: Speech2Text dataset for Indian-English Accent☆77Updated 3 years ago
- Read-only mirror of Pynini☆150Updated last month