nvidia-riva / cpp-clients

Sample C++ command-line Riva clients.

☆34

Alternatives and similar repositories for cpp-clients

Users who are interested in cpp-clients are comparing it to the repositories listed below.

NVIDIA / NeMo-speech-data-processor
A toolkit for processing speech data and creating speech datasets
☆133Updated this week
nvidia-riva / python-clients
Riva Python client API and CLI utils
☆99Updated last week
nvidia-riva / riva-asrlib-decoder
Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva
☆91Updated 5 months ago
nvidia-riva / tutorials
NVIDIA Riva runnable tutorials
☆140Updated last week
espnet / espnet_onnx
ONNX wrapper for espnet inference model
☆168Updated 10 months ago
NVIDIA / speechsquad
Conversational AI Benchmark.
☆68Updated 2 years ago
nvidia-riva / common
Protocol buffers and other common resources.
☆11Updated last week
NVIDIA / NeMo-text-processing
NeMo text processing for ASR and TTS
☆351Updated this week
huggingface / open_asr_leaderboard
☆116Updated last week
NeuralVox / OpenPhonemizer
An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…
☆101Updated 10 months ago
rendchevi / nix-tts
🐤 Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillation
☆255Updated last year
flashlight / text
Text utilities, including beam search decoding, tokenizing, and more, built for use in Flashlight.
☆69Updated 5 months ago
PINTO0309 / whisper-onnx-tensorrt
ONNX and TensorRT implementation of Whisper
☆64Updated 2 years ago
ccoreilly / wav2vec2-service
☆38Updated 3 years ago
google-research-datasets / cvss
CVSS: A Massively Multilingual Speech-to-Speech Translation Corpus
☆207Updated 2 years ago
revdotcom / speech-datasets
Various speech datasets made available to the public
☆126Updated 7 months ago
lumaku / ctc-segmentation
Segment an audio file and obtain utterance alignments. (Python package)
☆339Updated last year
for-github-backup / deprecated.github.io
☆57Updated 3 years ago
NVIDIA / RAD-MMM
A TTS model that makes a speaker speak new languages
☆76Updated last year
Srijith-rkr / Whispering-LLaMA
EMNLP 23 - Integrating Whisper Encoder to LLaMA Decoder for Generative ASR Error Correction
☆254Updated last year
titu1994 / warprnnt_numba
WarpRNNT loss ported in Numba CPU/CUDA for Pytorch
☆17Updated 3 years ago
Mu-Y / DiariST
☆19Updated last year
RuABraun / texterrors
☆37Updated 3 months ago
sooftware / lightning-asr
Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.
☆46Updated 4 years ago
efeslab / LiteASR
LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation
☆118Updated 2 months ago
Appen / UHV-OTS-Speech
A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.
☆101Updated 2 years ago
Wadaboa / titanet
Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO
☆64Updated 2 years ago
xinjli / asr2k
asr2k
☆52Updated last year
HHousen / speaker-change-detection
Speaker change detection using SincNet and an LSTM/Transformer
☆53Updated 2 months ago
cornerfarmer / ctc_segmentation
Segment a given audio into utterances using a trained end-to-end ASR model.
☆73Updated 4 years ago