NVIDIA / speechsquad
Conversational AI Benchmark.
☆65Updated last year
Alternatives and similar repositories for speechsquad:
Users that are interested in speechsquad are comparing it to the libraries listed below
- ☆74Updated 3 years ago
- Automatically constructing corpus for automatic speech recognition from YouTube videos☆153Updated 4 years ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode☆111Updated 2 years ago
- Tensor2tensor experiment with SpecAugment☆47Updated 5 years ago
- Sample C++ command-line Riva clients.☆31Updated last week
- A simple implementation of the paper https://arxiv.org/pdf/1910.00716v1.pdf☆31Updated 2 years ago
- A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models☆31Updated 3 years ago
- A Benchmark Dataset for Understanding Disfluencies in Question Answering☆62Updated 3 years ago
- Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva☆83Updated last month
- Applications using the GTN library and code to reproduce experiments in "Differentiable Weighted Finite-State Transducers"☆81Updated 2 years ago
- ASR project with pytorch-lightning☆20Updated 4 years ago
- Demos, samples, and experimental code for Lingvo.☆58Updated last year
- Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)☆103Updated 2 years ago
- Code for AccentDB.☆19Updated 3 years ago
- Sequence Modelling with CTC☆47Updated 2 years ago
- This will hold the data pipeline to convert raw audio data to speech which will act as input dataset for speech-to-text pipeline☆32Updated last year
- Emotive Speech generation based on DAVID: An open-source platform for real-time emotional speech transformation using pysox☆13Updated 6 years ago
- MozoLM: A language model (LM) serving library☆44Updated 2 months ago
- ☆48Updated 2 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆100Updated last year
- Tensorflow with KenLM integrated for beam search scoring☆34Updated 7 years ago
- automatically align transcribed audio and generate a wav2letter training corpus☆36Updated last year
- ☆41Updated 2 years ago
- Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni…☆24Updated 3 years ago
- Coqui Inference Engine☆38Updated 3 years ago
- Grapheme to phoneme model for PyTorch☆40Updated 2 years ago
- Comprehensive Python library for speech and voice.☆33Updated 2 years ago