NVIDIA / speechsquad
Conversational AI Benchmark.
☆63Updated last year
Related projects: ⓘ
- ☆74Updated 2 years ago
- A simple implementation of the paper https://arxiv.org/pdf/1910.00716v1.pdf☆31Updated 2 years ago
- Applications using the GTN library and code to reproduce experiments in "Differentiable Weighted Finite-State Transducers"☆80Updated 2 years ago
- A Benchmark Dataset for Understanding Disfluencies in Question Answering☆60Updated 3 years ago
- Automatically constructing corpus for automatic speech recognition from YouTube videos☆151Updated 4 years ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode☆109Updated 2 years ago
- ASR project with pytorch-lightning☆20Updated 4 years ago
- ☆22Updated this week
- Tensor2tensor experiment with SpecAugment☆47Updated 5 years ago
- Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva☆78Updated last month
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…☆41Updated 3 years ago
- Code for AccentDB.☆20Updated 3 years ago
- Demos, samples, and experimental code for Lingvo.☆57Updated last year
- A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models☆31Updated 3 years ago
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.☆15Updated 3 years ago
- ☆56Updated last year
- Sample C++ command-line Riva clients.☆28Updated this week
- Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)☆103Updated last year
- Online (real-time) decoder to be used with DeepSpeech2 model☆24Updated 4 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated last year
- ☆38Updated last year
- Attention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)☆52Updated 5 years ago
- ☆47Updated 2 years ago
- Collection of models and extensions for deployment in PyTorch☆24Updated last year
- Comprehensive Python library for speech and voice.☆33Updated last year
- An open-source tool for automatic speech recognition ASR quality estimation.☆23Updated 4 years ago
- ☆43Updated this week
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model☆106Updated 3 years ago
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.☆26Updated last month