NVIDIA / speechsquad
Conversational AI Benchmark.
☆66Updated last year
Alternatives and similar repositories for speechsquad:
Users that are interested in speechsquad are comparing it to the libraries listed below
- ☆75Updated 3 years ago
- A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models☆31Updated 3 years ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode☆111Updated 2 years ago
- A simple implementation of the paper https://arxiv.org/pdf/1910.00716v1.pdf☆31Updated 3 years ago
- ☆56Updated 2 years ago
- Tensor2tensor experiment with SpecAugment☆46Updated 5 years ago
- Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva☆85Updated last month
- ASR project with pytorch-lightning☆20Updated this week
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…☆44Updated 3 years ago
- Online (real-time) decoder to be used with DeepSpeech2 model☆24Updated 5 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 2 years ago
- Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data.☆11Updated 4 years ago
- A Benchmark Dataset for Understanding Disfluencies in Question Answering☆62Updated 3 years ago
- Demo and samples for universal speech translator☆23Updated 2 years ago
- Demos, samples, and experimental code for Lingvo.☆58Updated last year
- Automatically constructing corpus for automatic speech recognition from YouTube videos☆154Updated 5 years ago
- Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)☆103Updated 2 years ago
- A Fast Sequence Transducer Implementation with PyTorch Bindings☆198Updated 2 years ago
- A TensorFlow Implementation of Punctuation Restoration.☆18Updated 4 years ago
- Comprehensive Python library for speech and voice.☆32Updated 2 years ago
- Sample C++ command-line Riva clients.☆32Updated last week
- A lightweight library to compute Diarization Error Rate (DER).☆59Updated last year
- Sequence Modelling with CTC☆48Updated 2 years ago
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.☆26Updated 8 months ago
- ☆49Updated 3 years ago
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆64Updated last year
- Code for AccentDB.☆20Updated 3 years ago
- Example implementation of Monotonic Chunkwise Attention.☆51Updated 7 years ago
- A pytroch implementation of the EETS: End-to-End Adversarial Text-to-Speech☆128Updated 4 years ago
- ☆15Updated 6 years ago