NVIDIA / speechsquadLinks
Conversational AI Benchmark.
☆68Updated 2 years ago
Alternatives and similar repositories for speechsquad
Users that are interested in speechsquad are comparing it to the libraries listed below
Sorting:
- ☆76Updated 4 years ago
- Automatically constructing corpus for automatic speech recognition from YouTube videos☆156Updated 5 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 3 years ago
- Automatic speech recognition using neural networks☆18Updated 5 years ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode☆111Updated 3 years ago
- A PyTorch implementation of DeepSpeech and DeepSpeech2.☆50Updated 6 years ago
- BotSIM - a data-efficient end-to-end Bot SIMulation toolkit for evaluation, diagnosis, and improvement of commercial chatbots☆115Updated 6 months ago
- A Benchmark Dataset for Understanding Disfluencies in Question Answering☆64Updated 4 years ago
- A simple implementation of the paper https://arxiv.org/pdf/1910.00716v1.pdf☆31Updated 3 years ago
- ☆57Updated 4 years ago
- Collection of models and extensions for deployment in PyTorch☆24Updated 3 years ago
- A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models☆31Updated 4 years ago
- Online (real-time) decoder to be used with DeepSpeech2 model☆25Updated 5 years ago
- A deep learning library based on Pytorch focussed on low resource language research and robustness☆70Updated 3 years ago
- Applications using the GTN library and code to reproduce experiments in "Differentiable Weighted Finite-State Transducers"☆83Updated 3 years ago
- bumble bee transformer☆14Updated 4 years ago
- Dataset Release for Intent Classification from Speech☆47Updated 8 months ago
- Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downs…☆32Updated 4 years ago
- Automatic Speech Recognition Dataset Generation☆37Updated 7 years ago
- Tensorflow with KenLM integrated for beam search scoring☆34Updated 8 years ago
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆64Updated 2 years ago
- German Tacotron 2 and Multi-band MelGAN in TensorFlow with TF Lite inference support☆25Updated 4 years ago
- Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data.☆11Updated 5 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model☆107Updated 4 years ago
- Feature extractor for DL speech processing.☆66Updated 3 years ago
- Attention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)☆52Updated 6 years ago
- Repository containing experimentation platform on how to train, infer on wav2vec2 models.☆88Updated 3 years ago
- Python API for reading and querying ARPA formatted language models.☆33Updated 11 years ago
- SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio…☆36Updated 6 months ago
- Jupyter Notebooks for creating Speech datasets☆46Updated 6 years ago