NVIDIA / speechsquad
Conversational AI Benchmark.
☆68Updated last year
Alternatives and similar repositories for speechsquad:
Users that are interested in speechsquad are comparing it to the libraries listed below
- A simple implementation of the paper https://arxiv.org/pdf/1910.00716v1.pdf☆31Updated 3 years ago
- ☆75Updated 3 years ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode☆111Updated 2 years ago
- Sample C++ command-line Riva clients.☆33Updated last week
- Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva☆87Updated 2 months ago
- Tensorflow with KenLM integrated for beam search scoring☆34Updated 7 years ago
- ☆56Updated 2 years ago
- Automatically constructing corpus for automatic speech recognition from YouTube videos☆154Updated 5 years ago
- ASR project with pytorch-lightning☆20Updated last month
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…☆45Updated 3 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 2 years ago
- Demo and samples for universal speech translator☆23Updated 2 years ago
- JavaScript deployment for Howl, the wake word detection modeling toolkit for Firefox Voice☆10Updated 4 years ago
- Tensor2tensor experiment with SpecAugment☆46Updated 5 years ago
- A Fast Sequence Transducer Implementation with PyTorch Bindings☆198Updated 2 years ago
- Compute useful transcriptions metrics (CER, WER, SER, ...)☆27Updated 10 years ago
- A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models☆31Updated 4 years ago
- Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni…☆24Updated 3 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model☆107Updated 3 years ago
- Dataset Release for Intent Classification from Speech☆46Updated 2 months ago
- Attention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)☆52Updated 5 years ago
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.☆26Updated 9 months ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆102Updated 2 years ago
- Automatic speech recognition using neural networks☆19Updated 4 years ago
- Demos, samples, and experimental code for Lingvo.☆58Updated last year
- Code for AccentDB.☆20Updated 3 years ago
- The History of Speech Recognition to the Year 2030☆13Updated 3 years ago
- A lightweight library to compute Diarization Error Rate (DER).☆59Updated last year
- Automatic Speech Recognition Dataset Generation☆37Updated 6 years ago
- Applications using the GTN library and code to reproduce experiments in "Differentiable Weighted Finite-State Transducers"☆83Updated 2 years ago