soupdtag / speak-tool
A tool to collect/validate audio recordings from workers on Amazon Mechanical Turk. Written in Python/Flask. (originally hosted on github.mit.edu)
☆12 · Updated 2 years ago
Alternatives and similar repositories for speak-tool
Users who are interested in speak-tool are comparing it to the repositories listed below.
- Simple Kaldi recipe for forced alignment ☆10 · Updated last year
- An extension of PHOIBLE that includes features for allophones. ☆10 · Updated last year
- A Python-based modular toolbox for building Deep Neural Network models (using PyTorch) for statistical parametric speech synthesis ☆23 · Updated 3 years ago
- Steps to perform text-based speaker diarization with the Kaldi toolkit ☆11 · Updated 6 years ago
- Balanced Error Rate for Speaker Diarization ☆32 · Updated 2 years ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW… ☆17 · Updated 2 years ago
- Contains code for Deep Self-Supervised Hierarchical Clustering for Speaker Diarization ☆17 · Updated 3 years ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units ☆17 · Updated 7 months ago
- Tool for creating Kaldi nnet3 recipes using the International Phonetic Alphabet (IPA) ☆9 · Updated 3 years ago
- Experiments on speech recognition robustness to accents and dialects ☆12 · Updated 6 years ago
- A fundamental frequency estimation algorithm using features from the magnitude and phase spectrogram. ☆24 · Updated 4 years ago
- Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition" ☆17 · Updated 3 years ago
- SERAB: a multi-lingual benchmark for speech emotion recognition ☆28 · Updated 2 years ago
- Implementation of the paper "Confidence estimation for attention based sequence to sequence models for speech recognition" ☆16 · Updated 4 years ago
- Web page for ISCA Special Interest Group: Robust Speech Processing (RoSP) ☆11 · Updated last year
- Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment ☆12 · Updated 3 months ago
- Baseline Kaldi script for UA-SPEECH corpus ☆30 · Updated 7 months ago
- Script to generate VAD dataset used in Asteroid recipe ☆17 · Updated 3 years ago
- Phoneme alignment representation compatible with multiple forced aligners ☆21 · Updated last year
- A collection of papers related to speech model compression ☆24 · Updated last year
- PyPI-installable TDNN and TDNN-F layers for PyTorch-based acoustic model training ☆39 · Updated 4 years ago
- Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task. ☆23 · Updated 2 months ago
- Implementation of different noise embeddings for noise-aware training of Kaldi acoustic models. ☆13 · Updated 4 years ago