voicelab-org / labelit
Flexible, extensible and scalable web-based speech annotation tool
☆13 Updated last month
Alternatives and similar repositories for labelit:
Users who are interested in labelit are comparing it to the repositories listed below:
- A guide to building language technology in new languages. ☆58 Updated 3 years ago
- A phoneme-allophone database for many languages ☆48 Updated 4 years ago
- A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systems ☆201 Updated this week
- ☆56 Updated 2 years ago
- Python package and data files for manipulating phonological segments (phones, phonemes) in terms of universal phonological features. ☆241 Updated 6 months ago
- Various speech datasets made available to the public ☆113 Updated 2 months ago
- ☆42 Updated 3 years ago
- ☆43 Updated 2 years ago
- Universal Romanizer that can convert any unicode script to roman (latin) script ☆177 Updated 6 months ago
- ☆22 Updated 2 years ago
- ☆350 Updated 11 months ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages. ☆81 Updated last year
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper. ☆135 Updated last year
- Unsupervised acoustic word embeddings evaluated on Buckeye English and NCHLT Xitsonga data in Python 3. ☆8 Updated 2 years ago
- Diarization scoring tools. ☆235 Updated last year
- ☆42 Updated 7 years ago
- fiwGAN/ciwGAN (Featural and Categorical InfoWaveGAN): Generative Adversarial Phonology and Semantics ☆23 Updated last year
- Evaluate results from ASR/Speech-to-Text quickly ☆36 Updated 3 years ago
- ☆43 Updated 2 years ago
- Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text ☆240 Updated 5 years ago
- Linguistic processing for Common Voice ☆53 Updated last year
- Forced Alignments for Common Voice ☆31 Updated 4 years ago
- Massively multilingual pronunciation mining ☆331 Updated 3 months ago
- [deprecated] Pretrained models for pyannote-audio 1.x ☆72 Updated 2 years ago
- Server framework for Kaldi ASR Toolkit ☆98 Updated last year
- Spot the conversation: speaker diarisation in the wild ☆134 Updated 2 years ago
- Grapheme To Phoneme ☆70 Updated 6 months ago
- An end-to-end system which makes use of a recurrent encoder-decoder deep neural network to translate speech from the Hindi (Fourth most s… ☆18 Updated 5 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper ☆108 Updated 2 years ago
- A Python toolbox for speech features extraction ☆161 Updated 2 years ago