uhh-lt / subtitle2goLinks
☆25Updated last year
Alternatives and similar repositories for subtitle2go
Users that are interested in subtitle2go are comparing it to the libraries listed below
Sorting:
- An even smaller speech recognizer / force aligner☆36Updated 10 months ago
- Coqui Inference Engine☆41Updated 4 years ago
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s…☆28Updated 2 years ago
- Crawling and creating a German language model resource☆18Updated 3 years ago
- 🐸TTS recipes for different datasets☆86Updated 3 years ago
- Coqui STT (🐸STT) based forced alignment tool☆13Updated 3 years ago
- The EveryVoice TTS Toolkit - Text To Speech for your language☆41Updated this week
- Python wrapper for phonetisaurus grapheme to phoneme tool☆12Updated 4 years ago
- ☆54Updated 2 years ago
- ☆17Updated 2 years ago
- Spoken Language Identification on Common Voice and AudioSet using Deep Learning☆40Updated 3 years ago
- 📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.☆22Updated 6 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆151Updated last year
- Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone☆36Updated 3 years ago
- Multilingual Grapheme to Phoneme☆50Updated 9 years ago
- SHAS: Approaching optimal Segmentation for End-to-End Speech Translation☆40Updated 2 years ago
- ☆46Updated 3 weeks ago
- Model for recasing and repunctuating ASR transcripts☆141Updated last year
- Linguistic processing for Common Voice☆58Updated last year
- Proposed splits for the LREC Wikipron paper☆15Updated 5 years ago
- python code for converting among IPA, ARPABET, XSAMPA, Callhome, DISC, TIMIT, plus some lexical tones.☆41Updated last month
- C++ Implementation of the Information Bottleneck System☆23Updated 6 years ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…☆18Updated 2 years ago
- Transfer learning approach to pronunciation scoring☆11Updated last year
- Timething is a library for aligning text transcripts with their audio recordings.☆124Updated 11 months ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆107Updated 2 years ago
- scipts for working with open.bible data☆25Updated 3 years ago
- Praaline is an open-source system to manage, annotate, visualise and analyse spoken language corpora☆30Updated 3 years ago
- This repository creates speaker diarization recipes to be used within the egs folder of kaldi.☆17Updated last year
- ☆13Updated 10 years ago