uhh-lt / subtitle2go
β27Updated last year
Alternatives and similar repositories for subtitle2go:
Users that are interested in subtitle2go are comparing it to the libraries listed below
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of codeβ145Updated 9 months ago
- Forced Alignments for Common Voiceβ31Updated 4 years ago
- πΈTTS recipes for different datasetsβ85Updated 2 years ago
- Spoken Language Identification on Common Voice and AudioSet using Deep Learningβ37Updated 2 years ago
- β39Updated last month
- Aalto Automatic Speech Recognition toolsβ86Updated 7 years ago
- Crawling and creating a German language model resourceβ19Updated 2 years ago
- The human speaks a language with an accent. A particular accent necessarily reflects a person's linguistic background. The model defines β¦β58Updated 3 years ago
- A crash course for training speech recognition models using DeepSpeech.β24Updated 3 years ago
- Coqui Inference Engineβ38Updated 3 years ago
- Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphoneβ36Updated 3 years ago
- My guide to create an italian TTS with Coquiβ14Updated 3 years ago
- Proposed splits for the LREC Wikipron paperβ14Updated 4 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.β101Updated last year
- Python module to clean and transliterate (i.e. normalize) German text including abbreviations, numbers, timestamps etc. It can be used toβ¦β32Updated 4 years ago
- An even smaller speech recognizer / force alignerβ32Updated 2 months ago
- Interface for using TTS and vocoder models in the form of a text editorβ19Updated 2 years ago
- Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Pythonβ18Updated last year
- π LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.β22Updated 5 years ago
- DeepSpeech based forced alignment toolβ237Updated 4 years ago
- Support tools for punctuation and boundary detection for ASR output.β57Updated 2 years ago
- Scripts to simplify data prepping for Mozilla DeepSpeech.β14Updated 5 years ago
- Python wrapper for phonetisaurus grapheme to phoneme toolβ12Updated 3 years ago
- Command line tool to create corpora for Common Voiceβ75Updated 8 months ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speechβ¦β17Updated last year
- TTS Client for Coqui TTS serverβ13Updated 2 years ago
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone β¦β41Updated 2 years ago
- Audiobook alignment for Indigenous languagesβ38Updated last week
- Adapting your own Language Model for Kaldiβ64Updated 6 years ago
- Scripts for training general-purpose large vocabulary German acoustic models for ASR with Kaldi.β173Updated last year