ozdefir / finetuneas
An HTML interface for finetuning the sync map output from aeneas
☆53 · Updated 2 years ago
Alternatives and similar repositories for finetuneas
Users who are interested in finetuneas are comparing it to the libraries listed below:
- Grapheme To Phoneme ☆70 · Updated 6 months ago
- Jupyter Notebooks for creating Speech datasets ☆46 · Updated 5 years ago
- Speaker diarization python system based on binary key speaker modelling ☆61 · Updated 3 years ago
- Data and code for grapheme-to-phoneme transducers in lots of languages ☆132 · Updated 9 months ago
- Automatically constructing corpus for automatic speech recognition from YouTube videos ☆153 · Updated 4 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts ☆60 · Updated last year
- Adapting your own Language Model for Kaldi ☆64 · Updated 6 years ago
- Use your data to create a speech recognition system in Kaldi. Fast. ☆65 · Updated 5 years ago
- Scripts to align a given wave file to its transcription using models trained with Kaldi ☆32 · Updated 5 years ago
- A tool for automatic phoneme transcription ☆157 · Updated last year
- Server framework for Kaldi ASR Toolkit ☆98 · Updated last year
- DeepSpeech based forced alignment tool ☆235 · Updated 4 years ago
- Forced Alignments for Common Voice ☆31 · Updated 4 years ago
- Pronunciation dictionaries for multiple languages ☆86 · Updated 7 years ago
- ☆34 · Updated 4 months ago
- Python interface for forced audio alignment using HTK and SoX ☆334 · Updated 4 years ago
- A GitHub repository of the abandonware Sequitur G2P by Bisani & Ney ☆157 · Updated 6 months ago
- Discriminative Neural Clustering for Speaker Diarisation ☆78 · Updated 2 years ago
- ARPABET transcription syllabifier module ☆14 · Updated 2 years ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks ☆63 · Updated 4 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech ☆51 · Updated 3 years ago
- Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model ☆34 · Updated 5 years ago
- Support tools for punctuation and boundary detection for ASR output. ☆57 · Updated 2 years ago
- An online speech recognition extension toolkit of Kaldi ☆56 · Updated 3 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding ☆73 · Updated 3 years ago
- Tool for creation, manipulation and maintenance of voice corpora ☆81 · Updated 8 months ago
- A Python toolbox for speech features extraction ☆161 · Updated last year
- This repository ☆30 · Updated 2 years ago
- A recipe for creating a Speaker Identification system built on Kaldi. ☆15 · Updated 5 years ago
- Phonetically-Oriented Word Error Rate ☆33 · Updated 5 years ago