german-asr / kaldi-germanView external linksLinks
Scripts for training Kaldi for German speech recognition (ASR).
☆26Feb 11, 2021Updated 5 years ago
Alternatives and similar repositories for kaldi-german
Users that are interested in kaldi-german are comparing it to the libraries listed below
Sorting:
- Crawling and creating a German language model resource☆18Aug 23, 2022Updated 3 years ago
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- A merged version of multiple open-source German speech datasets.☆34May 3, 2024Updated last year
- Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone☆35Feb 18, 2022Updated 3 years ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Mar 30, 2021Updated 4 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆15Mar 26, 2022Updated 3 years ago
- Evaluation of STT models for german language☆15Jan 22, 2022Updated 4 years ago
- Java Bindings for the C++ library DeepSpeech☆10Jun 4, 2020Updated 5 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- Open Source Crimean Tatar Text-to-Speech datasets☆14Feb 23, 2025Updated 11 months ago
- ☆11May 7, 2022Updated 3 years ago
- Official PyTorch implementation of (ICME2025 oral) "AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-…☆17Feb 1, 2026Updated 2 weeks ago
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach☆20Aug 2, 2021Updated 4 years ago
- CMU multilingual speech repository☆30Apr 15, 2022Updated 3 years ago
- Web page for ISCA Special Interest Group: Robust Speech Processing (RoSP)☆11Dec 4, 2023Updated 2 years ago
- This is the experimental description of MnTTS2.☆11Apr 11, 2024Updated last year
- the first industrial-scale open-source Kazakh speech corpus. KSC2 corpus subsumes the previously introduced two corpora: KSC and KazakhTT…☆56Jul 30, 2021Updated 4 years ago
- ☆14Jun 12, 2015Updated 10 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 3 years ago
- The kinyarwanda model for deepspeech☆17May 11, 2021Updated 4 years ago
- NMT based punctuation prediction system using lexical and acoustic features .☆14Mar 30, 2020Updated 5 years ago
- Scripts for training general-purpose large vocabulary German acoustic models for ASR with Kaldi.☆175Aug 9, 2023Updated 2 years ago
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.☆16Jul 22, 2021Updated 4 years ago
- A duration-invariant audio-to-lyrics alignment pipeline with low memory footprint which segments long music recordings via a recursive bi…☆15Oct 13, 2022Updated 3 years ago
- Google's TPGST reimplementation.☆34Dec 11, 2019Updated 6 years ago
- wake-up word emotion recognition [APSIPA 2022]☆17Nov 11, 2022Updated 3 years ago
- Example workflow for our data-centric speech benchmark☆17Jul 6, 2023Updated 2 years ago
- A recipe for creating a Speaker Identification system built on Kaldi.☆15Jan 2, 2020Updated 6 years ago
- Open source cross-platform implementation of MRCP protocol☆20Mar 3, 2022Updated 3 years ago
- Code for SLT 2016 paper on Grapheme-to-Phoneme conversion using attention based encoder-decoder models☆15Feb 20, 2019Updated 6 years ago
- ☆16Dec 23, 2021Updated 4 years ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system☆36Oct 28, 2019Updated 6 years ago
- ☆15May 8, 2021Updated 4 years ago
- ☆20Jul 22, 2022Updated 3 years ago
- Crowdsourced and Automatic Speech Prominence Estimation☆24Apr 12, 2024Updated last year
- MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline. (Accepted by IALP'2022)☆22Dec 5, 2022Updated 3 years ago
- End-to-end Speech Translation☆35Apr 12, 2021Updated 4 years ago
- Emotion Recognition from Brazilian Portuguese Informal Spontaneous Speech☆21Mar 21, 2022Updated 3 years ago
- Artie Bias Corpus: an audio corpus + code for detecting demographic bias☆20Jul 21, 2020Updated 5 years ago