bmilde / german-asr-lm-tools
Crawling and creating a German language model resource
☆19 Updated 3 years ago
Alternatives and similar repositories for german-asr-lm-tools
Users who are interested in german-asr-lm-tools are comparing it to the libraries listed below.
- Scripts for training Kaldi for German speech recognition (ASR). ☆24 Updated 4 years ago
- Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone ☆36 Updated 3 years ago
- Source code of paper <End-to-End Language Diarization for Bilingual Code-switching Speech> ☆19 Updated 3 years ago
- Steps to perform text-based speaker diarization with the Kaldi toolkit ☆11 Updated 6 years ago
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core ☆15 Updated 2 years ago
- Artie Bias Corpus: an audio corpus + code for detecting demographic bias ☆20 Updated 5 years ago
- Workflow for forced alignment between languages ☆20 Updated last year
- ☆17 Updated 2 years ago
- ☆11 Updated 3 years ago
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION ☆43 Updated 2 years ago
- Phoneme alignment representation compatible with multiple forced aligners ☆21 Updated last year
- Rescoring methods for end-to-end Automatic Speech Recognition ☆27 Updated 4 years ago
- Linguistic processing for Common Voice ☆57 Updated last year
- Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y… ☆25 Updated 6 years ago
- Balanced Error Rate for Speaker Diarization ☆32 Updated 2 years ago
- SHAS: Approaching optimal Segmentation for End-to-End Speech Translation ☆40 Updated 2 years ago
- Properly handle position-dependent phones in a subword lexicon FST ☆31 Updated 4 years ago
- Official implementation of the paper "Distilling a Pretrained Language Model to a Multilingual ASR Model" (Interspeech 2022) ☆12 Updated last year
- Getting confidences from any end-to-end systems ☆11 Updated 2 years ago
- [INTERSPEECH 2024] Official code for VoxSim: A perceptual voice similarity dataset ☆10 Updated 3 months ago
- Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form. It has a highly es… ☆19 Updated 4 years ago
- ☆17 Updated 4 years ago
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories. ☆25 Updated 6 months ago
- ☆22 Updated 4 years ago
- Simple Kaldi recipe for forced alignment ☆10 Updated 2 years ago
- SERAB: a multi-lingual benchmark for speech emotion recognition ☆28 Updated 2 years ago
- Using YouTube to prepare a speech recognition dataset for any language ☆10 Updated 4 years ago
- Transfer learning approach to pronunciation scoring ☆10 Updated last year
- BurrMill core ☆21 Updated 3 years ago
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering ☆21 Updated last year