bmilde / german-asr-lm-tools
Crawling and creating a German language model resource
☆18Updated 3 years ago
Alternatives and similar repositories for german-asr-lm-tools
Users who are interested in german-asr-lm-tools are comparing it to the repositories listed below
- Scripts for training Kaldi for German speech recognition (ASR).☆26Updated 4 years ago
- Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone☆36Updated 3 years ago
- ☆17Updated 2 years ago
- Source code for the paper "End-to-End Language Diarization for Bilingual Code-switching Speech"☆19Updated 3 years ago
- Getting confidences from any end-to-end system☆11Updated 2 years ago
- Dataset of ICASSP 2021: Multilingual Phonetic Dataset for Low Resource Speech Recognition☆46Updated 2 years ago
- Phoneme alignment representation compatible with multiple forced aligners☆22Updated last year
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…☆18Updated 3 years ago
- Steps to perform text-based speaker diarization with the Kaldi toolkit☆12Updated 7 years ago
- Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model☆33Updated 5 years ago
- Properly handle position-dependent phones in a subword lexicon FST☆31Updated 5 years ago
- Workflow for forced alignment between languages☆23Updated last year
- Balanced Error Rate for Speaker Diarization☆33Updated 2 years ago
- Simple Kaldi recipe for forced alignment☆11Updated 2 years ago
- ☆12Updated 4 years ago
- Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y…☆25Updated 6 years ago
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …☆43Updated 3 years ago
- Lattice combination algorithm to combine inaccurate transcripts with hypothesis lattices☆16Updated last year
- ☆17Updated 6 years ago
- [INTERSPEECH 2024] Official code for VoxSim: A perceptual voice similarity dataset☆12Updated 3 months ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Updated 3 years ago
- Linguistic processing for Common Voice☆58Updated last year
- Crowdsourced and Automatic Speech Prominence Estimation☆23Updated last year
- ☆17Updated 4 years ago
- Scripts to align a given wave file to its transcription using models trained with Kaldi☆35Updated 6 years ago
- Deploy Kaldi models using gRPC for bidirectional streaming.☆17Updated last year
- SHAS: Approaching optimal Segmentation for End-to-End Speech Translation☆40Updated 2 years ago
- Wake word spotting with Kaldi☆19Updated 5 years ago
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆10Updated last year
- Python wrappers for Kaldi's Levenshtein distance and alignment code.☆68Updated 7 months ago