speechcatcher-asr / speechcatcherLinks
☆46Updated 2 months ago
Alternatives and similar repositories for speechcatcher
Users that are interested in speechcatcher are comparing it to the libraries listed below
Sorting:
- Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone☆35Updated 3 years ago
- ☆11Updated 4 months ago
- Python module to clean and transliterate (i.e. normalize) German text including abbreviations, numbers, timestamps etc. It can be used to…☆35Updated 5 years ago
- ☆57Updated 2 years ago
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…☆33Updated 2 years ago
- Linguistic processing for Common Voice☆58Updated 2 years ago
- Evaluation of STT models for german language☆15Updated 4 years ago
- Properly handle position-dependent phones in a subword lexicon FST☆31Updated 5 years ago
- ☆37Updated 2 months ago
- IPA Phonetic dataset lexicon☆18Updated 2 weeks ago
- ☆17Updated 2 years ago
- Online streaming speaker change detection model in Pytorch☆44Updated 2 years ago
- Automatic Speech Recognition (ASR) system for the Samrómur speech corpus using Kaldi☆12Updated 3 years ago
- Crawling and creating a German language model resource☆18Updated 3 years ago
- ☆22Updated 4 years ago
- Simple diarization model☆53Updated 7 months ago
- Phonetically-Oriented Word Error Rate☆36Updated 6 years ago
- This repo related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH2024☆33Updated 2 weeks ago
- Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup☆79Updated 6 months ago
- Various speech datasets made available to the public☆130Updated last year
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION☆46Updated 2 years ago
- ☆30Updated last year
- Speakerbox: Fine-tune Audio Transformers for speaker identification.☆59Updated last year
- A merged version of multiple open-source German speech datasets.☆34Updated last year
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆119Updated 2 years ago
- Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition"☆17Updated 4 years ago
- Code for the Paper Speech Recognition and Multi-Speaker Diarization of Long Conversations☆38Updated 2 years ago
- Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.☆24Updated 11 months ago
- ☆76Updated this week
- Python wrappers for Kaldi Levenshtein's distance and alignment code.☆68Updated 3 weeks ago