speechcatcher-asr / speechcatcher
☆39 · Updated 3 weeks ago
Alternatives and similar repositories for speechcatcher:
Users who are interested in speechcatcher compare it to the repositories listed below.
- ☆10 · Updated this week
- Python module to clean and transliterate (i.e. normalize) German text including abbreviations, numbers, timestamps etc. It can be used to… ☆32 · Updated 4 years ago
- Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone ☆35 · Updated 3 years ago
- Scripts for training general-purpose large vocabulary German acoustic models for ASR with Kaldi. ☆173 · Updated last year
- Forced Alignments for Common Voice ☆31 · Updated 4 years ago
- Linguistic processing for Common Voice ☆55 · Updated last year
- ☆35 · Updated 3 weeks ago
- Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup ☆70 · Updated 7 months ago
- Properly handle position-dependent phones in a subword lexicon FST ☆31 · Updated 4 years ago
- BBB plugin for automatic subtitles in conference calls ☆29 · Updated 2 years ago
- Phoneme tokenizer and grapheme-to-phoneme model for 8k languages ☆157 · Updated last year
- Various speech datasets made available to the public ☆116 · Updated 3 months ago
- Online streaming speaker change detection model in PyTorch ☆38 · Updated last year
- Python library for handling audio datasets. ☆137 · Updated last year
- Scripts to simplify data prepping for Mozilla DeepSpeech. ☆14 · Updated 5 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing. ☆102 · Updated 2 years ago
- ☆39 · Updated last year
- Machine learning speaker characteristics ☆33 · Updated this week
- Crawling and creating a German language model resource ☆19 · Updated 2 years ago
- ☆54 · Updated last year
- Zero-shot multimodal punctuation insertion and truecasing using Whisper ☆111 · Updated 2 years ago
- ☆76 · Updated last year
- Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection ☆62 · Updated last week
- Spoken Language Identification on Common Voice and AudioSet using Deep Learning ☆38 · Updated 2 years ago
- Coqui Inference Engine ☆38 · Updated 3 years ago
- A collection of utilities for handling IPA phones. ☆25 · Updated last year
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI… ☆21 · Updated 7 months ago
- A merged version of multiple open-source German speech datasets. ☆31 · Updated 11 months ago
- Unofficial implementation of Miipher ☆120 · Updated 11 months ago
- ☆43 · Updated 2 years ago