alumae / kiirkirjutaja
☆51Updated last year
Related projects ⓘ
Alternatives and complementary repositories for kiirkirjutaja
- ☆17Updated last year
- ☆77Updated 6 months ago
- ☆32Updated 2 months ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated 9 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆100Updated last year
- Online streaming speaker change detection model in Pytorch☆36Updated last year
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.☆26Updated 3 months ago
- Properly handle position-dependent phones in a subword lexicon FST☆31Updated 4 years ago
- asr2k☆48Updated 5 months ago
- A collection of utilities for handling IPA phones.☆25Updated last year
- Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone☆37Updated 2 years ago
- Reproducible experimental protocols for multimedia (audio, video, text) database☆84Updated last month
- ☆16Updated 3 years ago
- ☆38Updated 2 years ago
- Python wrappers for Kaldi Levenshtein's distance and alignment code.☆61Updated 8 months ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks☆62Updated 4 years ago
- ☆20Updated 6 years ago
- Grapheme to phoneme model for PyTorch☆40Updated 2 years ago
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…☆42Updated 3 years ago
- ☆56Updated last year
- Forced Alignments for Common Voice☆31Updated 4 years ago
- Segment a given audio into utterances using a trained end-to-end ASR model.☆73Updated 4 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆100Updated last year
- Convert English text from written expressions into spoken forms☆21Updated 2 years ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…☆15Updated last year
- ☆22Updated 3 years ago
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages☆144Updated last year
- Word Error Rate Estimation☆10Updated 4 years ago
- ☆56Updated last year
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…☆26Updated last year