bookbot-hive / k2-indonesian-asr
Indonesian speech/phoneme recognizer powered by Kaldi 2.0 (lhotse, icefall, sherpa).
☆15 Updated 2 years ago
Alternatives and similar repositories for k2-indonesian-asr
Users who are interested in k2-indonesian-asr are comparing it to the libraries listed below.
- MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline. (Accepted by IALP'2022) ☆22 Updated 3 years ago
- g2p ID: Indonesian Grapheme-to-Phoneme Converter ☆27 Updated last year
- Transfer learning approach to pronunciation scoring ☆11 Updated last year
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper. ☆33 Updated last month
- ☆19 Updated 3 years ago
- Official implementation of the APSIPA 2022 paper: Exploring Speaker Age Estimation on Different Self-Supervised Learning Models ☆14 Updated 3 years ago
- Official implementation of the paper "Distilling a Pretrained Language Model to a Multilingual ASR Model" (Interspeech 2022) ☆12 Updated last year
- ☆25 Updated 3 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment ☆16 Updated 3 years ago
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition" ☆17 Updated 5 years ago
- ☆14 Updated last year
- ☆17 Updated 2 years ago
- ☆11 Updated 4 years ago
- Convert English text from written expressions into spoken forms ☆27 Updated 3 years ago
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo… ☆33 Updated 2 years ago
- Data manipulation and transformation for audio signal processing, powered by PyTorch ☆10 Updated last year
- A handy dataset of noises for ASR ☆22 Updated 6 years ago
- [EMNLP 2025 Findings] Official code for EZ-VC: Easy Zero-shot Any-to-Any Voice Conversion ☆30 Updated 3 months ago
- Wenet speech to text for react native ☆10 Updated 3 years ago
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion ☆36 Updated last year
- ☆28 Updated 2 years ago
- S3PRL for Speech Emotion Recognition (see s3prl > downstream) ☆15 Updated 10 months ago
- A simple command line tool to calculate WER for ASR. ☆14 Updated last year
- ☆13 Updated 4 years ago
- Just another FastSpeech 2 but cleaner code :) ☆28 Updated last year
- ☆37 Updated last year
- Prosodic Speech Segmentation with Transformers ☆26 Updated last year
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l… ☆25 Updated last year
- Onset-and-Offset-Aware Sound Event Detection ☆20 Updated 10 months ago
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech ☆10 Updated 7 months ago