bookbot-hive / k2-indonesian-asrView external linksLinks
Indonesian speech/phoneme recognizer powered by Kaldi 2.0 (lhotse, icefall, sherpa).
☆15Jun 30, 2023Updated 2 years ago
Alternatives and similar repositories for k2-indonesian-asr
Users that are interested in k2-indonesian-asr are comparing it to the libraries listed below
Sorting:
- [APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Oct 19, 2022Updated 3 years ago
- ☆20Jul 22, 2022Updated 3 years ago
- Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE☆15Nov 30, 2022Updated 3 years ago
- A corpus of diacritized Hebrew texts (טקסט מנוקד)☆11May 4, 2022Updated 3 years ago
- ☆11Nov 5, 2021Updated 4 years ago
- KABooks is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. Using a…☆12Mar 24, 2023Updated 2 years ago
- An upgrade framework for train and validate compare with icefall using Lightning.☆15Mar 26, 2025Updated 10 months ago
- Tools for the automatic detection of speech-related inhalation events and characterisation of the speech respiratory cycle.☆11Feb 17, 2024Updated 2 years ago
- Train a fiwGAN or ciwGAN model using your own training data☆14Oct 13, 2022Updated 3 years ago
- A simple command line tool to calculate WER for ASR.☆14Oct 14, 2024Updated last year
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech☆11May 14, 2025Updated 9 months ago
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model☆13Mar 30, 2025Updated 10 months ago
- A playground for experimenting with acoustic echo cancellation using a microphone, speaker, and ONNX.☆13Oct 22, 2024Updated last year
- Official implementation of the paper "Distilling a Pretrained Language Model to a Multilingual ASR Model" (Interspeech 2022)☆12Mar 12, 2024Updated last year
- ☆13Dec 7, 2022Updated 3 years ago
- A python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based …☆16Sep 5, 2017Updated 8 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 3 years ago
- phone inventory library☆17May 15, 2023Updated 2 years ago
- PyTorch implementation of Miipher-2 [2025] which is a speech restoration model by Google DeepMind☆64Sep 22, 2025Updated 4 months ago
- Crowdsourced and Automatic Speech Prominence Estimation☆24Apr 12, 2024Updated last year
- Thai smart home corpus with "Gowajee" hotword☆18Jul 30, 2023Updated 2 years ago
- MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline. (Accepted by IALP'2022)☆22Dec 5, 2022Updated 3 years ago
- 来自于文章Paraformer-v2: An improved non-autoregressive transformer for noise-robust speech recognition☆27Nov 20, 2024Updated last year
- A wrapper for Audeering's wav2vec-based dimensional speech emotion recognition☆18Aug 9, 2023Updated 2 years ago
- Resources that make every language unique☆26Jan 18, 2026Updated 3 weeks ago
- ☆24Jan 14, 2021Updated 5 years ago
- Enhanced Piper TTS with Japanese support, WebAssembly, multi-GPU training, and quality improvements. Features OpenJTalk integration, brow…☆29Updated this week
- Unofficial pytorch implementation of VISinger: Variational Inference with Adversarial Learning for End-to-end Singing Voice Synthesis (IC…☆19May 12, 2023Updated 2 years ago
- A list of papers for child ASR☆52Oct 8, 2024Updated last year
- Silero VAD(ncnn): pre-trained enterprise-grade Voice Activity Detector.☆24Aug 21, 2024Updated last year
- A handy dataset of noises for ASR☆22May 29, 2019Updated 6 years ago
- End-to-end Text-to-Speech with Generative Adversarial Networks☆20Feb 6, 2021Updated 5 years ago
- TTS-Wrapper makes it easier to use text-to-speech APIs by providing a unified and easy-to-use interface.☆21Jul 26, 2024Updated last year
- Acoustic Neighbor Embeddings☆29Jul 13, 2025Updated 7 months ago
- Corpus of oral arguments (recorded speech + official transcripts) of the United States Supreme Court☆22Dec 8, 2022Updated 3 years ago
- ☆23Oct 17, 2024Updated last year
- Extract phoneme-level timestamps from speeh audio.☆116Updated this week
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆33Oct 23, 2025Updated 3 months ago
- Finally, some decent sample sentences☆23Dec 3, 2023Updated 2 years ago