egorsmkv/asr-corpus-creator

This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.

☆27

Alternatives and similar repositories for asr-corpus-creator

Users who are interested in asr-corpus-creator are comparing it to the repositories listed below.

AIRI-Institute / AI4TALK
☆13 · Dec 7, 2022 · Updated 3 years ago
homink / kaldi-asr.forced_decoding
Perform the forced decoding with target transcription
☆11 · Sep 12, 2018 · Updated 7 years ago
nii-yamagishilab / speaker_sex_attribute_privacy
Project for "Hiding Speaker’s Sex in Speech Using Zero-Evidence Speaker Representation in an Analysis/Synthesis Pipeline"
☆15 · Nov 30, 2022 · Updated 3 years ago
lociko / ukraine_itn_wfst
Simple WFST for Ukrainian ITN based on NVIDIA NeMo and Pynini
☆19 · Oct 21, 2025 · Updated 9 months ago
krylm / whisper-event-tuning
Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.
☆12 · Dec 24, 2022 · Updated 3 years ago
miccio-dk / NISQA
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16 · Apr 13, 2022 · Updated 4 years ago
egorsmkv / optimized-whisper
Use quantized versions of Whisper to speed up inference
☆12 · Oct 16, 2024 · Updated last year
daanzu / wenet_stt_python
☆33 · Nov 27, 2021 · Updated 4 years ago
hecko-yes / tts-dataset-prompts
Finally, some decent sample sentences
☆24 · Dec 3, 2023 · Updated 2 years ago
Archivoice / nnsvs-chinese-support
Hed and supporting files for Chinese NNSVS Dataset Creation
☆13 · Oct 14, 2025 · Updated 9 months ago
Open-Speech-EkStep / crowdsource-dataplatform
This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…
☆17 · Mar 6, 2023 · Updated 3 years ago
Hannes1 / react-native-wenet
Wenet speech to text for react native
☆10 · Nov 1, 2022 · Updated 3 years ago
m-wiesner / nnet_pytorch
Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.
☆26 · Jul 25, 2024 · Updated 2 years ago
xinjli / transphone
phoneme tokenizer and grapheme-to-phoneme model for 8k languages
☆174 · Jun 9, 2023 · Updated 3 years ago
homink / speech.ko
Korean read speech corpus (about 120 hours, 17GB) from National Institute of Korean Language
☆43 · Feb 28, 2018 · Updated 8 years ago
xinjli / alqalign
multilingual speech aligner
☆78 · Nov 19, 2023 · Updated 2 years ago
eatsleepraverepeat / reMUDE
(re)Implementation of Learning Multi-level Dependencies for Robust Word Recognition
☆17 · Jul 25, 2024 · Updated 2 years ago
nvidia-riva / riva-asrlib-decoder
Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva
☆91 · Feb 18, 2025 · Updated last year
Minzard / Correctable-Pronunciation
An application that uses deep learning to help people with dysarthria improve their pronunciation
☆10 · Dec 29, 2020 · Updated 5 years ago
EgorLakomkin / KTSpeechCrawler
Automatically constructing corpus for automatic speech recognition from YouTube videos
☆157 · Feb 15, 2020 · Updated 6 years ago
stefan-it / ukrainian-electra
Ukrainian ELECTRA model
☆12 · Mar 11, 2023 · Updated 3 years ago
RF5 / transfusion-asr
Transcribing Speech with Multinomial Diffusion, training code and models.
☆80 · Sep 27, 2023 · Updated 2 years ago
motazsaad / ara-pronunciation-tool
A python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based …
☆15 · Sep 5, 2017 · Updated 8 years ago
ryanrudes / YTTTS
The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions
☆53 · Apr 1, 2021 · Updated 5 years ago
ekapolc / gowajee_corpus
Thai smart home corpus with "Gowajee" hotword
☆19 · Jul 30, 2023 · Updated 2 years ago
iisys-hof / HUI-Audio-Corpus-German
This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…
☆35 · Mar 31, 2023 · Updated 3 years ago
meyersbs / SPLAT
Speech Processing & Linguistic Analysis Tool
☆11 · Jun 30, 2019 · Updated 7 years ago
tiro-is / tiro-speech-core
This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core
☆15 · Jun 19, 2023 · Updated 3 years ago
proger / uk
Phonograms and syntagmas: processing tools
☆21 · Jun 21, 2025 · Updated last year
patyork / AutomaticSpeechChunker
From a large speech audio file and its corresponding body of text, automatically chunk the audio and text into (phrase, audio_snippet) pa…
☆17 · May 15, 2015 · Updated 11 years ago
luomingshuang / k2-speechbrain
In this repository, I try to combine k2 with speechbrain to decode well and quickly.
☆16 · Jun 17, 2022 · Updated 4 years ago
dbklim / StressRNN
Modified version of RusStress (https://github.com/MashaPo/russtress) — python package for placing stress in Russian text using RNN (BiLST…
☆46 · Aug 7, 2024 · Updated last year
Idlak / Living-Audio-Dataset
A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …
☆43 · Aug 3, 2022 · Updated 3 years ago
Aria-K-Alethia / laughter-synthesis
Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…
☆77 · Jul 16, 2023 · Updated 3 years ago
Prem-kumar27 / Fast-KTSpeechCrawler
Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler
☆23 · Mar 21, 2021 · Updated 5 years ago
oatsu-gh / utau_renderer_with_diff_svc
Render wav and convert it with [Diff-SVC](https://github.com/prophesier/diff-svc) model
☆10 · Aug 24, 2025 · Updated 11 months ago
sberdevices / qtacotron
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
☆14 · Mar 17, 2022 · Updated 4 years ago
steveash / jg2p
Grapheme to phoneme toolkit using joint-modelling + CRFs in java
☆15 · Jul 14, 2018 · Updated 8 years ago
mmorise / no7_singing
☆14 · Oct 11, 2024 · Updated last year