Prem-kumar27 / Fast-KTSpeechCrawlerLinks

Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler

☆24

Alternatives and similar repositories for Fast-KTSpeechCrawler

Users that are interested in Fast-KTSpeechCrawler are comparing it to the libraries listed below

Sorting:

cyfer0618 / kaldi-pytorch-rnnlm
Enable RNNLM lattice rescoring with Pytorch [kaldi]
☆12Updated 5 years ago
amazon-science / proteno
This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…
☆45Updated 4 years ago
revdotcom / words2num
Convert words to numbers
☆20Updated 3 years ago
m-wiesner / nnet_pytorch
Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.
☆26Updated last year
sigmorphon / 2020
SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio…
☆36Updated 3 months ago
lingjzhu / spoken_sent_embedding
Unsupervised spoken sentence embeddings
☆14Updated 2 years ago
idiap / inv-tn
A bunch of scripts exploiting several tools to perform inverse text normalization (ITN)
☆21Updated 7 years ago
egorsmkv / asr-corpus-creator
This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.
☆27Updated last year
speechio / asr-noises
A handy dataset of noises for ASR
☆22Updated 6 years ago
CiscoDevNet / g2p_seq2seq_pytorch
Grapheme to phoneme model for PyTorch
☆41Updated 3 years ago
miccio-dk / NISQA
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16Updated 3 years ago
gpu-poor / gramvaani_hindi_asr
This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge
☆15Updated 3 years ago
xinjli / phonepiece
phone inventory library
☆16Updated 2 years ago
getalp / mass-dataset
MaSS - Multilingual corpus of Sentence-aligned Spoken utterances
☆50Updated 10 months ago
kamperh / globalphone_awe
Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data.
☆11Updated 4 years ago
vivianngo97 / Punctuation_Transcription
A punctuation transcription model to automatically add punctuation marks in an unpunctuated sentence or sentences.
☆15Updated 5 years ago
SpeechColab / PySpeechColab
A library of speech gadgets.
☆13Updated 2 years ago
patrickvonplaten / Wav2Vec2_ParlanceCTCDecode
☆11Updated 3 years ago
MiniXC / phones
A collection of utilities for handling IPA phones.
☆25Updated last year
Chung-I / youtube-asr-crawler
☆10Updated 2 years ago
alumae / torch-xvectors-wav
☆22Updated 4 years ago
EMRAI / emrai-synthetic-diarization-corpus
☆20Updated 6 years ago
MiuLab / Lattice-Transformer-SLU
Source code for ASRU 2019 paper "Adapting Pretrained Transformer to Lattices for Spoken Language Understanding"
☆11Updated 5 years ago
gheyret / UQSpeechDataset
Uyghur Single Speaker Speech Dataset. ウイグル語音声データセット
☆29Updated 3 years ago
kate-egorova / ASR-hybrid-decoding
This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…
☆11Updated 5 years ago
naver / multilingual-distilwhisper
This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.
☆27Updated last year
VITA-Group / Audio-Lottery
[ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…
☆31Updated 3 years ago
TehreemFarooqi / Preparing-a-speech-recognition-dataset-using-YouTube-videos
Using YouTube to prepare a speech recognition dataset for any language
☆10Updated 4 years ago
iamjanvijay / rnnt
An implementation of RNN-Transducer loss in TF-2.0.
☆45Updated 2 years ago
cadia-lvl / punctuation-prediction
Support tools for punctuation and boundary detection for ASR output.
☆57Updated 2 years ago