zldzmfoq12/VCtube

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zldzmfoq12/VCtube)

zldzmfoq12 / VCtube

A pakage for crawling audio from Youtube

☆42

Alternatives and similar repositories for VCtube

Users that are interested in VCtube are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

cjchun313 / intflowkict_2020_AI_Grand_Challenge
View on GitHub
2020 AI Grand Challenge (3rd track) - public sample
☆16Jan 20, 2021Updated 5 years ago
nc-ai / speech
View on GitHub
☆17Aug 27, 2025Updated 11 months ago
AiTeRLab-GIST / GIST_ASD_DETECTION
View on GitHub
Deep learning based autism spectral disorder detection from children voice
☆42Nov 5, 2025Updated 8 months ago
neosapience / editts
View on GitHub
Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech (INTERSPEECH 2022)
☆122Jan 24, 2023Updated 3 years ago
cjchun313 / 2021_5th_MWP_Generator
View on GitHub
Problem Generator for Math Word Prediction
☆16Nov 28, 2021Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
desh2608 / kaldi-noise-vectors
View on GitHub
Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.
☆13Feb 13, 2021Updated 5 years ago
rishikksh20 / NU-Wave2-pytorch
View on GitHub
NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]
☆25Jul 5, 2022Updated 4 years ago
motazsaad / ara-pronunciation-tool
View on GitHub
A python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based …
☆15Sep 5, 2017Updated 8 years ago
kate-egorova / ASR-hybrid-decoding
View on GitHub
This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…
☆11Feb 4, 2020Updated 6 years ago
intflow / YOLOX_AUDIO
View on GitHub
Audio event detection model based on YOLOX
☆86Nov 27, 2022Updated 3 years ago
fabianoluzbr / neural-g2p-portuguese
View on GitHub
Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form. It has a highly es…
☆19Jun 14, 2021Updated 5 years ago
speechpro / mixup
View on GitHub
☆24Mar 13, 2020Updated 6 years ago
guanlongzhao / ppg-gmm
View on GitHub
Code for paper "Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion"
☆36Jan 15, 2020Updated 6 years ago
r9y9 / segmentation-kit
View on GitHub
Speech Segmentation Toolkit using Julius
☆18Aug 19, 2021Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
TalnUPF / praat_web
View on GitHub
☆13Jun 30, 2026Updated 3 weeks ago
aalto-speech / subword-kaldi
View on GitHub
Properly handle position-dependent phones in a subword lexicon FST
☆31Oct 26, 2020Updated 5 years ago
tachi-hi / tts_samples
View on GitHub
Demo page of our paper Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks With Guided Attention, ICASSP 201…
☆15May 30, 2021Updated 5 years ago
scarletcho / KoG2P
View on GitHub
Korean grapheme-to-phone conversion in Python
☆133Jan 27, 2020Updated 6 years ago
nii-yamagishilab / VCC2020-database
View on GitHub
☆53Dec 18, 2020Updated 5 years ago
NeuroWave-ai / CUCVAE-TTS
View on GitHub
☆25Mar 12, 2022Updated 4 years ago
asappresearch / multistream-cnn
View on GitHub
Multistream CNN for Robust Acoustic Modeling
☆40Jun 17, 2021Updated 5 years ago
AiTeRLab-GIST / GC_track3_DB_GIST
View on GitHub
3rd Grand Challenge track 3 DB developed by GIST
☆35Apr 9, 2021Updated 5 years ago
homink / speech.ko
View on GitHub
Korean read speech corpus (about 120 hours, 17GB) from National Institute of Korean Language
☆43Feb 28, 2018Updated 8 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
egorsmkv / asr-corpus-creator
View on GitHub
This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.
☆27Feb 15, 2024Updated 2 years ago
revdotcom / words2num
View on GitHub
Convert words to numbers
☆21Apr 13, 2022Updated 4 years ago
yoosif0 / arabic_pronounce
View on GitHub
Pronounce Arabic words
☆19May 27, 2019Updated 7 years ago
i3thuan5 / FaNT
View on GitHub
Filtering and Noise Adding Tool
☆29May 27, 2022Updated 4 years ago
Kyubyong / g2pK
View on GitHub
g2pK: g2p module for Korean
☆271Mar 1, 2022Updated 4 years ago
nii-yamagishilab / speaker_sex_attribute_privacy
View on GitHub
Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE
☆15Nov 30, 2022Updated 3 years ago
luomingshuang / k2-speechbrain
View on GitHub
In this repository, I try to combine k2 with speechbrain to decode well and fastly.
☆16Jun 17, 2022Updated 4 years ago
jisang93 / VISinger
View on GitHub
Unofficial pytorch implementation of VISinger: Variational Inference with Adversarial Learning for End-to-end Singing Voice Synthesis (IC…
☆20May 12, 2023Updated 3 years ago
Diamondfan / Child-ASR-Paper
View on GitHub
A list of papers for child ASR
☆54Oct 8, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
pilot7747 / VoxDIY
View on GitHub
This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.
☆16Jul 22, 2021Updated 5 years ago
ronggong / MIREX-2018-Automatic-Lyrics-to-Audio-Alignment
View on GitHub
Util code, issues, discussions
☆29Aug 31, 2018Updated 7 years ago
Tomiinek / Blizzard2013_Segmentation
View on GitHub
Transcripts and segmentation for the Blizzard 2013 audiobooks also known as the Lessac or Blizzard 2013 dataset.
☆45Nov 13, 2019Updated 6 years ago
gerazov / prosodeep
View on GitHub
Deep understanding and modelling of the hierarchical structure of prosody
☆25May 12, 2019Updated 7 years ago
sooftware / ksponspeech
View on GitHub
Pre-processing KsponSpeech corpus (Korean Speech dataset) provided by AI Hub.
☆97Dec 24, 2021Updated 4 years ago
keonlee9420 / FastPitchFormant
View on GitHub
PyTorch Implementation of NCSOFT's FastPitchFormant: Source-filter based Decomposed Modeling for Speech Synthesis
☆74Aug 3, 2021Updated 4 years ago
kan-bayashi / INTERSPEECH19_TUTORIAL
View on GitHub
Interspeech 2019 tutorial materials
☆49Sep 26, 2019Updated 6 years ago