hetpandya / youtube_tts_data_generator
A Python library to generate speech datasets from YouTube videos
☆36 · Jun 7, 2024 · Updated last year
Alternatives and similar repositories for youtube_tts_data_generator
Users that are interested in youtube_tts_data_generator are comparing it to the libraries listed below
- Telegram Bot to Remove Image Background ☆12 · Apr 18, 2021 · Updated 4 years ago
- Compute WER and SER for speech recognition evaluation ☆26 · Dec 15, 2025 · Updated 2 months ago
- The official repository for the paper “NonVerbalSpeech-38K: A Scalable Pipeline for Enabling Non-Verbal Speech Generation and Understandi… ☆63 · Dec 26, 2025 · Updated last month
- Open TTS models, built for streaming on the edge ☆45 · Mar 16, 2025 · Updated 10 months ago
- Jupyter Notebooks for creating Speech datasets ☆46 · Mar 3, 2019 · Updated 6 years ago
- Official repository of Wavehax vocoder ☆66 · Dec 20, 2025 · Updated last month
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling. ☆27 · Feb 15, 2024 · Updated 2 years ago
- ☆37 · Sep 21, 2025 · Updated 4 months ago
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark ☆35 · May 7, 2025 · Updated 9 months ago
- Python wrappers for Kaldi Levenshtein's distance and alignment code. ☆68 · Jan 5, 2026 · Updated last month
- Official code for "EmoVoice: LLM-based Emotional Text-To-Speech Model with Freestyle Text Prompting" ☆109 · Oct 16, 2025 · Updated 4 months ago
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion ☆36 · Jun 6, 2024 · Updated last year
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s… ☆28 · Mar 14, 2023 · Updated 2 years ago
- Token-Level Supervised Contrastive Learning for Punctuation Restoration ☆29 · Sep 8, 2021 · Updated 4 years ago
- ☆246 · Dec 21, 2025 · Updated last month
- AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data ☆35 · Dec 31, 2023 · Updated 2 years ago
- Streamable Text-to-Speech model using a language modeling approach, without vector quantization ☆110 · May 20, 2025 · Updated 8 months ago
- Apply machine learning model DTLN for noise suppression and acoustic echo cancellation on Raspberry Pi ☆90 · Jan 3, 2022 · Updated 4 years ago
- Grapheme to phoneme conversion with deep learning. ☆420 · Dec 8, 2023 · Updated 2 years ago
- Finetuning VITS Efficiently ☆33 · Nov 6, 2023 · Updated 2 years ago
- Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downs… ☆32 · Mar 20, 2021 · Updated 4 years ago
- D&M Landing Page Engine - OpenSource PHP landing page engine/constructor to create landing pages with dynamic content ☆10 · May 19, 2017 · Updated 8 years ago
- Creates CMM script that can directly executed on Kaggle from easy merge script ☆14 · Jan 12, 2026 · Updated last month
- Text Normalization utilities for normalizing text for TTS ☆20 · Updated this week
- This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very … ☆50 · Jan 26, 2026 · Updated 3 weeks ago
- Multilingual G2P in 100 languages ☆374 · May 26, 2023 · Updated 2 years ago
- Koel Labs innovates open-source speech research, inclusive speech technologies, and real-time pronunciation feedback for language learner… ☆18 · Dec 7, 2025 · Updated 2 months ago
- Code for "Zero-Shot Out-of-Distribution Detection with Feature Correlations" ☆13 · Jan 19, 2020 · Updated 6 years ago
- Using large language models to maintain AI_CHANGELOG.md ☆14 · Jul 15, 2024 · Updated last year
- A chrome extension that notifies when ChatGPT is done speaking ☆11 · Aug 9, 2024 · Updated last year
- 👀 VITRina: VIsual Token Representations ☆11 · Jun 15, 2023 · Updated 2 years ago
- ☆34 · Feb 9, 2026 · Updated last week
- Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories ☆19 · Apr 10, 2025 · Updated 10 months ago
- ☆37 · Nov 22, 2025 · Updated 2 months ago
- Modified version of RusStress (https://github.com/MashaPo/russtress) - python package for placing stress in Russian text using RNN (BiLST… ☆42 · Aug 7, 2024 · Updated last year
- Official PyTorch inference code for the Interspeech 2025 paper: Efficient Speech Enhancement via Embeddings from Pre-trained Generative A… ☆75 · Jun 16, 2025 · Updated 8 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP… ☆106 · Oct 9, 2024 · Updated last year
- Undetectable fanfiction.net Downloader with Parallel Downloading ☆10 · Oct 23, 2021 · Updated 4 years ago
- Manage mikrotik devices ☆14 · Jan 24, 2023 · Updated 3 years ago