ylacombe / scripts_and_notebooks
A list of scripts/notebooks I'd like to keep handy
★16 · Updated 7 months ago
Alternatives and similar repositories for scripts_and_notebooks:
Users who are interested in scripts_and_notebooks are comparing it to the repositories listed below
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling. ★27 · Updated last year
- Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax. ★35 · Updated 2 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages ★13 · Updated 2 years ago
- A simple voice conversion tool ★17 · Updated 3 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using… ★28 · Updated last year
- Open TTS models, built for streaming on the edge ★39 · Updated 3 weeks ago
- A TTS model that makes a speaker speak new languages ★76 · Updated 9 months ago
- A collection of utilities for handling IPA phones. ★25 · Updated last year
- Speaker change detection using SincNet and an LSTM/Transformer ★49 · Updated 9 months ago
- SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs ★15 · Updated last year
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech. ★13 · Updated last year
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model. ★12 · Updated 2 years ago
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l… ★22 · Updated 8 months ago
- Collection of scripts from mHuBERT-147. ★24 · Updated 4 months ago
- Create training data for training a voice cloner for bark text to speech. ★44 · Updated last year
- A collection of all our phonemizers for dataset construction and inference ★22 · Updated last month
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler ★23 · Updated 4 years ago
- asr2k ★49 · Updated 10 months ago
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s… ★27 · Updated 2 years ago
- ★12 · Updated 2 months ago
- ★56 · Updated 9 months ago
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions ★51 · Updated 4 years ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP… ★95 · Updated 6 months ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit. ★50 · Updated 2 years ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ★61 · Updated this week
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories. ★21 · Updated 3 weeks ago
- ★24 · Updated last year
- Demo for 2022 Interspeech ★29 · Updated 2 years ago
- ★86 · Updated this week
- StyleTTS 2 Optimized Training Fork ★27 · Updated 2 months ago