ylacombe / scripts_and_notebooks
A list of scripts/notebooks I'd like to keep handy
ā17Updated 8 months ago
Alternatives and similar repositories for scripts_and_notebooks:
Users that are interested in scripts_and_notebooks are comparing it to the libraries listed below
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.ā27Updated last year
- Repository for fine-tuning Transformers š¤ based seq2seq speech models in JAX/Flax.ā35Updated 2 years ago
- ā20Updated 2 years ago
- ā11Updated 2 months ago
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to sā¦ā28Updated 2 years ago
- ā46Updated 2 years ago
- A collection of all our phonemeizers for dataset construction and inferenceā22Updated 2 months ago
- A TTS model that makes a speaker speak new languagesā76Updated 10 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GPā¦ā95Updated 6 months ago
- Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translationā143Updated last year
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a lā¦ā23Updated 9 months ago
- Audio tokenization, in the fastest way possible!ā51Updated 8 months ago
- Open TTS models, built for streaming on the edgeā41Updated last month
- Prosodic Speech Segmentation with Transformersā25Updated last year
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawlerā24Updated 4 years ago
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptionsā51Updated 4 years ago
- Putting flows on top of neural transducers for better TTSā62Updated last month
- ā56Updated 2 years ago
- Create training data for training a voice cloner for bark text to speech.ā44Updated last year
- 'Grad-TTS' with Multilingual Cleanersā10Updated last year
- Swarah: Indian-English speech dataset collected across the countryā30Updated last year
- ā10Updated last year
- asr2kā50Updated 11 months ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languagesā13Updated 2 years ago
- a simple system for 2-way interruptible voice interactions between human and LLMā28Updated last year
- Dippy Synthetic Speech Subnetā16Updated last month
- A python package for whisper normalizerā58Updated last week
- Low-latency ASR using SpeechBrain StreamingASR and torchaudio StreamReader.ā17Updated 2 weeks ago
- ā17Updated 4 years ago
- š« check your data, before you wreck your modelā16Updated 2 years ago