alexkroman / tiny-audio
Train your own speech AI model from scratch
☆28 Updated last week
Alternatives and similar repositories for tiny-audio
Users who are interested in tiny-audio are comparing it to the libraries listed below.
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling. ☆27 Updated last year
- ☆56 Updated 2 years ago
- Temporary anonymous version ☆22 Updated last year
- GPT for FACodec ☆13 Updated last year
- Pushing the Limits of Zero-shot End-to-End Speech Translation ☆26 Updated last year
- Open Source Speech/Text Data on AI ☆18 Updated 3 years ago
- ☆17 Updated 2 years ago
- asr2k ☆52 Updated last year
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment ☆16 Updated 3 years ago
- ☆22 Updated 4 years ago
- One-shot TTS with Improved Unseen Speaker and Style Transfer ☆37 Updated 3 years ago
- A handy dataset of noises for ASR ☆22 Updated 6 years ago
- ☆19 Updated 9 months ago
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to… ☆45 Updated 4 years ago
- ☆29 Updated 10 months ago
- My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one ☆26 Updated last year
- A collection of all our phonemizers for dataset construction and inference ☆27 Updated 10 months ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper. ☆33 Updated 2 months ago
- Putting flows on top of neural transducers for better TTS ☆64 Updated 2 weeks ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z… ☆32 Updated 3 years ago
- A library of speech gadgets. ☆14 Updated 3 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using… ☆30 Updated 2 years ago
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark ☆34 Updated 7 months ago
- ☆21 Updated 7 years ago
- 56 language, 1 model Multilingual ASR ☆24 Updated 4 years ago
- Collection of scripts from mHuBERT-147. ☆32 Updated last year
- ☆58 Updated last year
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech. ☆13 Updated 2 years ago
- LoRA-based phoneme/prosody control for LLM-based TTS with no G2P - Lightweight adapter for edit and control the target language's phoneme… ☆22 Updated 4 months ago
- Official Code for ParrotTTS ☆58 Updated last year