ex3ndr / datasets
Declare your datasets and download them using a simple tool
☆10Updated 10 months ago
Alternatives and similar repositories for datasets
Users who are interested in datasets compare it to the libraries listed below
- Supervoice diffusion enhance☆27Updated 11 months ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆28Updated 2 years ago
- SpeechFlow neural network implementation☆21Updated 10 months ago
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…☆21Updated 5 months ago
- dMel: Speech Tokenization Made Simple☆13Updated last month
- Rust crate for some audio utilities☆24Updated 3 months ago
- Acoustic Neighbor Embeddings☆24Updated 6 months ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆15Updated last month
- Python package of MP-SENet from Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement.☆13Updated 7 months ago
- Neural model for prediction of stress position in Russian words☆11Updated this week
- The EveryVoice TTS Toolkit - Text To Speech for your language☆36Updated this week
- A small rust-based data loader☆29Updated 2 weeks ago
- ☆44Updated this week
- Project of Singing Voice Conversion.☆14Updated last year
- GPT-style network for phonemization with durations of text☆66Updated last year
- Conformer block with Rotary Position Embedding, modified from lucidrains' implementation☆14Updated 9 months ago
- Lyra V2 (SoundStream) running in the browser☆19Updated last year
- ☆26Updated 3 weeks ago
- Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in MLX☆20Updated 8 months ago
- ⚡ Blazing fast audio augmentation in Python, powered by GPU for high-efficiency processing in machine learning and audio analysis tasks.☆33Updated last year
- Audio Entailment: Deductive Reasoning for Audio Understanding☆13Updated 6 months ago
- ☆10Updated last year
- GPT for FACodec☆13Updated last year
- Open TTS models, built for streaming on the edge☆43Updated 3 months ago
- Unofficial implementation of wavenext vocoder☆47Updated 10 months ago
- Enhanced Reverberation As Supervision (ERAS) for unsupervised reverberant speech separation☆12Updated 10 months ago
- ☆13Updated 3 years ago
- Melody Lyric Transformer Implementation and Model☆10Updated 2 years ago
- IPA Phonemizer/Dephonemizer for 139 human languages☆27Updated 2 months ago
- VoiceBox neural network implementation☆109Updated 10 months ago