ex3ndr / datasets
Declare your datasets and download them using a simple tool
☆9Updated 8 months ago
Alternatives and similar repositories for datasets:
Users that are interested in datasets are comparing it to the libraries listed below
- Supervoice diffusion enhance☆26Updated 8 months ago
- SpeechFlow neural network implementation☆19Updated 7 months ago
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…☆21Updated 2 months ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆28Updated last year
- IPA Phonemizer/Dephonemizer for 136 human languages☆20Updated this week
- Acoustic Neighbor Embeddings☆21Updated 3 months ago
- Enhanced Reverberation As Supervision (ERAS) for unsupervised reverberant speech separation☆11Updated 8 months ago
- Real-time end-to-end singing voice convertion☆21Updated 5 months ago
- Rust crate for some audio utilities☆22Updated 3 weeks ago
- Conformer block with Rotary Position Embedding, modified from lucidrains' implement☆12Updated 6 months ago
- Lyra V2 (SoundStream) running in the browser☆19Updated last year
- Tensor library for Zig☆11Updated 4 months ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆14Updated 3 weeks ago
- Export an ONNX graph that performs ISTFT. Designed for TTS models.☆24Updated 11 months ago
- Simplified recipes for preparing commonly used speech datasets, and a PyTorch-compatible Python data loader that can perform standard fea…☆15Updated last year
- VoiceBox neural network implementation☆105Updated 8 months ago
- Proof of concept for running moshi/hibiki using webrtc☆18Updated last month
- ☆10Updated 10 months ago
- A small rust-based data loader☆24Updated 3 months ago
- StyleTTS 2 Optimized Training Fork☆26Updated 2 months ago
- Repo for the IDESSAI 2024 course on modeling audio with discrete tokens.☆12Updated 6 months ago
- proof of concept conversation orchestrator with a speech-language model☆19Updated 5 months ago
- Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in MLX☆20Updated 5 months ago
- GPT for FACodec☆13Updated last year
- Audio samples accompanying publications related to DF-Conformer, a speech enhancement model.☆20Updated last year
- ☆23Updated last year
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆31Updated last year
- Github.io page for hosting the the synth1K1 dataset☆10Updated 3 years ago
- Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX☆16Updated 5 months ago
- [DEPRECIATED] Very fast, large music transformer with 8k sequence length, efficient heptabit MIDI notes encoding, true full MIDI instrume…☆15Updated last year