AI Audio Datasets (AI-ADS) π΅, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications.
β939Jul 8, 2025Updated 10 months ago
Alternatives and similar repositories for ai-audio-datasets
Users that are interested in ai-audio-datasets are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Audio Codec Speech processing Universal PERformance Benchmarkβ305May 5, 2026Updated 2 weeks ago
- State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.β1,796Jan 26, 2026Updated 3 months ago
- Ultra-low bitrate neural audio codec (0.31~1.40 kbps) with a better semantic in the latent space.β250Mar 7, 2025Updated last year
- Unified automatic quality assessment for speech, music, and sound.β719Jun 5, 2025Updated 11 months ago
- TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalizationβ103Feb 5, 2024Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available β’ AdRun AI, ML, and HPC workloads on powerful cloud GPUsβwithout limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- [ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"β373Sep 3, 2024Updated last year
- Official repository of the paper "MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization".β339Aug 4, 2025Updated 9 months ago
- Audio Dataset for training CLAP and other modelsβ738Jan 8, 2026Updated 4 months ago
- A 6-million Audio-Caption Paired Dataset Built with a LLMs and ALMs-based Automatic Pipelineβ202Dec 13, 2024Updated last year
- The Open Source Code of UniAudio