Yuan-ManX / ai-audio-datasets
AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications.
☆462Updated 2 weeks ago
Related projects: ⓘ
- Audio Dataset for training CLAP and other models☆615Updated 7 months ago
- Official PyTorch implementation of BigVGAN (ICLR 2023)☆841Updated 2 weeks ago
- Pytorch implementation of the CREPE pitch tracker☆397Updated 3 months ago
- A paper and project list about the cutting edge Speech Synthesis, Text-to-Speech (TTS), Singing Voice Synthesis (SVS), Voice Conversion (…☆399Updated last year
- LP-MusicCaps: LLM-Based Pseudo Music Captioning [ISMIR23]☆266Updated 5 months ago
- Keep track of big models in audio domain, including speech, singing, music etc.☆431Updated 8 months ago
- Learning audio concepts from natural language supervision☆458Updated 3 months ago
- a list of demo websites for automatic music generation research☆602Updated this week
- MU-LLaMA: Music Understanding Large Language Model☆219Updated 5 months ago
- Daily tracking of awesome audio papers, including music generation, zero-shot tts, asr, audio generation☆306Updated this week
- All-In-One Music Structure Analyzer☆402Updated 4 months ago
- DeepAFx-ST - Style transfer of audio effects with differentiable signal processing. Please see https://csteinmetz1.github.io/DeepAFx-ST/☆357Updated last year
- Official implementation of the paper "Acoustic Music Understanding Model with Large-Scale Self-supervised Training".☆294Updated 4 months ago
- AudioLDM training, finetuning, evaluation and inference.☆191Updated 3 months ago
- An Open-source Streaming High-fidelity Neural Audio Codec☆402Updated 3 months ago
- Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis☆763Updated last month
- VISinger 2: High-Fidelity End-to-End Singing Voice Synthesis Enhanced by Digital Signal Processing Synthesizer☆309Updated 2 months ago
- An opensource music processing toolkit☆309Updated last year
- The Open Source Code of UniAudio☆509Updated last month
- Metadata, scripts and baselines for the MTG-Jamendo dataset☆262Updated 2 months ago
- Mustango: Toward Controllable Text-to-Music Generation☆323Updated last month
- This toolbox aims to unify audio generation model evaluation for easier comparison.☆286Updated 3 months ago
- Implementation of MusicLM, a text to music model published by Google Research, with a few modifications.☆511Updated last year
- This is a list of sound, audio and music development tools which contains machine learning, audio generation, audio signal processing, so…☆289Updated last week
- Implementation of Band Split Roformer, SOTA Attention network for music source separation out of ByteDance AI Labs☆395Updated last month
- The official code repo for "Zero-shot Audio Source Separation through Query-based Learning from Weakly-labeled Data", in AAAI 2022☆183Updated 2 years ago
- Official PyTorch implementation of Contrastive Learning of Musical Representations☆306Updated last month
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates @ INTERSPEECH 2022☆272Updated last year
- Collection of audio-focused loss functions in PyTorch☆719Updated last month
- Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.☆544Updated last month