Yuan-ManX / ai-audio-datasetsLinks

AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications.

☆793

Alternatives and similar repositories for ai-audio-datasets

Users that are interested in ai-audio-datasets are comparing it to the libraries listed below

Sorting:

LAION-AI / audio-dataset
Audio Dataset for training CLAP and other models
☆693Updated last year
NVIDIA / BigVGAN
Official PyTorch implementation of BigVGAN (ICLR 2023)
☆1,079Updated 11 months ago
liusongxiang / Large-Audio-Models
Keep track of big models in audio domain, including speech, singing, music etc.
☆491Updated 10 months ago
EmulationAI / awesome-large-audio-models
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
☆685Updated last year
maxrmorrison / torchcrepe
Pytorch implementation of the CREPE pitch tracker
☆466Updated 2 months ago
affige / genmusic_demo_list
a list of demo websites for automatic music generation research
☆717Updated this week
yizhilll / MERT
Official implementation of the paper "Acoustic Music Understanding Model with Large-Scale Self-supervised Training".
☆388Updated 2 months ago
guan-yuan / Awesome-Singing-Voice-Synthesis-and-Singing-Voice-Conversion
A paper and project list about the cutting edge Speech Synthesis, Text-to-Speech (TTS), Singing Voice Synthesis (SVS), Voice Conversion (…
☆439Updated 2 years ago
facebookresearch / audiobox-aesthetics
Unified automatic quality assessment for speech, music, and sound.
☆553Updated 2 months ago
haoheliu / AudioLDM-training-finetuning
AudioLDM training, finetuning, evaluation and inference.
☆268Updated 7 months ago
gemelo-ai / vocos
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis
☆964Updated last year
mir-aidj / all-in-one
All-In-One Music Structure Analyzer
☆601Updated last year
metame-ai / awesome-audio-plaza
Daily tracking of awesome audio papers, including music generation, zero-shot tts, asr, audio generation
☆392Updated last week
microsoft / CLAP
Learning audio concepts from natural language supervision
☆578Updated 10 months ago
shansongliu / MU-LLaMA
MU-LLaMA: Music Understanding Large Language Model
☆283Updated last year
YuanGongND / ltu
Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".
☆447Updated last year
haoheliu / audioldm_eval
This toolbox aims to unify audio generation model evaluation for easier comparison.
☆351Updated 10 months ago
seungheondoh / lp-music-caps
LP-MusicCaps: LLM-Based Pseudo Music Captioning [ISMIR23]
☆336Updated last year
lucidrains / BS-RoFormer
Implementation of Band Split Roformer, SOTA Attention network for music source separation out of ByteDance AI Labs
☆588Updated this week
AMAAI-Lab / mustango
Mustango: Toward Controllable Text-to-Music Generation
☆373Updated 2 months ago
yangdongchao / UniAudio
The Open Source Code of UniAudio
☆572Updated last year
descriptinc / descript-audio-codec
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
☆1,521Updated this week
facebookresearch / AudioDec
An Open-source Streaming High-fidelity Neural Audio Codec
☆481Updated 5 months ago
gudgud96 / frechet-audio-distance
A lightweight library for Frechet Audio Distance calculation.
☆286Updated 11 months ago
adobe-research / DeepAFx-ST
DeepAFx-ST - Style transfer of audio effects with differentiable signal processing. Please see https://csteinmetz1.github.io/DeepAFx-ST/
☆388Updated 2 years ago
csteinmetz1 / auraloss
Collection of audio-focused loss functions in PyTorch
☆798Updated last year
zhangyongmao / VISinger2
VISinger 2: High-Fidelity End-to-End Singing Voice Synthesis Enhanced by Digital Signal Processing Synthesizer
☆343Updated 9 months ago
spotify-research / llark
Code for the paper "LLark: A Multimodal Instruction-Following Language Model for Music" by Josh Gardner, Simon Durand, Daniel Stoller, an…
☆356Updated last year
wesbz / SoundStream
This repository is an implementation of this article: https://arxiv.org/pdf/2107.03312.pdf
☆398Updated 3 years ago
zhvng / open-musiclm
Implementation of MusicLM, a text to music model published by Google Research, with a few modifications.
☆549Updated 2 years ago