LAION-AI / CLAPLinks

Contrastive Language-Audio Pretraining

☆1,787

Alternatives and similar repositories for CLAP

Users that are interested in CLAP are comparing it to the libraries listed below

Sorting:

LAION-AI / audio-dataset
Audio Dataset for training CLAP and other models
☆701Updated last year
microsoft / CLAP
Learning audio concepts from natural language supervision
☆584Updated 11 months ago
archinetai / audio-diffusion-pytorch
Audio generation using diffusion models, in PyTorch.
☆2,067Updated 2 years ago
lucidrains / audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
☆2,578Updated 7 months ago
AndreyGuzhov / AudioCLIP
Source code for models described in the paper "AudioCLIP: Extending CLIP to Image, Text and Audio" (https://arxiv.org/abs/2106.13043)
☆833Updated 3 years ago
NVIDIA / BigVGAN
Official PyTorch implementation of BigVGAN (ICLR 2023)
☆1,085Updated 11 months ago
YuanGongND / ast
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
☆1,330Updated 2 years ago
facebookresearch / AudioMAE
This repo hosts the code and models of "Masked Autoencoders that Listen".
☆610Updated last year
descriptinc / descript-audio-codec
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
☆1,539Updated this week
teticio / audio-diffusion
Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead of images.
☆770Updated 10 months ago
Yuan-ManX / ai-audio-datasets
AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI mo…
☆810Updated last month
bytedance / SALMONN
SALMONN family: A suite of advanced multi-modal LLMs
☆1,300Updated last month
EmulationAI / awesome-large-audio-models
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
☆689Updated last year
yizhilll / MERT
Official implementation of the paper "Acoustic Music Understanding Model with Large-Scale Self-supervised Training".
☆393Updated 2 months ago
YuanGongND / ltu
Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".
☆450Updated last year
archinetai / audio-ai-timeline
A timeline of the latest AI models for audio generation, starting in 2023!
☆1,902Updated last year
haoheliu / audioldm_eval
This toolbox aims to unify audio generation model evaluation for easier comparison.
☆353Updated 10 months ago
seungheondoh / lp-music-caps
LP-MusicCaps: LLM-Based Pseudo Music Captioning [ISMIR23]
☆337Updated last year
zhvng / open-musiclm
Implementation of MusicLM, a text to music model published by Google Research, with a few modifications.
☆549Updated 2 years ago
spotify-research / llark
Code for the paper "LLark: A Multimodal Instruction-Following Language Model for Music" by Josh Gardner, Simon Durand, Daniel Stoller, an…
☆358Updated last year
declare-lab / tango
A family of diffusion models for text-to-audio generation.
☆1,189Updated 3 weeks ago
yangdongchao / UniAudio
The Open Source Code of UniAudio
☆573Updated last year
NVIDIA / audio-flamingo
PyTorch implementation of Audio Flamingo: Series of Advanced Audio Understanding Language Models
☆714Updated this week
haoheliu / AudioLDM
AudioLDM: Generate speech, sound effects, music and beyond, with text.
☆2,723Updated last month
lucidrains / voicebox-pytorch
Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch
☆659Updated 10 months ago
gemelo-ai / vocos
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis
☆971Updated last year
Audio-AGI / AudioSep
Official implementation of "Separate Anything You Describe"
☆1,773Updated 8 months ago
iver56 / torch-audiomentations
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
☆1,074Updated 7 months ago
facebookresearch / audiobox-aesthetics
Unified automatic quality assessment for speech, music, and sound.
☆566Updated 2 months ago
csteinmetz1 / auraloss
Collection of audio-focused loss functions in PyTorch
☆802Updated last year