Official implementation of the paper WAV2CLIP: LEARNING ROBUST AUDIO REPRESENTATIONS FROM CLIP
☆359Feb 15, 2022Updated 4 years ago
Alternatives and similar repositories for lyrebird-wav2clip
Users that are interested in lyrebird-wav2clip are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Source code for models described in the paper "AudioCLIP: Extending CLIP to Image, Text and Audio" (https://arxiv.org/abs/2106.13043)☆871Sep 30, 2021Updated 4 years ago
- Official PyTorch implementation of the TIP paper "Generating Visually Aligned Sound from Videos" and the corresponding Visually Aligned S…☆53Dec 15, 2020Updated 5 years ago
- Contrastive Language-Audio Pretraining☆2,178May 15, 2025Updated last year
- Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)☆372Jul 12, 2024Updated last year
- VGGSound: A Large-scale Audio-Visual Dataset☆359Sep 13, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Audio Dataset for training CLAP and other models☆740Jan 8, 2026Updated 5 months ago
- Official PyTorch implementation of Contrastive Learning of Musical Representations☆337Jul 25, 2024Updated last year
- melodic object transcription framework☆26Nov 15, 2017Updated 8 years ago
- Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".☆474Apr 24, 2024Updated 2 years ago
- Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".☆426Aug 14, 2022Updated 3 years ago
- A lightweight library for Frechet Audio Distance calculation.☆315Feb 11, 2026Updated 4 months ago
- Code for the paper "Unsupervised Contrastive Learning of Sound Event Representations", ICASSP 2021.☆93Dec 22, 2022Updated 3 years ago
- Official implementation of "Learning Music Audio Representations Via Weak Language Supervision" (ICASSP 2022)☆47Dec 3, 2024Updated last year
- An audio classification system for learning with out-of-distribution data☆33Dec 8, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆101Jul 24, 2024Updated last year
- Learning audio concepts from natural language supervision☆665Sep 18, 2024Updated last year
- ☆58Nov 2, 2020Updated 5 years ago
- Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications☆92Dec 20, 2024Updated last year
- Efficient Training of Audio Transformers with Patchout☆383Jan 12, 2024Updated 2 years ago
- Code for "Audio Retrieval with Natural Language Queries: A Benchmark Study", Transactions on Multimedia 2022☆54Jul 16, 2025Updated 11 months ago
- Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".☆1,457May 21, 2023Updated 3 years ago
- This repo hosts the code and models of "Masked Autoencoders that Listen".☆664Apr 5, 2024Updated 2 years ago
- 🔊 Repository for our NAACL-HLT 2019 paper: AudioCaps☆211Oct 6, 2025Updated 8 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- PyTorch Dataset for Speech and Music audio☆79Jul 12, 2024Updated last year
- Evaluation kit for the HEAR Benchmark☆63Feb 12, 2026Updated 4 months ago
- An Audio Language model for Audio Tasks☆322Apr 19, 2024Updated 2 years ago
- This reporsitory contains metadata of WavCaps dataset and codes for downstream tasks.☆261Jul 25, 2024Updated last year
- Source code for "Sparse in Space and Time: Audio-visual Synchronisation with Trainable Selectors." (Spotlight at the BMVC 2022)☆56Jan 29, 2024Updated 2 years ago
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"☆26Mar 27, 2024Updated 2 years ago
- Pytorch implementation of the paper : A Global-local Attention Framework for Weakly Labelled Audio Tagging.☆13Feb 6, 2021Updated 5 years ago
- Pitch Estimating Neural Networks (PENN)☆275Apr 2, 2025Updated last year
- Python library for downloading, loading & working with sound datasets☆355Sep 23, 2025Updated 8 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- BAD-VAE: A VAE framework for unsupervised disentanglement of sequential data☆12May 25, 2022Updated 4 years ago
- This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training …☆351Nov 20, 2024Updated last year
- Code and Pretrained Models for ICLR 2023 Paper "Contrastive Audio-Visual Masked Autoencoder".☆292Mar 20, 2024Updated 2 years ago
- The official code repo for "Zero-shot Audio Source Separation through Query-based Learning from Weakly-labeled Data", in AAAI 2022☆212Jul 14, 2022Updated 3 years ago
- A python script for extracting loops from audio files.☆53Jul 26, 2024Updated last year
- ☆511Jun 25, 2024Updated last year
- Code for Discriminative Sounding Objects Localization (NeurIPS 2020)☆61Jan 19, 2022Updated 4 years ago