Official implementation of the paper WAV2CLIP: LEARNING ROBUST AUDIO REPRESENTATIONS FROM CLIP
☆357Feb 15, 2022Updated 4 years ago
Alternatives and similar repositories for lyrebird-wav2clip
Users that are interested in lyrebird-wav2clip are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Source code for models described in the paper "AudioCLIP: Extending CLIP to Image, Text and Audio" (https://arxiv.org/abs/2106.13043)☆862Sep 30, 2021Updated 4 years ago
- Official PyTorch implementation of the TIP paper "Generating Visually Aligned Sound from Videos" and the corresponding Visually Aligned S…☆54Dec 15, 2020Updated 5 years ago
- Contrastive Language-Audio Pretraining☆2,078May 15, 2025Updated 10 months ago
- Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)☆371Jul 12, 2024Updated last year
- VGGSound: A Large-scale Audio-Visual Dataset☆355Sep 13, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Audio Dataset for training CLAP and other models☆732Jan 8, 2026Updated 2 months ago
- Official PyTorch implementation of Contrastive Learning of Musical Representations☆335Jul 25, 2024Updated last year
- melodic object transcription framework☆26Nov 15, 2017Updated 8 years ago
- Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".☆473Apr 24, 2024Updated last year
- Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".☆418Aug 14, 2022Updated 3 years ago
- A lightweight library for Frechet Audio Distance calculation.☆312Feb 11, 2026Updated last month
- Code for the paper "Unsupervised Contrastive Learning of Sound Event Representations", ICASSP 2021.☆93Dec 22, 2022Updated 3 years ago
- Official implementation of "Learning Music Audio Representations Via Weak Language Supervision" (ICASSP 2022)☆47Dec 3, 2024Updated last year
- An audio classification system for learning with out-of-distribution data☆33Dec 8, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆101Jul 24, 2024Updated last year
- Learning audio concepts from natural language supervision☆651Sep 18, 2024Updated last year
- ☆58Nov 2, 2020Updated 5 years ago
- Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications☆90Dec 20, 2024Updated last year
- Efficient Training of Audio Transformers with Patchout☆371Jan 12, 2024Updated 2 years ago
- Implementation of "Audio Retrieval with Natural Language Queries: A Benchmark Study".☆54Jul 16, 2025Updated 8 months ago
- Making an AI-generated music video from any song with Wav2CLIP and VQGAN-CLIP☆245Jun 10, 2022Updated 3 years ago
- Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".☆1,440May 21, 2023Updated 2 years ago
- This repo hosts the code and models of "Masked Autoencoders that Listen".☆654Apr 5, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- 🔊 Repository for our NAACL-HLT 2019 paper: AudioCaps☆207Oct 6, 2025Updated 5 months ago
- PyTorch Dataset for Speech and Music audio☆79Jul 12, 2024Updated last year
- Evaluation kit for the HEAR Benchmark☆63Feb 12, 2026Updated last month
- An Audio Language model for Audio Tasks☆319Apr 19, 2024Updated last year
- This reporsitory contains metadata of WavCaps dataset and codes for downstream tasks.☆257Jul 25, 2024Updated last year
- Source code for "Sparse in Space and Time: Audio-visual Synchronisation with Trainable Selectors." (Spotlight at the BMVC 2022)☆55Jan 29, 2024Updated 2 years ago
- Pytorch implementation of the paper : A Global-local Attention Framework for Weakly Labelled Audio Tagging.☆13Feb 6, 2021Updated 5 years ago
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"☆26Mar 27, 2024Updated 2 years ago
- Pitch Estimating Neural Networks (PENN)☆271Apr 2, 2025Updated 11 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Python library for downloading, loading & working with sound datasets☆352Sep 23, 2025Updated 6 months ago
- This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training …☆335Nov 20, 2024Updated last year
- BAD-VAE: A VAE framework for unsupervised disentanglement of sequential data☆12May 25, 2022Updated 3 years ago
- Code and Pretrained Models for ICLR 2023 Paper "Contrastive Audio-Visual Masked Autoencoder".☆288Mar 20, 2024Updated 2 years ago
- The official code repo for "Zero-shot Audio Source Separation through Query-based Learning from Weakly-labeled Data", in AAAI 2022☆210Jul 14, 2022Updated 3 years ago
- A python script for extracting loops from audio files.☆52Jul 26, 2024Updated last year
- ☆509Jun 25, 2024Updated last year