Official implementation of the paper WAV2CLIP: LEARNING ROBUST AUDIO REPRESENTATIONS FROM CLIP
☆359Feb 15, 2022Updated 4 years ago
Alternatives and similar repositories for lyrebird-wav2clip
Users that are interested in lyrebird-wav2clip are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Source code for models described in the paper "AudioCLIP: Extending CLIP to Image, Text and Audio" (https://arxiv.org/abs/2106.13043)☆868Sep 30, 2021Updated 4 years ago
- Official PyTorch implementation of the TIP paper "Generating Visually Aligned Sound from Videos" and the corresponding Visually Aligned S…☆54Dec 15, 2020Updated 5 years ago
- Contrastive Language-Audio Pretraining☆2,157May 15, 2025Updated last year
- Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)☆371Jul 12, 2024Updated last year
- VGGSound: A Large-scale Audio-Visual Dataset☆357Sep 13, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Audio Dataset for training CLAP and other models☆738Jan 8, 2026Updated 4 months ago
- Official PyTorch implementation of Contrastive Learning of Musical Representations☆336Jul 25, 2024Updated last year
- melodic object transcription framework☆26Nov 15, 2017Updated 8 years ago
- Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".☆472Apr 24, 2024Updated 2 years ago
- Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".☆423Aug 14, 2022Updated 3 years ago
- A lightweight library for Frechet Audio Distance calculation.☆313Feb 11, 2026Updated 3 months ago
- Code for the paper "Unsupervised Contrastive Learning of Sound Event Representations", ICASSP 2021.☆93Dec 22, 2022Updated 3 years ago
- Official implementation of "Learning Music Audio Representations Via Weak Language Supervision" (ICASSP 2022)☆47Dec 3, 2024Updated last year
- An audio classification system for learning with out-of-distribution data☆33Dec 8, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆101Jul 24, 2024Updated last year
- Learning audio concepts from natural language supervision☆663Sep 18, 2024Updated last year
- ☆58Nov 2, 2020Updated 5 years ago
- Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications☆92Dec 20, 2024Updated last year
- Efficient Training of Audio Transformers with Patchout☆382Jan 12, 2024Updated 2 years ago
- Code for "Audio Retrieval with Natural Language Queries: A Benchmark Study", Transactions on Multimedia 2022☆54Jul 16, 2025Updated 10 months ago
- Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".☆1,458May 21, 2023Updated 3 years ago
- This repo hosts the code and models of "Masked Autoencoders that Listen".☆664Apr 5, 2024Updated 2 years ago
- Making an AI-generated music video from any song with Wav2CLIP and VQGAN-CLIP☆245Jun 10, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- 🔊 Repository for our NAACL-HLT 2019 paper: AudioCaps☆211Oct 6, 2025Updated 7 months ago
- PyTorch Dataset for Speech and Music audio☆79Jul 12, 2024Updated last year
- Evaluation kit for the HEAR Benchmark☆63Feb 12, 2026Updated 3 months ago
- An Audio Language model for Audio Tasks☆321Apr 19, 2024Updated 2 years ago
- This reporsitory contains metadata of WavCaps dataset and codes for downstream tasks.☆261Jul 25, 2024Updated last year
- Source code for "Sparse in Space and Time: Audio-visual Synchronisation with Trainable Selectors." (Spotlight at the BMVC 2022)☆56Jan 29, 2024Updated 2 years ago
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"☆26Mar 27, 2024Updated 2 years ago
- Pytorch implementation of the paper : A Global-local Attention Framework for Weakly Labelled Audio Tagging.☆13Feb 6, 2021Updated 5 years ago
- Pitch Estimating Neural Networks (PENN)☆272Apr 2, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Python library for downloading, loading & working with sound datasets☆355Sep 23, 2025Updated 8 months ago
- BAD-VAE: A VAE framework for unsupervised disentanglement of sequential data☆12May 25, 2022Updated 4 years ago
- This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training …☆345Nov 20, 2024Updated last year
- Code and Pretrained Models for ICLR 2023 Paper "Contrastive Audio-Visual Masked Autoencoder".☆291Mar 20, 2024Updated 2 years ago
- The official code repo for "Zero-shot Audio Source Separation through Query-based Learning from Weakly-labeled Data", in AAAI 2022☆212Jul 14, 2022Updated 3 years ago
- A python script for extracting loops from audio files.☆53Jul 26, 2024Updated last year
- ☆511Jun 25, 2024Updated last year