Official implementation of the paper WAV2CLIP: LEARNING ROBUST AUDIO REPRESENTATIONS FROM CLIP
☆358 · Feb 15, 2022 · Updated 4 years ago
Alternatives and similar repositories for lyrebird-wav2clip
Users that are interested in lyrebird-wav2clip are comparing it to the libraries listed below.
- Source code for models described in the paper "AudioCLIP: Extending CLIP to Image, Text and Audio" (https://arxiv.org/abs/2106.13043) ☆863 · Sep 30, 2021 · Updated 4 years ago
- Official PyTorch implementation of the TIP paper "Generating Visually Aligned Sound from Videos" and the corresponding Visually Aligned S… ☆54 · Dec 15, 2020 · Updated 5 years ago
- Contrastive Language-Audio Pretraining ☆2,113 · May 15, 2025 · Updated 11 months ago
- Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021) ☆371 · Jul 12, 2024 · Updated last year
- VGGSound: A Large-scale Audio-Visual Dataset ☆357 · Sep 13, 2021 · Updated 4 years ago
- Audio Dataset for training CLAP and other models ☆734 · Jan 8, 2026 · Updated 3 months ago
- Official PyTorch implementation of Contrastive Learning of Musical Representations ☆335 · Jul 25, 2024 · Updated last year
- melodic object transcription framework ☆26 · Nov 15, 2017 · Updated 8 years ago
- Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand". ☆472 · Apr 24, 2024 · Updated last year
- Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer". ☆419 · Aug 14, 2022 · Updated 3 years ago
- Code for the paper "Unsupervised Contrastive Learning of Sound Event Representations", ICASSP 2021. ☆93 · Dec 22, 2022 · Updated 3 years ago
- A lightweight library for Frechet Audio Distance calculation. ☆313 · Feb 11, 2026 · Updated 2 months ago
- Official implementation of "Learning Music Audio Representations Via Weak Language Supervision" (ICASSP 2022) ☆47 · Dec 3, 2024 · Updated last year
- An audio classification system for learning with out-of-distribution data ☆33 · Dec 8, 2022 · Updated 3 years ago
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning' ☆100 · Jul 24, 2024 · Updated last year
- Learning audio concepts from natural language supervision ☆652 · Sep 18, 2024 · Updated last year
- ☆58 · Nov 2, 2020 · Updated 5 years ago
- Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications ☆90 · Dec 20, 2024 · Updated last year
- Efficient Training of Audio Transformers with Patchout ☆374 · Jan 12, 2024 · Updated 2 years ago
- Implementation of "Audio Retrieval with Natural Language Queries: A Benchmark Study". ☆54 · Jul 16, 2025 · Updated 9 months ago
- This repo hosts the code and models of "Masked Autoencoders that Listen". ☆657 · Apr 5, 2024 · Updated 2 years ago
- Making an AI-generated music video from any song with Wav2CLIP and VQGAN-CLIP ☆246 · Jun 10, 2022 · Updated 3 years ago
- Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer". ☆1,448 · May 21, 2023 · Updated 2 years ago
- 🔊 Repository for our NAACL-HLT 2019 paper: AudioCaps ☆207 · Oct 6, 2025 · Updated 6 months ago
- PyTorch Dataset for Speech and Music audio