This reporsitory contains metadata of WavCaps dataset and codes for downstream tasks.
☆257Jul 25, 2024Updated last year
Alternatives and similar repositories for WavCaps
Users that are interested in WavCaps are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Source code for the paper 'Audio Captioning Transformer'☆56Jan 18, 2022Updated 4 years ago
- A 6-million Audio-Caption Paired Dataset Built with a LLMs and ALMs-based Automatic Pipeline☆200Dec 13, 2024Updated last year
- 🔊 Repository for our NAACL-HLT 2019 paper: AudioCaps☆207Oct 6, 2025Updated 6 months ago
- Audio Captioning datasets for PyTorch.☆128Mar 25, 2026Updated 2 weeks ago
- Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".☆473Apr 24, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Code for CVSSP submission to DCASE 2021 Task 6