Python code for handling the Clotho dataset.
☆85 · Nov 24, 2020 · Updated 5 years ago
Alternatives and similar repositories for clotho-dataset
Users that are interested in clotho-dataset are comparing it to the libraries listed below.
- Tools for the evaluation of audio captioning. ☆19 · May 23, 2020 · Updated 5 years ago
- Code for the paper: Unified Gradient Reweighting for Model Biasing with Applications to Source Separation ☆14 · Nov 16, 2020 · Updated 5 years ago
- Dataset and baseline for the first Audiocaption task ☆79 · Jul 25, 2024 · Updated last year
- 🔊 Repository for our NAACL-HLT 2019 paper: AudioCaps ☆206 · Oct 6, 2025 · Updated 5 months ago
- Code for phase recovery in MadTwinNet for monaural singing voice separation ☆12 · Jul 17, 2018 · Updated 7 years ago
- Audio captioning baseline system for DCASE 2020 challenge. ☆38 · Aug 22, 2023 · Updated 2 years ago
- Source code for the paper 'Audio Captioning Transformer' ☆56 · Jan 18, 2022 · Updated 4 years ago
- This repository contains metadata of the WavCaps dataset and code for downstream tasks. ☆257 · Jul 25, 2024 · Updated last year
- A list of papers about audio captioning ☆79 · Jul 1, 2022 · Updated 3 years ago
- Audio captioning recipe ☆51 · Oct 23, 2025 · Updated 5 months ago
- Consistent dictionary learning algorithm for signal declipping (Python code) ☆20 · Oct 24, 2018 · Updated 7 years ago
- 2nd place solution for 2020 DCASE challenge task 6 audio captioning. http://dcase.community/challenge2020/task-automatic-audio-captioning… ☆24 · Aug 3, 2023 · Updated 2 years ago
- Unsupervised Domain Adaptation for Acoustic Scene Classification with Wasserstein Distance ☆14 · Sep 16, 2020 · Updated 5 years ago
- Artie Bias Corpus: an audio corpus + code for detecting demographic bias ☆20 · Jul 21, 2020 · Updated 5 years ago
- A list of resources that can help in research for automated audio captioning ☆34 · Feb 17, 2021 · Updated 5 years ago
- Repository for subjective and objective evaluation of source separation algorithms ☆12 · Apr 18, 2018 · Updated 7 years ago
- This repo contains the code to reproduce the paper: "Enriched Music Representations with Multiple Cross-modal Contrastive Learning" ☆15 · Jun 22, 2023 · Updated 2 years ago
- ☆14 · Apr 18, 2019 · Updated 6 years ago
- Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION", accepted at EUSIPCO 2022 ☆15 · Jun 18, 2022 · Updated 3 years ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech. ☆13 · Jun 2, 2023 · Updated 2 years ago
- VGGSound: A Large-scale Audio-Visual Dataset ☆354 · Sep 13, 2021 · Updated 4 years ago
- ☆14 · Jun 12, 2015 · Updated 10 years ago
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation" ☆26 · Mar 27, 2024 · Updated last year
- ☆14 · Mar 25, 2023 · Updated 2 years ago
- PodcastMix: A dataset for separating music and speech in podcasts. ☆44 · Aug 20, 2024 · Updated last year
- Read audio with FFmpeg into NumPy/PyTorch via ctypes (standard library module) ☆11 · Aug 12, 2020 · Updated 5 years ago
- ASR text preprocessing utility ☆21 · Aug 5, 2024 · Updated last year
- Pronunciation-assisted Subword Modeling ☆31 · May 30, 2019 · Updated 6 years ago
- Implementation of "Audio Retrieval with Natural Language Queries: A Benchmark Study". ☆54 · Jul 16, 2025 · Updated 8 months ago
- 👄🇧🇷 Forced phonetic alignment for Brazilian Portuguese ☆13 · Jul 18, 2025 · Updated 8 months ago
- Baseline system for Language-based Audio Retrieval (Task 6B) in DCASE 2023 Challenge ☆10 · Aug 8, 2023 · Updated 2 years ago
- This package aims at simplifying the download of the AudioCaps dataset. ☆36 · Dec 1, 2023 · Updated 2 years ago
- ☆19 · May 9, 2019 · Updated 6 years ago
- A 6-million Audio-Caption Paired Dataset Built with an LLM- and ALM-based Automatic Pipeline ☆198 · Dec 13, 2024 · Updated last year
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models. ☆13 · Feb 13, 2021 · Updated 5 years ago
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech ☆11 · May 14, 2025 · Updated 10 months ago
- ☆12 · Nov 23, 2020 · Updated 5 years ago
- Audio Captioning datasets for PyTorch. ☆127 · Updated this week
- ☆16 · Feb 10, 2026 · Updated last month