wsntxxn / TextToAudioGroundingLinks
The dataset and baseline code for Text-to-Audio Grounding (TAG)
☆45Updated 3 months ago
Alternatives and similar repositories for TextToAudioGrounding
Users that are interested in TextToAudioGrounding are comparing it to the libraries listed below
Sorting:
- ☆42Updated 2 years ago
- Implementation of our paper 'On Metric Learning For Audio-Text Cross-Modal Retrieval'☆49Updated 3 years ago
- A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models (ICASSP 2024)☆57Updated last year
- Source code for the paper 'Audio Captioning Transformer'☆57Updated 3 years ago
- Audio-Visual Corruption Modeling of our paper "Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling an…☆35Updated 2 years ago
- Tools for the evaluation of audio captioning.☆17Updated 5 years ago
- Audio captioning recipe☆49Updated 10 months ago