Sreyan88 / RECAP
Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning
☆11Updated 7 months ago
Alternatives and similar repositories for RECAP:
Users that are interested in RECAP are comparing it to the libraries listed below
- Code for ICLR 2024 Paper: CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models☆14Updated 7 months ago
- Streaming Vocos☆20Updated last month
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech☆10Updated last year
- A toolkit for researchers in the multimodal sound separation.☆16Updated last year
- ☆27Updated 7 months ago
- A spoken version of the textual story cloze benchmark☆14Updated last year
- An official repo for the paper "Adapting Language-Audio Models as Few-Shot Audio Learners"☆30Updated last year