This repository contains the code for our ECCV 2022 paper "Temporal and cross-modal attention for audio-visual zero-shot learning"
☆25Sep 12, 2025Updated 5 months ago
Alternatives and similar repositories for TCAF-GZSL
Users that are interested in TCAF-GZSL are comparing it to the libraries listed below
Sorting:
- This repository contains the code for our CVPR 2022 paper on "Audio-visual Generalised Zero-shot Learning with Cross-modal Attention and …☆43Nov 29, 2022Updated 3 years ago
- The code repo for ICASSP 2023 Paper "MMCosine: Multi-Modal Cosine Loss Towards Balanced Audio-Visual Fine-Grained Learning"☆25May 18, 2023Updated 2 years ago
- Interaction Compass: Multi-Label Zero-Shot Learning of Human-Object Interactions via Spatial Relations @ ICCV21☆13Jul 15, 2022Updated 3 years ago
- Implementation of "Audio Retrieval with Natural Language Queries: A Benchmark Study".☆54Jul 16, 2025Updated 7 months ago
- code for COLING paper "A Hybrid Model of Classification and Generation for Spatial Relation Extraction"☆10Oct 20, 2022Updated 3 years ago
- ☆28Mar 15, 2021Updated 4 years ago
- The unofficial implementation of paper, "Objects that Sound", from ECCV 2018.☆31Jan 29, 2024Updated 2 years ago
- ☆31Sep 20, 2021Updated 4 years ago
- ✨✨ [ICLR 2026] MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models☆43Apr 10, 2025Updated 10 months ago
- Deep learning application to identify ingredients from cooking dishes images and recommend dishes to cook, given a set of ingredients. Th…☆32Jan 26, 2023Updated 3 years ago
- ☆11Aug 11, 2023Updated 2 years ago
- Official implementation of CMMCoT: Enhancing Complex Multi-Image Comprehension via Multi-Modal Chain-of-Thought and Memory Augmentation☆12Dec 5, 2025Updated 2 months ago
- TACS: Taxonomy Adaptive Cross-Domain Semantic Segmentation☆12Jul 14, 2022Updated 3 years ago
- ☆10Oct 24, 2022Updated 3 years ago
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- Activity Grammars for Temporal Action Segmentation (NeurIPS 2023)☆14Jun 14, 2024Updated last year
- Unlocking the Essence of Beauty: Advanced Aesthetic Reasoning with Relative-Absolute Policy Optimization☆21Jan 27, 2026Updated last month
- [NeurIPS 2025] This is the official repository for "RAD: Towards Trustworthy Retrieval-Augmented Multi-modal Clinical Diagnosis"☆26Nov 21, 2025Updated 3 months ago
- Open Source Speech Inferencing Libary for Indic Languages☆13Apr 11, 2022Updated 3 years ago
- Measure the diversity of image descriptions, repository for our COLING 2018 paper.☆13Dec 29, 2019Updated 6 years ago
- Implementation of PPO for CartPole-v1☆10Jan 1, 2019Updated 7 years ago
- Evaluation metrics and submission file creation scripts the Action Recognition challenge☆15Feb 9, 2026Updated 2 weeks ago
- Applying Deep Reinforcement Learning for dialogue generation. aka chatbot☆13Apr 30, 2017Updated 8 years ago
- ☆10Oct 17, 2022Updated 3 years ago
- EmoCapCLIP: Learning Transferable Facial Emotion Representations from Large-Scale Semantically Rich Captions☆20Jul 29, 2025Updated 7 months ago
- ☆11Jan 29, 2023Updated 3 years ago
- KGML for EMNLP 2021☆10Feb 2, 2022Updated 4 years ago
- Source code for ASRU 2019 paper "Adapting Pretrained Transformer to Lattices for Spoken Language Understanding"☆10Jul 8, 2020Updated 5 years ago
- ☆12Jan 10, 2025Updated last year
- ☆11May 7, 2022Updated 3 years ago
- ☆13May 12, 2025Updated 9 months ago
- Spectral Clustering in C++☆17Jan 8, 2013Updated 13 years ago
- ☆21Feb 13, 2026Updated 2 weeks ago
- [Neurocomputing] EmoVerse: Enhancing Multimodal Large Language Models for Affective Computing via Multitask Learning☆16Jul 6, 2025Updated 7 months ago
- ICCV 2023 - AdaptGuard: Defending Against Universal Attacks for Model Adaptation☆11Dec 23, 2023Updated 2 years ago
- Convolutional Fine-Grained Classification with Self-Supervised Target Relation Regularization (TIP 2022)☆12Sep 8, 2022Updated 3 years ago
- [ICCV2023] Spatio-temporal Prompting Network for Robust Video Feature Extraction☆10Aug 17, 2023Updated 2 years ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆16Dec 3, 2024Updated last year
- Learning Pytorch☆13Jun 12, 2018Updated 7 years ago