☆17 · Aug 11, 2023 · Updated 2 years ago
Alternatives and similar repositories for PSTP-Net
Users who are interested in PSTP-Net compare it to the repositories listed below
- MUSIC-AVQA, CVPR2022 (ORAL) · ☆96 · Dec 30, 2022 · Updated 3 years ago
- Multi-Scale Attention for Audio Question Answering · ☆28 · Jul 19, 2023 · Updated 2 years ago
- Towards Long Form Audio-visual Video Understanding · ☆15 · Jan 16, 2026 · Updated last month
- ACM MM 2022 paper_AVQA: A Dataset for Audio-Visual Question Answering on Videos · ☆16 · Aug 17, 2023 · Updated 2 years ago
- ☆13 · Feb 26, 2024 · Updated 2 years ago
- Official Repository for "Learning Trimodal Relation for Audio-Visual Question Answering with Missing Modality" (ECCV 2024) · ☆16 · Oct 29, 2024 · Updated last year
- 🏠🔍 Auto check for new apartments in Hamburg from various real estate providers · ☆16 · Jun 2, 2024 · Updated last year
- ☆14 · Nov 13, 2023 · Updated 2 years ago
- Official Repository for "Audio-Visual Spatial Integration and Recursive Attention for Robust Sound Source Localization" (ACM MM 2023) · ☆18 · Nov 14, 2023 · Updated 2 years ago
- [CVPR 2025] Crab: A Unified Audio-Visual Scene Understanding Model with Explicit Cooperation · ☆80 · Dec 24, 2025 · Updated 2 months ago
- Official implementation for MGN · ☆20 · Dec 22, 2022 · Updated 3 years ago
- ☆34 · Sep 29, 2024 · Updated last year
- Repository related to Cranfield's AAI MSCs GDP · ☆11 · Apr 8, 2023 · Updated 2 years ago
- ☆10 · Feb 10, 2026 · Updated 2 weeks ago
- ☆36 · Jul 9, 2025 · Updated 7 months ago
- LED : Light Enhanced Depth Estimation at Night · ☆13 · Dec 9, 2025 · Updated 2 months ago
- ☆10 · Nov 15, 2023 · Updated 2 years ago
- ☆12 · Jun 26, 2024 · Updated last year
- P1AC: Revisiting Absolute Pose From a Single Affine Correspondence · ☆11 · Mar 19, 2024 · Updated last year
- ☆13 · May 21, 2024 · Updated last year
- ☆18 · Dec 8, 2024 · Updated last year
- ☆10 · Aug 29, 2023 · Updated 2 years ago
- [ICCV2023] DR-Tune: Improving Fine-tuning of Pretrained Visual Models by Distribution Regularization with Semantic Calibration · ☆12 · Oct 12, 2023 · Updated 2 years ago
- ☆10 · Nov 21, 2023 · Updated 2 years ago
- Code for paper: "RemovalNet: DNN model fingerprinting removal attack", IEEE TDSC 2023. · ☆10 · Nov 27, 2023 · Updated 2 years ago
- ☆13 · Jan 13, 2025 · Updated last year
- ☆10 · Aug 24, 2023 · Updated 2 years ago
- ☆12 · Mar 5, 2025 · Updated 11 months ago
- A simple and efficient baseline for data attribution · ☆11 · Nov 10, 2023 · Updated 2 years ago
- YOLO for Uniform Directed Object detection · ☆13 · Mar 28, 2024 · Updated last year
- ☆10 · Jan 6, 2020 · Updated 6 years ago
- ☆10 · Oct 4, 2023 · Updated 2 years ago
- ☆12 · Jun 9, 2025 · Updated 8 months ago
- ☆10 · Apr 17, 2024 · Updated last year
- Dataset for bounding box labels and terrain meshes of the POLAR database · ☆10 · Jul 10, 2025 · Updated 7 months ago
- Vision Transformers are Parameter-Efficient Audio-Visual Learners · ☆106 · Aug 11, 2023 · Updated 2 years ago
- ☆14 · Jun 20, 2023 · Updated 2 years ago
- ☆13 · Feb 28, 2024 · Updated 2 years ago
- ViT models pretrained with up to ~5k hours of human-like video data · ☆14 · Aug 10, 2023 · Updated 2 years ago