GeWu-Lab/PSTP-Net

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/GeWu-Lab/PSTP-Net)

GeWu-Lab / PSTP-Net

☆17

Alternatives and similar repositories for PSTP-Net

Users that are interested in PSTP-Net are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

GeWu-Lab / TSPM
View on GitHub
Official repository for "Boosting Audio Visual Question Answering via Key Semantic-Aware Cues" in ACM MM 2024.
☆17Oct 25, 2024Updated last year
GeWu-Lab / MUSIC-AVQA
View on GitHub
MUSIC-AVQA, CVPR2022 (ORAL)
☆100Dec 30, 2022Updated 3 years ago
fyyCS / LSLD
View on GitHub
☆14Nov 13, 2023Updated 2 years ago
AlyssaYoung / AVQA
View on GitHub
ACM MM 2022 paper_AVQA: A Dataset for Audio-Visual Question Answering on Videos
☆15Aug 17, 2023Updated 2 years ago
MGitHubL / TMac
View on GitHub
☆14Feb 26, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
stoneMo / MGN
View on GitHub
Official implementation for MGN
☆20Dec 22, 2022Updated 3 years ago
mira-ai-lab / MUSIC-AVQA-R
View on GitHub
☆13May 21, 2024Updated 2 years ago
schowdhury671 / meerkat
View on GitHub
☆35Jul 9, 2025Updated last year
weiguoPian / AV-CIL_ICCV2023
View on GitHub
[ICCV 2023] Audio-Visual Class-Incremental Learning
☆35Sep 29, 2024Updated last year
GeWu-Lab / BML_TPAMI2024
View on GitHub
The repo for "On-the-fly Modulation for Balanced Multimodal Learning", T-PAMI 2024
☆19Sep 29, 2024Updated last year
advanc3dUA / WohnungSuchen
View on GitHub
🏠🔍 Auto check for new apartments in Hamburg from various real estate provides
☆16Apr 15, 2026Updated 3 months ago
liuxubo717 / LASS-demopage
View on GitHub
☆19Sep 2, 2022Updated 3 years ago
Chunmian-art / City-3DQA
View on GitHub
☆23Apr 19, 2024Updated 2 years ago
GenjiB / LAVISH
View on GitHub
Vision Transformers are Parameter-Efficient Audio-Visual Learners
☆106Aug 11, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
jasongief / OV-AVEL
View on GitHub
[2025 CVPR] Towards Open-Vocabulary Audio-Visual Event Localization
☆46Mar 7, 2025Updated last year
Zhang-VISLab / NeurIPS2023-InfoCD
View on GitHub
The official repository of the paper "InfoCD: A Contrastive Chamfer Distance Loss for Point Cloud Completion" published at NeurIPS 2023
☆23Oct 13, 2023Updated 2 years ago
StanfordVL / Sonicverse
View on GitHub
☆22Mar 18, 2023Updated 3 years ago
JHome1 / GiO-GiT
View on GitHub
☆18Sep 29, 2025Updated 9 months ago
liuxubo717 / sound_generation
View on GitHub
Code and generated sounds for "Conditional Sound Generation Using Neural Discrete Time-Frequency Representation Learning", MLSP 2021
☆69Sep 3, 2021Updated 4 years ago
shincling / discreteSeparation
View on GitHub
The demo for "Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem".
☆12Oct 25, 2021Updated 4 years ago
ktatar / rawaudiovae
View on GitHub
☆12Jun 9, 2025Updated last year
GeWu-Lab / awesome-audiovisual-learning
View on GitHub
A curated list of audio-visual learning methods and datasets.
☆288Dec 3, 2024Updated last year
YapengTian / AVE-ECCV18
View on GitHub
Audio-Visual Event Localization in Unconstrained Videos, ECCV 2018
☆210Apr 3, 2021Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Mr-Neko / JM3D
View on GitHub
The offical implemention of JM3D.
☆31Apr 8, 2026Updated 3 months ago
iLearn-Lab / MM23-RTQ
View on GitHub
ACM Multimedia 2023 (Oral) - RTQ: Rethinking Video-language Understanding Based on Image-text Model
☆15Apr 7, 2026Updated 3 months ago
ederwander / Beat-Track
View on GitHub
☆15Aug 31, 2015Updated 10 years ago
GaussianCube / GaussianCube_Construction
View on GitHub
☆15Jun 13, 2024Updated 2 years ago
qywu / FaceChat
View on GitHub
☆15Feb 28, 2023Updated 3 years ago
teenageengineering / Lasp
View on GitHub
Low-latency Audio Signal Processing plugin for Unity
☆12Dec 25, 2021Updated 4 years ago
JJJYmmm / Pix2SeqV2-Pytorch
View on GitHub
Simple Implementation of Pix2seqV2(multi-task)
☆26Dec 16, 2024Updated last year
pierrecouprie / MotusLabTool
View on GitHub
MotusLabTool is a software developed to record acousmatic music interpretation.
☆14Nov 9, 2025Updated 8 months ago
adaptive-intelligent-robotics / AURORA
View on GitHub
Repository hosting the code associated with "Unsupervised Behaviour Discovery with Quality-Diversity Optimisation"
☆16Jun 14, 2021Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
chriskiefer / libcccrt
View on GitHub
A collection of signal analysis functions related to complexity, chaos and causality. Optimised for realtime signal processing.
☆18Jan 15, 2024Updated 2 years ago
audiocontentanalysis / conferences
View on GitHub
MIR conference deadline countdowns
☆11Jul 14, 2026Updated last week
liuxubo717 / V-ACT
View on GitHub
Visually-Aware Audio Captioning
☆43Mar 3, 2023Updated 3 years ago
ayesha-ishaq / Open3DTrack
View on GitHub
Code for Open3DTrack: Towards Open-Vocabulary 3D Multi-Object Tracking
☆34Mar 14, 2025Updated last year
zdyshine / beat_track_mgtv_baseline
View on GitHub
☆16Jul 20, 2021Updated 5 years ago
RoyiRa / GRADE-Quantifying-sample-diversity-in-text-to-image-models
View on GitHub
☆12Mar 5, 2025Updated last year
cyh-0 / CAVP
View on GitHub
Official code for "A Closer Look at Audio-Visual Segmentation"
☆97Oct 31, 2025Updated 8 months ago