denfed/heartheflow

Repository for the 2023 WACV paper: "Hear The Flow: Optical Flow-Based Self-Supervised Visual Sound Source Localization"

☆12

Alternatives and similar repositories for heartheflow

Users who are interested in heartheflow are comparing it to the repositories listed below.

denfed / wave-spec-fusion
Code for the submitted 2021 DCASE Workshop paper: "Waveforms and Spectrograms: Enhancing Acoustic Scene Classification Using Multimodal F…
☆16 · Aug 9, 2021 · Updated 4 years ago
IFICL / SLfM
Official code for the paper: [ICCV2023] Sound Localization from Motion: Jointly Learning Sound Direction and Camera Rotation
☆43 · Updated this week
YYX666660 / LAVSS
Code for LAVSS: Location-Guided Audio-Visual Spatial Audio Separation
☆19 · Feb 25, 2025 · Updated last year
IsraelCohenLab / ConstantBeamwidthUCCA
☆11 · Jun 6, 2022 · Updated 4 years ago
VisualAIKHU / NoPrior_MultiSSL
Official Repository for "Learning to Visually Localize Sound Sources from Mixtures without Prior Source Knowledge" (CVPR 2024)
☆16 · Sep 1, 2024 · Updated last year
hche11 / Localizing-Visual-Sounds-the-Hard-Way
Localizing Visual Sounds the Hard Way
☆84 · Jul 6, 2022 · Updated 4 years ago
crlandsc / Model-based-Bayesian-DoA-Analysis-for-Sound-Sources-Using-a-Spherical-Microphone-Array
A machine learning algorithm that estimates the directions of arrival and relative levels of an arbitrary number of sound sources using r…
☆12 · Dec 10, 2022 · Updated 3 years ago
OpenNLPLab / FNAC_AVL
[CVPR 2023] Official implementation of our paper - Learning Audio-Visual Source Localization via False Negative Aware Contrastive Learnin…
☆29 · Apr 10, 2023 · Updated 3 years ago
yangyi0818 / DOA-estimation-with-a-stacked-self-attention-network
A stacked self-attention network for two-dimensional direction-of-arrival estimation in hands-free speech communication
☆12 · Sep 12, 2024 · Updated last year
VisualAIKHU / SAMPD
Official Repository for "Multispectral Pedestrian Detection with Sparsely Annotated Label" (AAAI 2025)
☆32 · Apr 28, 2025 · Updated last year
cevers / sap_locata_eval
☆16 · Feb 6, 2020 · Updated 6 years ago
XuMengyaAmy / SwinMLP_TranCAP
☆13 · Jun 26, 2022 · Updated 4 years ago
StevenHickson / CreateNormals
☆11 · Nov 22, 2019 · Updated 6 years ago
GQBBBB / UCI
☆10 · Oct 5, 2023 · Updated 2 years ago
DA-MUSIC / DR-MUSIC_ICASSP23
☆14 · May 27, 2023 · Updated 3 years ago
longrongyang / STGC
Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model
☆13 · Feb 11, 2025 · Updated last year
cjddny / cocos2d_guardCarrot
cocos2d-x Carrot Fantasy (保卫萝卜), C++
☆11 · Mar 1, 2016 · Updated 10 years ago
KawhiZhao / Egocentric-Audio-Visual-Speaker-Localization
Code for the paper "Audio Visual Speaker Localization from EgoCentric Views"
☆11 · Jul 3, 2024 · Updated 2 years ago
LijunRio / A-Self-Guided-Framework
This repository contains the code accompanying the paper "A Self-Guided Framework for Radiology Report Generation", accepted by MICCAI 20…
☆20 · Mar 11, 2024 · Updated 2 years ago
metu-sparg / higrid
Hierarchical Grid Refinement (HiGRID): DOA Estimation using Rigid Spherical Microphone Arrays
☆14 · Apr 11, 2019 · Updated 7 years ago
lxa9867 / QSD
[CVPR 2024] "Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition"
☆12 · Feb 27, 2024 · Updated 2 years ago
cyh-0 / CAVP
Official code for "A Closer Look at Audio-Visual Segmentation"
☆97 · Oct 31, 2025 · Updated 8 months ago
wang-zhanyu / MSAT
Source code for the paper "A Medical Semantic-Assisted Transformer for Radiographic Report Generation"
☆25 · Jun 23, 2023 · Updated 3 years ago
zjsong / SSPL
PyTorch code for "Self-Supervised Predictive Learning: A Negative-Free Method for Sound Source Localization in Visual Scenes" (CVPR, 2022…
☆32 · Jul 8, 2024 · Updated 2 years ago
OpenNLPLab / ETSC-Exact-Toeplitz-to-SSM-Conversion
[EMNLP 2023] Official implementation of the algorithm ETSC: Exact Toeplitz-to-SSM Conversion, from our EMNLP 2023 paper - Accelerating Toeplitz…
☆14 · Oct 17, 2023 · Updated 2 years ago
SAGNIKMJR / move2hear-active-AV-separation
Code and datasets for 'Move2Hear: Active Audio-Visual Source Separation' (ICCV 2021)
☆16 · Jun 17, 2026 · Updated last month
marmoi / dcase2021_task1a_baseline
☆14 · Jun 9, 2021 · Updated 5 years ago
cvlab-kaist / UFC
☆12 · Mar 17, 2024 · Updated 2 years ago
FutureTwT / HMAH
The source code of "Teacher-Student Learning: Efficient Hierarchical Message Aggregation Hashing for Cross-Modal Retrieval." (Accepted by…
☆21 · Jun 7, 2022 · Updated 4 years ago
SonyResearch / dcase2025_stereo_seld_data_generator
Data generator for stereo sound event localization and detection task of DCASE 2025 challenge
☆17 · Jul 17, 2025 · Updated last year
jinxiang-liu / UFE-AVS
Official code for CVPR 2024 paper, "Audio-Visual Segmentation via Unlabeled Frame Exploitation"
☆19 · Jul 7, 2024 · Updated 2 years ago
FYJNEVERFOLLOWS / ResNet-STFT-SSL
ResNet-STFT Model for Sound Source Localization
☆20 · Aug 25, 2022 · Updated 3 years ago
zjukongming / TranSQ
MICCAI 22 accepted paper “TranSQ: Transformer-based Semantic Query for Medical Report Generation” for medical report generation
☆27 · Sep 3, 2025 · Updated 10 months ago
TangXu-Group / Cross-modal-remote-sensing-image-and-text-retrieval-models
☆22 · Sep 19, 2024 · Updated last year
guotaowang / STANet
☆16 · Sep 20, 2022 · Updated 3 years ago
rxtan2 / AVSeT
☆17 · Oct 2, 2023 · Updated 2 years ago
BingYang-20 / DP-RTF-Learning
A Python implementation of “Learning Deep Direct-Path Relative Transfer Function for Binaural Sound Source Localization” [TASLP 2021]
☆28 · Feb 11, 2023 · Updated 3 years ago
rmithyx / maximum-likelihood-DOA-estimation-method-in-the-spherical-harmonic-domain
A maximum likelihood direction of arrival estimation method for open-sphere microphone arrays in the spherical harmonic domain
☆26 · Jul 5, 2019 · Updated 7 years ago
Cu-OH-2 / computer-architecture-review
Review notes for "Computer Architecture", School of Software Engineering, Tongji University
☆12 · Jun 19, 2025 · Updated last year