liyidi/soundnet_localize_sound_source

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/liyidi/soundnet_localize_sound_source)

liyidi / soundnet_localize_sound_source

soundnet and localize sound source

☆12

Alternatives and similar repositories for soundnet_localize_sound_source

Users that are interested in soundnet_localize_sound_source are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ardasnck / learning_to_localize_sound_source
View on GitHub
Codebase and Dataset for the paper: Learning to Localize Sound Source in Visual Scenes
☆102Dec 4, 2024Updated last year
longtaoge / ViedioTest
View on GitHub
视频抽帧
☆13Aug 30, 2015Updated 10 years ago
seorim0 / ResUNet-LC
View on GitHub
2D residual U-Net (ResUNet) and a lead combiner (LC) for 12-lead ECG Abnormality Classification
☆15Jan 4, 2024Updated 2 years ago
FloretCat / CMRAN
View on GitHub
Cross-Modal Relation-Aware Networks for Audio-Visual Event Localization， ACM MM 2020
☆33Nov 6, 2020Updated 5 years ago
ca-joe-yang / OilPainting
View on GitHub
☆12Apr 26, 2018Updated 8 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
jasongief / PSP_CVPR_2021
View on GitHub
[2021 CVPR] Positive Sample Propagation along the Audio-Visual Event Line
☆42Jul 5, 2022Updated 4 years ago
RxAI-dev / rxlm
View on GitHub
Reactive AI - RxLM: Reactive Language Models - training and inference framework. Part of RxNN Platform Ecosystem. Licensed under custom "…
☆25May 27, 2026Updated last month
denfed / heartheflow
View on GitHub
Repository for the 2023 WACV paper: "Hear The Flow: Optical Flow-Based Self-Supervised Visual Sound Source Localization"
☆12Dec 21, 2022Updated 3 years ago
sasi433 / Sound-source-localization
View on GitHub
Developing an algorithm using MATLAB to detect the unknown location(coordinates) of a sound source in a closed room using a series of mic…
☆14Jan 10, 2018Updated 8 years ago
JyothiM93 / Applied-Signal-Processing-to-SoundSourceLocalisation
View on GitHub
Goal is to estimate the location of sound source using microphones array. LMS method is used to estimate time delays. Steepest descent al…
☆14Oct 27, 2017Updated 8 years ago
echocatzh / Demo-of-DeepComplexAEC
View on GitHub
☆11Jun 15, 2022Updated 4 years ago
chimechallenge / chime5-synchronisation
View on GitHub
CHiME-5 Baseline Array Synchronisation
☆12Sep 24, 2018Updated 7 years ago
JoaquinChou / DAMAS_FISTA_Net
View on GitHub
☆24Jul 12, 2024Updated 2 years ago
tanglang96 / particle-filter
View on GitHub
particle filter based object tracking
☆17Mar 9, 2020Updated 6 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
adrianSRoman / DeepWaveDOA
View on GitHub
ICASSP 2024: Robust DOA estimation from deep acoustic imaging
☆25Apr 14, 2024Updated 2 years ago
knowledgetechnologyuhh / gasp
View on GitHub
☆12Jun 2, 2025Updated last year
RenePotocnik / image-to-stereographic-projection
View on GitHub
Convert an image to stereographic projection (Polar Coordinates)
☆10Oct 15, 2022Updated 3 years ago
cwc1260 / HandFold
View on GitHub
☆33Mar 21, 2022Updated 4 years ago
massens / salnet-keras
View on GitHub
SalNet on Keras: A deep convolutional network for saliency prediction
☆11Jun 23, 2017Updated 9 years ago
shvdiwnkozbw / Multi-Source-Sound-Localization
View on GitHub
This repo aims to perform sound localization in complex audiovisual scenes, where there multiple objects making sounds.
☆96Oct 18, 2021Updated 4 years ago
rfalcon100 / seld_dcase2022_ric
View on GitHub
My system for the DCASE 2022 Task 3 Sound Event Localizaiton and Detection.
☆12Nov 12, 2022Updated 3 years ago
RangHo / webrtc-delay-estimation
View on GitHub
Delay estimation logic extracted from WebRTC
☆18Jan 11, 2021Updated 5 years ago
KevinToodlepoot / MS-Exciter
View on GitHub
A JUCE based stereo expander and harmonic exciter using Mid-Side processing and tube-based distortion for increased stereo width and harm…
☆18Nov 21, 2024Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
aurooj / WeakGroundedVQA_Capsules
View on GitHub
☆18Apr 10, 2023Updated 3 years ago
pseeth / soundnet_keras
View on GitHub
SoundNet, built in Keras with pre-trained 8-layer model.
☆29Oct 15, 2019Updated 6 years ago
bubaimaji / cmt-mser
View on GitHub
"MULTIMODAL EMOTION RECOGNITION BASED ON DEEP TEMPORAL FEATURES USING CROSS-MODAL TRANSFORMER AND SELF-ATTENTION" ICASSP'23
☆24Feb 26, 2023Updated 3 years ago
I2-Multimedia-Lab / 360-video-experimental-platform
View on GitHub
(TMM 2022)Quality Assessment for Omnidirectional Video: A Spatio-Temporal Distortion Modeling Approach
☆12Jun 16, 2021Updated 5 years ago
tjdevWorks / TEASEL
View on GitHub
☆26May 8, 2022Updated 4 years ago
zhangkao / IIP_STRNN_Saliency
View on GitHub
A Spatial-Temporal Recurrent Neural Network for Video Saliency Prediction (TIP2021)
☆13Jul 7, 2022Updated 4 years ago
pgalatic / fast-artistic-videos-pytorch
View on GitHub
Fast Artistic Videos in pyTorch
☆14Oct 3, 2023Updated 2 years ago
GeraldHan / GGE
View on GitHub
Code for Greedy Gradient Ensemble for Visual Question Answering （ICCV 2021, Oral）
☆27Mar 28, 2022Updated 4 years ago
William1617 / dtln_aec
View on GitHub
☆24Mar 18, 2024Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
YapengTian / AVE-ECCV18
View on GitHub
Audio-Visual Event Localization in Unconstrained Videos, ECCV 2018
☆210Apr 3, 2021Updated 5 years ago
yangyi0818 / DPARNet
View on GitHub
Dual-Path Attention and Recurrent Network for speech separation
☆22Sep 12, 2024Updated last year
edurnebernal / SST-Sal
View on GitHub
SST-Sal: A spherical spatio-temporal approach for saliency prediction in 360º videos
☆15Aug 31, 2023Updated 2 years ago
xianyuzinc / DAMAS_code
View on GitHub
本项目实现了一个完整的声源定位与声压级分析系统，包括波束形成、DAMAS系列算法以及FISTA算法等多种声源定位方法。系统能够处理多频率声源信号，生成声源定位图像，并分析不同方法下的声压级特性.
☆34Jan 10, 2025Updated last year
SWPark92 / SphereGAN
View on GitHub
☆14Aug 8, 2019Updated 6 years ago
cozcinar / 360_Audio_Visual_ICMEW2020
View on GitHub
Audio-Visual Perception of Omnidirectional Video for Virtual Reality Applications
☆15Feb 22, 2023Updated 3 years ago
FannyChao / MV-SalGAN360
View on GitHub
The improved version of our previous work SalGAN360 which predict visual saliency on 360° image
☆16Jan 19, 2021Updated 5 years ago