IFICL/stereocrw

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/IFICL/stereocrw)

IFICL / stereocrw

Code for the Paper: [ECCV2022] Sound Localization by Self-Supervised Time-Delay Estimation

☆28

Alternatives and similar repositories for stereocrw

Users that are interested in stereocrw are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

IFICL / SLfM
View on GitHub
Official code for the paper: [ICCV2023] Sound Localization from Motion: Jointly Learning Sound Direction and Camera Rotation
☆43Updated this week
cvlab-columbia / paperbot
View on GitHub
PaperBot: Learning to Design Real-World Tools Using Paper
☆13Mar 15, 2024Updated 2 years ago
YYX666660 / LAVSS
View on GitHub
Code for LAVSS: Location-Guided Audio-Visual Spatial Audio Separation
☆19Feb 25, 2025Updated last year
tiangeluo / ShapeCompiler
View on GitHub
A Unified Framework for Transforming between Text, Point Cloud, and Program
☆19Jul 3, 2025Updated last year
stoneMo / AVGN
View on GitHub
Official implementation for AVGN
☆41Mar 24, 2023Updated 3 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
YapengTian / CCOL-CVPR21
View on GitHub
Cyclic Co-Learning of Sounding Object Visual Grounding and Sound Separation
☆26Nov 24, 2021Updated 4 years ago
SonyResearch / dcase2025_stereo_seld_data_generator
View on GitHub
Data generator for stereo sound event localization and detection task of DCASE 2025 challenge
☆17Jul 17, 2025Updated last year
Ego4DSounds / Ego4DSounds
View on GitHub
Ego4DSounds: A diverse egocentric dataset with high action-audio correspondence
☆21Jun 14, 2024Updated 2 years ago
HS-YN / PanoAVQA
View on GitHub
Official repository of PanoAVQA: Grounded Audio-Visual Question Answering in 360° Videos (ICCV 2021)
☆16Oct 12, 2021Updated 4 years ago
facebookresearch / viewseg
View on GitHub
Code for "Recognizing Scenes from Novel Viewpoints"
☆29Sep 16, 2022Updated 3 years ago
OpenNLPLab / MMVAE-AVS
View on GitHub
Multimodal Variational Auto-encoder based Audio-Visual Segmentation [ICCV2023].
☆20Sep 19, 2024Updated last year
danielkrause / Moving-Binaural-SDEL
View on GitHub
Implementation of the paper "Binaural Sound Source Distance Estimation and Localization for a Moving Listener"
☆22Mar 2, 2025Updated last year
hxixixh / mix-and-localize
View on GitHub
☆23Mar 20, 2024Updated 2 years ago
facebookresearch / learning-audio-visual-dereverberation
View on GitHub
Code for paper Learning Audio-Visual Dereverberation
☆32Aug 10, 2022Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
jinlinyi / 3DFIRES
View on GitHub
[CVPR 2024] 3DFIRES: Few Image 3D REconstruction for Scenes with Hidden Surfaces
☆26Mar 28, 2024Updated 2 years ago
haoheliu / ontology-aware-audio-tagging
View on GitHub
☆14Nov 22, 2022Updated 3 years ago
SheldonTsui / PseudoBinaural_CVPR2021
View on GitHub
Codebase for the paper "Visually Informed Binaural Audio Generation without Binaural Audios" (CVPR 2021)
☆72Jul 8, 2021Updated 5 years ago
XYPB / CondFoleyGen
View on GitHub
Official PyTorch implementation of "Conditional Generation of Audio from Video via Foley Analogies".
☆93Dec 8, 2023Updated 2 years ago
kleinfreund / balatrolator
View on GitHub
Balatro calculator
☆17Jul 10, 2026Updated last week
IFICL / images-that-sound
View on GitHub
Official repo for Images that sound: a special spectrogram that can be seen as images and played as sound generated by diffusions
☆252Updated this week
spatUV / fs-gcc
View on GitHub
Frequency-Sliding Generalized Cross Correlation for Time Delay Estimation
☆33Mar 24, 2020Updated 6 years ago
robot-learning-freiburg / MM-DistillNet
View on GitHub
PyTorch code for training MM-DistillNet for multimodal knowledge distillation. http://rl.uni-freiburg.de/research/multimodal-distill
☆57May 17, 2021Updated 5 years ago
jasonbian97 / flowwalk
View on GitHub
Implementation for flowwalk
☆33Mar 27, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
mbanani / byoc
View on GitHub
[ICCV 2021 - Oral] Bootstrap Your Own Correspondences
☆41Dec 10, 2021Updated 4 years ago
nileshkulkarni / scene_drdf
View on GitHub
Website and Code for Directed Ray Distance Functions for 3D Scene Reconstruction
☆38Sep 13, 2023Updated 2 years ago
vTAD2025-Challenge / vTAD
View on GitHub
☆16Oct 24, 2025Updated 8 months ago
zinengtang / TVLT
View on GitHub
PyTorch code for “TVLT: Textless Vision-Language Transformer” (NeurIPS 2022 Oral)
☆127Feb 24, 2023Updated 3 years ago
Gaiejj / align-anything
View on GitHub
☆16Nov 11, 2025Updated 8 months ago
jagabandhumishra / IEEE-Summer-School
View on GitHub
☆11Aug 3, 2021Updated 4 years ago
facebookresearch / replay_dataset
View on GitHub
Download scripts and tools for Replay dataset.
☆39Jun 23, 2023Updated 3 years ago
MuSAELab / amplitude-modulation-analysis-matlab
View on GitHub
Amplitude Modulation Analysis Toolbox for MATLAB / Octave
☆19Sep 30, 2022Updated 3 years ago
kdexd / coco-rem
View on GitHub
Code for the paper "Benchmarking Object Detectors with COCO: A New Path Forward."
☆36Jul 13, 2024Updated 2 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
sony / audio-visual-seld-dcase2023
View on GitHub
Baseline method for audio-visual sound event localization and detection task of DCASE 2023 challenge
☆68Mar 19, 2025Updated last year
guxm2021 / ALT_SpeechBrain
View on GitHub
[ISMIR 2022] Transfer Learning of wav2vec 2.0 for Automatic Lyric Transcription
☆51May 7, 2024Updated 2 years ago
articulatory / articulatory
View on GitHub
Deep Articulatory Synthesis and Inversion
☆57Feb 14, 2024Updated 2 years ago
aromanusc / SoundQ
View on GitHub
Enhanced sound event localization and detection in real 360-degree audio-visual soundscapes (DCASE task3 format)
☆14Mar 21, 2025Updated last year
cevers / sap_locata_io
View on GitHub
☆18Jan 31, 2020Updated 6 years ago
liuhuadai / AudioLCM
View on GitHub
PyTorch Implementation of [AudioLCM]: a efficient and high-quality text-to-audio generation with latent consistency model.
☆13Jun 15, 2024Updated 2 years ago
zaocan666 / DyViSE
View on GitHub
Dynamic vision-guided speaker embedding for audio-visual speaker diarization
☆12Jul 5, 2022Updated 4 years ago