YYX666660/LAVSS

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/YYX666660/LAVSS)

YYX666660 / LAVSS

Code for LAVSS: Location-Guided Audio-Visual Spatial Audio Separation

☆19

Alternatives and similar repositories for LAVSS

Users that are interested in LAVSS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

YapengTian / CCOL-CVPR21
View on GitHub
Cyclic Co-Learning of Sounding Object Visual Grounding and Sound Separation
☆26Nov 24, 2021Updated 4 years ago
SheldonTsui / SepStereo_ECCV2020
View on GitHub
Codebase for the paper "Sep-Stereo: Visually Guided Stereophonic Audio Generation by Associating Source Separation" (ECCV2020)
☆72Oct 20, 2020Updated 5 years ago
pedro-morgado / AVSpatialAlignment
View on GitHub
☆31Jun 14, 2022Updated 4 years ago
facebookresearch / learning-audio-visual-dereverberation
View on GitHub
Code for paper Learning Audio-Visual Dereverberation
☆32Aug 10, 2022Updated 3 years ago
SAGNIKMJR / move2hear-active-AV-separation
View on GitHub
Code and datasets for 'Move2Hear: Active Audio-Visual Source Separation' (ICCV 2021)
☆16Jun 17, 2026Updated last month
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
facebookresearch / real-acoustic-fields
View on GitHub
Real Acoustic Fields An Audio-Visual Room Acoustics Dataset and Benchmark
☆64Aug 29, 2024Updated last year
IFICL / stereocrw
View on GitHub
Code for the Paper: [ECCV2022] Sound Localization by Self-Supervised Time-Delay Estimation
☆28Mar 15, 2023Updated 3 years ago
SheldonTsui / PseudoBinaural_CVPR2021
View on GitHub
Codebase for the paper "Visually Informed Binaural Audio Generation without Binaural Audios" (CVPR 2021)
☆72Jul 8, 2021Updated 5 years ago
WikiChao / DAVIS
View on GitHub
[🏆 IJCV 2025 & ACCV 2024 Best Paper Honorable Mention] Official pytorch implementation of the paper "High-Quality Visually-Guided Sound …
☆33Mar 30, 2026Updated 3 months ago
fotisdr / DNN-HA
View on GitHub
DNN-based hearing aid for real-time sound processing
☆25May 25, 2023Updated 3 years ago
partha2409 / DCASE2025_seld_baseline
View on GitHub
☆27May 27, 2025Updated last year
SAGNIKMJR / ego-AV-spatial-correspondence
View on GitHub
[CVPR 2024] Code and datasets for 'Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos'
☆14Jun 16, 2024Updated 2 years ago
JiabenChen / iQuery
View on GitHub
[CVPR 2023] iQuery: Instruments as Queries for Audio-Visual Sound Separation
☆73Jul 25, 2023Updated 3 years ago
anton-jeran / AV-RIR
View on GitHub
Audio-Visual Room Impulse Response Estimation
☆25Jul 22, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Yu-Wu / Modaily-Aware-Audio-Visual-Video-Parsing
View on GitHub
Code for CVPR 2021 paper Exploring Heterogeneous Clues for Weakly-Supervised Audio-Visual Video Parsing
☆24Dec 29, 2021Updated 4 years ago
TeaPoly / PLCPA-ASYM-Loss
View on GitHub
The power-law compressed phase-aware asymmetric (PLCPA-ASYM) loss
☆15Sep 4, 2023Updated 2 years ago
buptexplorers / OFB-VR
View on GitHub
☆12Mar 17, 2020Updated 6 years ago
ubc-vision / TriBERT
View on GitHub
Code Release for the paper "TriBERT: Full-body Human-centric Audio-visual Representation Learning for Visual Sound Separation" in NeurIPS…
☆14Dec 9, 2021Updated 4 years ago
facebookresearch / 2.5D-Visual-Sound
View on GitHub
2.5D visual sound
☆121Jul 25, 2023Updated 3 years ago
jinbae-s / ACVIS
View on GitHub
[ICASSP 2026] The official pytorch implementation of ACVIS
☆15Jan 19, 2026Updated 6 months ago
zeroone-universe / TowardsRobustSpeechSR
View on GitHub
Unofficial Pytorch Lightning Implementation of "Towards Robust Speech Super-Resolution"
☆10May 8, 2023Updated 3 years ago
V-Sense / 360AudioVisual
View on GitHub
This repository contains materials for the paper: Towards generating ambisonics using audio-visual cue for virtual reality
☆13Jul 2, 2019Updated 7 years ago
kaiw7 / STG-CMA
View on GitHub
Towards Efficient Audio-Visual Learners via Empowering Pre-trained Vision Transformers with Cross-Modal Adaptation
☆15Apr 13, 2024Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
facebookresearch / FAIR-Play
View on GitHub
2.5D visual sound dataset
☆108Sep 21, 2021Updated 4 years ago
StevenHickson / CreateNormals
View on GitHub
☆11Nov 22, 2019Updated 6 years ago
Crystalsound / FRN
View on GitHub
☆28Apr 17, 2023Updated 3 years ago
stoneMo / OneAVM
View on GitHub
Official Codebase of "A Unified Audio-Visual Learning Framework for Localization, Separation, and Recognition" (ICML 2023)
☆12Jun 1, 2023Updated 3 years ago
tljxyys / RDSTN_ultrasound
View on GitHub
[ICASSP2024] This repo holds the code for work "Residual Dense Swin Transformer for Continuous Depth-Independent Ultrasound Imaging"
☆19Apr 10, 2024Updated 2 years ago
Ego4DSounds / Ego4DSounds
View on GitHub
Ego4DSounds: A diverse egocentric dataset with high action-audio correspondence
☆21Jun 14, 2024Updated 2 years ago
Gaiejj / align-anything
View on GitHub
☆16Nov 11, 2025Updated 8 months ago
vvvb-github / AVSegFormer
View on GitHub
[AAAI 2024] AVSegFormer: Audio-Visual Segmentation with Transformer
☆74Mar 6, 2025Updated last year
fengpeng-yue / ASRTTS
View on GitHub
ASR & TTS joint training, asr, tts, machine speech chain
☆16Oct 16, 2021Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
jaeyeonkim99 / visage
View on GitHub
Official implementation of "ViSAGe: Video-to-Spatial AUdio Generation" (ICLR 2025)
☆47Sep 10, 2025Updated 10 months ago
BASHLab / OWL
View on GitHub
☆15May 25, 2026Updated 2 months ago
stoneMo / EZ-VSL
View on GitHub
Official Codebase of "Localizing Visual Sounds the Easy Way" (ECCV 2022)
☆42Oct 2, 2022Updated 3 years ago
longrongyang / STGC
View on GitHub
Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model
☆13Feb 11, 2025Updated last year
AmandineBtto / NeRAF
View on GitHub
[ICLR 2025] NeRAF jointly learns acoustic and radiance fields, enabling realistic audio-visual generation.
☆37Mar 11, 2026Updated 4 months ago
KawhiZhao / Egocentric-Audio-Visual-Speaker-Localization
View on GitHub
Code for paper Audio Visual Speaker Localization from EgoCentric Views
☆11Jul 3, 2024Updated 2 years ago
FannyChao / AVS360_audiovisual_saliency_360
View on GitHub
Towards Audio-Visual Saliency Prediction for Omnidirectional Video with Spatial Audio
☆20Dec 28, 2021Updated 4 years ago