karreny/telling-left-from-right

Project website for "Telling left from right: Learning spatial correspondence between sight and sound"

☆29

Alternatives and similar repositories for telling-left-from-right

Users who are interested in telling-left-from-right compare it to the repositories listed below.

KranthiKumarR / Localize-to-Binauralize
Localize to Binauralize: Audio Spatialization from Visual Sound Source Localization (ICCV 2021)
☆10 · Oct 11, 2021 · Updated 4 years ago
pedro-morgado / AVSpatialAlignment
☆31 · Jun 14, 2022 · Updated 4 years ago
MRSAudio / MRSAudio_Main
MRSAudio: A Large-Scale Multimodal Recorded Spatial Audio Dataset with Refined Annotations
☆43 · Oct 15, 2025 · Updated 9 months ago
rhgao / co-separation
Co-Separating Sounds of Visual Objects (ICCV 2019)
☆98 · Jul 25, 2023 · Updated 3 years ago
facebookresearch / FAIR-Play
2.5D visual sound dataset
☆108 · Sep 21, 2021 · Updated 4 years ago
ardasnck / learning_to_localize_sound_source
Codebase and Dataset for the paper: Learning to Localize Sound Source in Visual Scenes
☆102 · Dec 4, 2024 · Updated last year
fallonchen / ismir-klio
Code supporting the ISMIR 2020 Klio Tutorial
☆20 · Oct 11, 2020 · Updated 5 years ago
SheldonTsui / SepStereo_ECCV2020
Codebase for the paper "Sep-Stereo: Visually Guided Stereophonic Audio Generation by Associating Source Separation" (ECCV2020)
☆72 · Oct 20, 2020 · Updated 5 years ago
hohsiangwu / rethinking-visual-sound-localization
Official implementation of the paper How to Listen? Rethinking Visual Sound Localization
☆18 · Apr 25, 2022 · Updated 4 years ago
SVDDChallenge / CtrSVDD_Utils
☆18 · Jan 10, 2024 · Updated 2 years ago
yzyouzhang / Empirical-Channel-CM
Official Implementation of our Interspeech 2021 paper "An Empirical Study on Channel Effects for Synthetic Voice Spoofing Countermeasure …
☆19 · Feb 15, 2022 · Updated 4 years ago
stoneMo / MGN
Official implementation for MGN
☆20 · Dec 22, 2022 · Updated 3 years ago
SarthakYadav / axlstm-official
Official repository for the paper "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs"
☆21 · Sep 7, 2025 · Updated 10 months ago
hearbenchmark / hear-eval-kit
Evaluation kit for the HEAR Benchmark
☆65 · Feb 12, 2026 · Updated 5 months ago
sonalkum / MMAUPro
Official repo for MMAU-Pro Benchmark
☆22 · Sep 25, 2025 · Updated 10 months ago
uark-cviu / Right2Talk
[ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach
☆20 · Aug 2, 2021 · Updated 4 years ago
kaist-ami / AVHBench
[ICLR'25] Official repository for "AVHBench: A Cross-Modal Hallucination Evaluation for Audio-Visual Large Language Models"
☆25 · Mar 8, 2026 · Updated 4 months ago
pedro-morgado / spatialaudiogen
Spatial Audio Generation
☆117 · Mar 24, 2023 · Updated 3 years ago
google-deepmind / slowfast_nfnets
☆30 · Jun 22, 2022 · Updated 4 years ago
afourast / avobjects
Implementation for ECCV20 paper "Self-Supervised Learning of audio-visual objects from video"
☆114 · Nov 16, 2020 · Updated 5 years ago
yongyizang / SingFake
Official Repository for "SingFake: Singing Voice Deepfake Detection"
☆64 · Feb 26, 2024 · Updated 2 years ago
fundwotsai2001 / Text-to-Music_control_family
Containing SOTA methods that follows time-varying conditions for Text-to-Music
☆24 · Jan 1, 2026 · Updated 6 months ago
midas-research / speechmix
☆12 · Oct 2, 2020 · Updated 5 years ago
SVDDChallenge / CtrSVDD2024_Baseline
Baseline system for SVDD 2024 Challenge CtrSVDD track
☆29 · Nov 16, 2024 · Updated last year
Jungjee / ASVspoof_PA
☆24 · Jun 28, 2019 · Updated 7 years ago
facebookresearch / sound-spaces
A first-of-its-kind acoustic simulation platform for audio-visual embodied AI research. It supports training and evaluating multiple task…
☆468 · Sep 29, 2023 · Updated 2 years ago
lucidrains / CLAP
Contrastive Language-Audio Pretraining
☆15 · May 18, 2021 · Updated 5 years ago
lxa9867 / QSD
[CVPR 2024] "Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition"
☆12 · Feb 27, 2024 · Updated 2 years ago
hearbenchmark / hear2021-submitted-models
Open-source audio embedding models, submitted to the HEAR 2021 challenge
☆11 · Feb 15, 2026 · Updated 5 months ago
XYPB / CondFoleyGen
Official PyTorch implementation of "Conditional Generation of Audio from Video via Foley Analogies".
☆93 · Dec 8, 2023 · Updated 2 years ago
bernardo-torres / linear-autoencoders
Official code and pretrained models for Linear Consistency Autoencoders (Lin-CAE), a method to induce linearity in audio autoencoders via…
☆17 · Feb 12, 2026 · Updated 5 months ago
zjsong / SSPL
PyTorch code for "Self-Supervised Predictive Learning: A Negative-Free Method for Sound Source Localization in Visual Scenes" (CVPR, 2022…
☆32 · Jul 8, 2024 · Updated 2 years ago
haoheliu / ontology-aware-audio-tagging
☆14 · Nov 22, 2022 · Updated 3 years ago
MGitHubL / TMac
☆14 · Feb 26, 2024 · Updated 2 years ago
yzyouzhang / AIR-ASVspoof
Official implementation of the SPL paper "One-class Learning Towards Synthetic Voice Spoofing Detection"
☆140 · Aug 30, 2024 · Updated last year
Hertin / WavPrompt
☆37 · Jun 30, 2022 · Updated 4 years ago
hvy / chainer-faster-rcnn
☆10 · Apr 22, 2016 · Updated 10 years ago
ktatar / rawaudiovae
☆12 · Jun 9, 2025 · Updated last year
xavierfav / coala
COALA: Co-Aligned Autoencoders for Learning Semantically Enriched Audio Representations
☆48 · Jul 25, 2024 · Updated 2 years ago