The Easy Communications (EasyCom) dataset is a world-first dataset designed to help mitigate the *cocktail party effect* from an augmented-reality (AR) -motivated multi-sensor egocentric world view.
☆134Dec 4, 2023Updated 2 years ago
Alternatives and similar repositories for EasyComDataset
Users that are interested in EasyComDataset are comparing it to the libraries listed below
Sorting:
- Blind Identification of Binaural Room Impulse Responses from Head-Worn Microphone Arrays☆20Sep 18, 2024Updated last year
- Code for paper Audio Visual Speaker Localization from EgoCentric Views☆11Jul 3, 2024Updated last year
- This repository contains the baseline system for CHiME-8 MMCSG challenge focusing on transcribing both sides of a conversation where one …☆40Mar 13, 2024Updated 2 years ago
- This repository contains code for the generation of binaural Room Impulse Responses using the Paraspax method and implementing a 6 DoF en…☆31Nov 20, 2024Updated last year
- Permutation invariant training in PyTorch☆13Oct 2, 2020Updated 5 years ago
- Diffusion-based Speech Enhancement: Demonstration of Performance and Generalization☆14Dec 21, 2024Updated last year
- Code for paper Learning Audio-Visual Dereverberation☆31Aug 10, 2022Updated 3 years ago
- A list of publications that have accompanying open-source code☆22Oct 30, 2023Updated 2 years ago
- MeshRIR: Dataset of room impulse responses on meshed grid points☆43Mar 13, 2026Updated last week
- Different implementations of "Weighted Prediction Error" for speech dereverberation☆557Mar 19, 2025Updated last year
- Real-Time Spherical Array Renderer for binaural reproduction in Python☆79Dec 2, 2023Updated 2 years ago
- SPEAR Challenge scripts and tools.☆24Mar 17, 2023Updated 3 years ago
- Implementation of algorithms for refinement of direction of arrival estimators by optimization☆16Jun 2, 2021Updated 4 years ago
- N/A☆187May 19, 2022Updated 3 years ago
- Official Pytorch implementation of PULSE: Positive–Unlabelled Learning for audio Signal Enhancement (Best Paper Award at ICASSP 2023)☆43Jul 24, 2023Updated 2 years ago
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach☆20Aug 2, 2021Updated 4 years ago
- A multizone sound field control method to synthesize a desired amplitude (or magnitude) distributions over a target region with multiple …☆15Mar 30, 2023Updated 2 years ago
- Python loaders for many Real Room Impulse Response databases☆96Sep 30, 2024Updated last year
- Code repository for the paper Robust Sound Source Tracking Using SRP-PHAT and 3D Convolutional Neural Networks☆88Mar 24, 2023Updated 2 years ago
- Official code for "Learning Neural Acoustic Fields" (NeurIPS 2022)☆164Jan 20, 2024Updated 2 years ago
- An auralisation system that takes a head-worn microphone array recordings as input and renders the audio for binaural playback; taking in…☆35Oct 10, 2023Updated 2 years ago
- A Repository of Room Responses and 360 Videos of a Variable Acoustics Lab☆45Mar 14, 2023Updated 3 years ago
- [CVPR 2024] Code and datasets for 'Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos'☆13Jun 16, 2024Updated last year
- Python library for Room Impulse Response (RIR) simulation with GPU acceleration☆586Jul 18, 2025Updated 8 months ago
- Analyze, visualize, and process sound field data recorded by spherical microphone arrays.☆106Dec 29, 2022Updated 3 years ago
- ☆210Dec 4, 2023Updated 2 years ago
- Room Impulse Response Generator (MATLAB)☆489Oct 24, 2025Updated 4 months ago
- A list of datasets made available by members of the Aalto Acoustics Lab☆29Sep 6, 2024Updated last year
- Repo for our research paper "Learning Acoustic Scattering Fields for Dynamic Interactive Sound Propagation"☆17Apr 6, 2021Updated 4 years ago
- A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NeurI…☆155Apr 29, 2025Updated 10 months ago
- ☆53May 15, 2025Updated 10 months ago
- [CVPR 2023] Egocentric Audio-Visual Object Localization☆27Jan 6, 2024Updated 2 years ago
- SMS-WSJ: Spatialized Multi-Speaker Wall Street Journal database for multi-channel source separation and recognition☆128Jun 7, 2024Updated last year
- Easy to use Beamformers for multi-channel speech separation/enhancement☆211Jan 26, 2021Updated 5 years ago
- Translating Synthetic RIRs to Real RIRs☆45Sep 15, 2023Updated 2 years ago
- Blind System Identification and Equalization Toolbox☆20Jul 9, 2018Updated 7 years ago
- Official data preparation scripts for the URGENT 2024 Challenge☆87May 21, 2025Updated 10 months ago
- A first-of-its-kind acoustic simulation platform for audio-visual embodied AI research. It supports training and evaluating multiple task…☆447Sep 29, 2023Updated 2 years ago
- Binaural impulse responses captured in real rooms.☆37Mar 9, 2016Updated 10 years ago