romainloiseau / a-model-you-can-hearLinks
Official Pytorch implementation of the "A Model You Can Hear: Audio Identification with Playable Prototypes" paper
☆37Updated 3 years ago
Alternatives and similar repositories for a-model-you-can-hear
Users that are interested in a-model-you-can-hear are comparing it to the libraries listed below
Sorting:
- Pytorch implementation of our work "Domain-Invariant Representation Learning of Bird Sounds" (arXiv 2024)☆10Updated 9 months ago
- PyTorch implementation of "Representing Shape Collections with Alignment-Aware Linear Models" paper.☆30Updated 3 years ago
- (CVPRW 2022) Learning Co-segmentation by Segment Swapping for Retrieval and Discovery☆53Updated 3 years ago
- Code for Novel View Acoustic Synthesis paper☆51Updated 2 years ago
- ☆31Updated 2 years ago
- (ICCV 2021) Code for "Unsupervised Layered Image Decomposition into Object Prototypes" paper☆46Updated 2 years ago
- [ICLR 2025] NeRAF jointly learns acoustic and radiance fields, enabling realistic audio-visual generation.☆32Updated 7 months ago
- Toolbox for the Earth Parser Dataset, a dataset presented in the "Learnable Earth Parser: Discovering 3D Prototypes in Aerial Scans" pape…☆26Updated 2 years ago
- Toolbox for HelixNet, a dataset presented in the "Online Segmentation of LiDAR Sequences: Dataset and Algorithm" paper☆44Updated 3 years ago
- Evaluation script for VoxMovies dataset in PyTorch☆23Updated last year
- ☆48Updated last year
- Official code for "Learning Neural Acoustic Fields" (NeurIPS 2022)☆155Updated last year
- ☆19Updated 3 years ago
- utility functions for CIL☆20Updated last year
- Official code for the paper: [ICCV2023] Sound Localization from Motion: Jointly Learning Sound Direction and Camera Rotation☆41Updated last year
- official implementation of the Polynomial Mixer☆22Updated 3 months ago
- (IGARSS 2025) Prototype-based method for agricultural image time series classification.☆44Updated last year
- Codebase for the paper "Visually Informed Binaural Audio Generation without Binaural Audios" (CVPR 2021)☆68Updated 4 years ago
- Code for "Don’t drop your samples! Coherence-aware training benefits Conditional diffusion" CVPR 2024 Highlight☆57Updated 4 months ago
- ☆32Updated 3 years ago
- Official Pytorch implementation of the "Learnable Earth Parser: Discovering 3D Prototypes in Aerial Scans" paper☆61Updated last year
- Repo for Visual Acoustic Matching, CVPR 2022☆70Updated 2 years ago
- [TOMM 2024] Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing☆26Updated last year
- High order Moment Models☆42Updated last month
- The Learnable Typewriter: A Generative Approach to Text Line Analysis☆34Updated last year
- Official PyTorch implementation of the paper "Chapter-Llama: Efficient Chaptering in Hour-Long Videos with LLMs"☆83Updated 6 months ago
- Facestar dataset. High quality audio-visual recordings of human conversational speech.☆110Updated 3 years ago
- ☆14Updated 4 years ago
- [NeurIPS'24 splotlight] Official Repo for AcoustiX used in Acoustic volume rendering for neural impulse response fields.☆35Updated 9 months ago
- ☆47Updated last year