romainloiseau / a-model-you-can-hear
Official Pytorch implementation of the "A Model You Can Hear: Audio Identification with Playable Prototypes" paper
☆37 Updated 3 years ago
Alternatives and similar repositories for a-model-you-can-hear
Users that are interested in a-model-you-can-hear are comparing it to the libraries listed below
- Toolbox for HelixNet, a dataset presented in the "Online Segmentation of LiDAR Sequences: Dataset and Algorithm" paper ☆44 Updated 2 years ago
- (ICCV 2021) Code for "Unsupervised Layered Image Decomposition into Object Prototypes" paper ☆46 Updated 2 years ago
- ☆29 Updated 2 years ago
- (CVPRW 2022) Learning Co-segmentation by Segment Swapping for Retrieval and Discovery ☆53 Updated 3 years ago
- PyTorch implementation of "Representing Shape Collections with Alignment-Aware Linear Models" paper. ☆30 Updated 3 years ago
- Toolbox for the Earth Parser Dataset, a dataset presented in the "Learnable Earth Parser: Discovering 3D Prototypes in Aerial Scans" pape… ☆26 Updated 2 years ago
- utility functions for CIL ☆20 Updated last year
- Pytorch implementation of our work "Domain-Invariant Representation Learning of Bird Sounds" (arXiv 2024) ☆10 Updated 8 months ago
- [ICLR 2025] NeRAF jointly learns acoustic and radiance fields, enabling realistic audio-visual generation. ☆29 Updated 5 months ago
- ☆19 Updated 2 years ago
- Code for Novel View Acoustic Synthesis paper ☆51 Updated 2 years ago
- Official code for "Learning Neural Acoustic Fields" (NeurIPS 2022) ☆152 Updated last year
- ☆14 Updated 4 years ago
- (IGARSS 2025) Prototype-based method for agricultural image time series classification. ☆43 Updated last year
- Reliability in Semantic Segmentation: Can We Use Synthetic Data? (ECCV 2024) ☆39 Updated last year
- ☆46 Updated last year
- Evaluation script for VoxMovies dataset in PyTorch ☆23 Updated last year
- Official code for the paper: [ICCV2023] Sound Localization from Motion: Jointly Learning Sound Direction and Camera Rotation ☆40 Updated last year
- Official PyTorch implementation of the paper "Chapter-Llama: Efficient Chaptering in Hour-Long Videos with LLMs" ☆77 Updated 4 months ago
- Official Pytorch implementation of the "Learnable Earth Parser: Discovering 3D Prototypes in Aerial Scans" paper ☆60 Updated last year
- Code for "Don’t drop your samples! Coherence-aware training benefits Conditional diffusion" CVPR 2024 Highlight ☆57 Updated 3 months ago
- (3DV 2021 oral) PyTorch implementation of paper "PoseContrast: Class-Agnostic Object Viewpoint Estimation in the Wild with Pose-Aware Con… ☆45 Updated last year
- Implementation of the multi-temporal UTAE for the task of satellite image time series semantic change detection (SITS-SCD) ☆57 Updated last year
- Repo for Visual Acoustic Matching, CVPR 2022 ☆68 Updated 2 years ago
- The Learnable Typewriter: A Generative Approach to Text Line Analysis ☆34 Updated last year
- High order Moment Models ☆42 Updated last month
- Facestar dataset. High quality audio-visual recordings of human conversational speech. ☆110 Updated 3 years ago
- (EarthVision 2025 - CVPR Workshop) Official repository of DAFA-LS, a dataset of satellite image time series for the task of archaeologica… ☆38 Updated 11 months ago
- Code and datasets for 'Few-Shot Audio-Visual Learning of Environment Acoustics' (NeurIPS 2022) ☆23 Updated last year
- official implementation of the Polynomial Mixer ☆22 Updated last month