KID-7391 / seeking-the-shape-of-sound
☆19 · Jun 8, 2021 · Updated 4 years ago
Alternatives and similar repositories for seeking-the-shape-of-sound
Users who are interested in seeking-the-shape-of-sound are comparing it to the repositories listed below.
- [IJCAI2022] Unsupervised Voice-Face Representation Learning by Cross-Modal Prototype Contrast ☆21 · Oct 25, 2023 · Updated 2 years ago
- SVHF-Net for Cross-modal binary matching ☆32 · Aug 22, 2018 · Updated 7 years ago
- Learning associations between human faces and voices ☆12 · Feb 15, 2019 · Updated 7 years ago
- This is the release code for CVPR2022 paper "Voice-Face Homogeneity Tells Deepfake". ☆15 · Mar 7, 2022 · Updated 3 years ago
- Implementation of our PR 2020 paper: Unsupervised Text-to-Image Synthesis ☆13 · Jul 9, 2020 · Updated 5 years ago
- ☆17 · Nov 4, 2022 · Updated 3 years ago
- ☆28 · Dec 22, 2021 · Updated 4 years ago
- Implementation of the CVPR 2019 Paper - Speech2Face: Learning the Face Behind a Voice by MIT CSAIL ☆178 · Mar 24, 2023 · Updated 2 years ago
- This branch of Asteroid contains code for the vocal harmony and chamber ensemble separation related papers. ☆12 · Nov 7, 2024 · Updated last year
- MXNet/Gluon implementation of the original (Gaussian) Variational Autoencoders (VAE) ☆10 · Dec 22, 2017 · Updated 8 years ago
- Frequency tracking in time-frequency representations ☆13 · Jan 19, 2021 · Updated 5 years ago
- Official Code for Large-vocabulary forensic pathological analyses via prototypical cross-modal contrastive learning ☆15 · Jul 24, 2025 · Updated 6 months ago
- ☆10 · Jun 2, 2024 · Updated last year
- Time frequency ridge detection based on relevant ridge portions ☆11 · Aug 17, 2023 · Updated 2 years ago
- [CVPR 2019] Official Matlab implementation of OSD: Unsupervised image matching and object discovery as optimization. ☆12 · Nov 4, 2021 · Updated 4 years ago
- This repository contains the official code for "Flexible Biometrics Recognition: Bridging the Multimodality Gap through Attention, Alignm… ☆12 · Oct 9, 2024 · Updated last year
- MLR-Bench: Evaluating AI Agents on Open-Ended Machine Learning Research ☆22 · Sep 23, 2025 · Updated 4 months ago
- A public repository for ConDo (AAAI25 accepted) ☆10 · Dec 21, 2024 · Updated last year
- Pytorch implementation of our work "Domain-Invariant Representation Learning of Bird Sounds" (arXiv 2024) ☆11 · Feb 20, 2025 · Updated 11 months ago
- ☆19 · Aug 7, 2025 · Updated 6 months ago
- This repository contains the speaker labeled information of VoxCeleb2 and LRS3 audio-visual datasets. (AAAI 2025) ☆12 · Sep 6, 2024 · Updated last year
- Pytorch implementation of "Towards Practical and Efficient Image-to-Speech Captioning with Vision-Language Pre-training and Multi-modal T… ☆12 · Mar 9, 2024 · Updated last year
- [TNNLS 2022] Official pytorch implementation of "Tackling the Challenges in Scene Graph Generation with Local-to-Global Interactions" ☆11 · Apr 19, 2022 · Updated 3 years ago
- ☆12 · Jan 25, 2025 · Updated last year
- Patch-Diffusion Code (AAAI2022) ☆13 · Mar 3, 2022 · Updated 3 years ago
- code for paper "learning to fool the speaker recognition" ☆10 · Jun 12, 2020 · Updated 5 years ago
- [ECCV 2022] Dual-Evidential Learning for Weakly-supervised Temporal Action Localization ☆49 · Apr 19, 2024 · Updated last year
- Human age estimation using deep neural networks (Keras) ☆13 · Aug 10, 2023 · Updated 2 years ago
- 2023 Spring SNU Computer Vision Project ☆14 · Jun 13, 2023 · Updated 2 years ago
- ☆12 · Mar 3, 2025 · Updated 11 months ago
- [AAAI 2026] ReCode: Reinforced Code Knowledge Editing for API Updates ☆22 · Jul 1, 2025 · Updated 7 months ago
- ☆11 · Dec 8, 2022 · Updated 3 years ago
- ☆13 · Sep 26, 2023 · Updated 2 years ago
- ☆10 · Aug 28, 2020 · Updated 5 years ago
- [NeurIPS 2023] Official Implementation of "PaintSeg: Painting Pixels for Training-free Segmentation" ☆14 · Dec 31, 2023 · Updated 2 years ago
- ☆10 · Nov 16, 2021 · Updated 4 years ago
- [CVPR 2025] Your Large Vision-Language Model Only Needs A Few Attention Heads For Visual Grounding ☆16 · Oct 4, 2025 · Updated 4 months ago
- Examples of how to use API of MVSep service ☆28 · Jun 21, 2025 · Updated 7 months ago
- Speaker overlap-aware Neural Diarization ☆12 · Feb 13, 2023 · Updated 3 years ago