[IJCAI2022] Unsupervised Voice-Face Representation Learning by Cross-Modal Prototype Contrast
☆21Oct 25, 2023Updated 2 years ago
Alternatives and similar repositories for CMPC
Users that are interested in CMPC are comparing it to the libraries listed below
- Official implementation of SBNet as described in "Single-branch Network for Multimodal Training".☆12Aug 28, 2023Updated 2 years ago
- Voice Face Association Learning Paper List☆17May 20, 2023Updated 2 years ago
- ☆19Jun 8, 2021Updated 4 years ago
- Code for "Self-Lifting: A Novel Framework For Unsupervised Voice-Face Association Learning, ICMR, 2022"☆15Oct 25, 2024Updated last year
- Official implementation of FOP method as described in "Fusion and Orthogonal Projection for Improved Face-Voice Association"☆21Dec 31, 2025Updated 2 months ago
- PyTorch code for the CVPR'23 paper: "ConStruct-VL: Data-Free Continual Structured VL Concepts Learning"☆14Feb 5, 2024Updated 2 years ago
- ☆28Dec 22, 2021Updated 4 years ago
- ☆16Apr 27, 2025Updated 10 months ago
- This is the release code for CVPR2022 paper "Voice-Face Homogeneity Tells Deepfake".☆15Mar 7, 2022Updated 3 years ago
- ☆17Jan 26, 2021Updated 5 years ago
- Official implementation of A cappella: Audio-visual Singing Voice Separation, from BMVC21☆16May 14, 2022Updated 3 years ago
- Multi-Stage Face-Voice Association Learning with Keynote Speaker Diarization (ACM MM 2024)☆22Jul 25, 2024Updated last year
- ☆21Mar 4, 2024Updated last year
- Web app for reductive analyses of scores☆20Feb 1, 2026Updated last month
- Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.☆21May 26, 2025Updated 9 months ago
- Evaluation script for VoxMovies dataset in PyTorch☆23Jan 12, 2024Updated 2 years ago
- A collection of papers I am interested in.☆29Apr 3, 2023Updated 2 years ago
- Discrete wavelet transform layers with fixed and trainable wavelets☆22Nov 27, 2022Updated 3 years ago
- ☆27Jun 27, 2023Updated 2 years ago
- A Java project which is able to split MIDI performance data into monophonic voices.☆23Aug 26, 2020Updated 5 years ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆26Oct 5, 2022Updated 3 years ago
- Voice-Face Association Learning Evaluation☆49Feb 13, 2024Updated 2 years ago
- Trained data models for madmom: https://github.com/CPJKU/madmom☆25Mar 22, 2022Updated 3 years ago
- Pumilio: A Web-Based Management System for Ecological Recordings☆13Oct 29, 2018Updated 7 years ago
- The code of '3D-Aware Semantic-Guided Generative Model for Human Synthesis' (ECCV 2022)☆36Jul 18, 2022Updated 3 years ago
- [ismir2019] Learning a Joint Embedding Space of Monophonic and Mixed Music Signals for Singing Voice☆28Dec 8, 2022Updated 3 years ago
- ☆36May 24, 2024Updated last year
- Download and preprocess voxceleb datasets.☆38Jun 18, 2025Updated 8 months ago
- Code, source data, examples, and audio excerpts for Flow: Expressive Rhythm in the Rapping Voice☆10Feb 13, 2020Updated 6 years ago
- SGAP-Net: Semantic-Guided Attentive Prototypes Network for Few-Shot Human-Object Interaction Recognition, AAAI2020.☆14Dec 15, 2020Updated 5 years ago
- Virtual news production using Tacotron2 and Wav2Lip☆11Nov 14, 2023Updated 2 years ago
- The Introduction of the OLKAVS Dataset☆37May 28, 2024Updated last year
- Resources for "Simple Speech Representation Learning from Perceptual Data".☆11Sep 18, 2023Updated 2 years ago
- Official implementation of "Flying Guide Dog: Walkable Path Discovery for the Visually Impaired Utilizing Drones and Transformer-based Se…☆14Feb 6, 2022Updated 4 years ago
- PyGun: Procedural Generation of Anechoic Gunshot Sounds☆14Oct 8, 2016Updated 9 years ago
- Eliza Agent Weaver enables you to develop a set of Character files based on your own lore, and connects the narratives of multiple agents…☆10Dec 12, 2024Updated last year
- Tool for Evaluating Multilingual WS-353 and SimLex-999☆10Dec 15, 2016Updated 9 years ago
- Enhancing Recipe Retrieval with Foundation Models: A Data Augmentation Perspective☆14Oct 22, 2024Updated last year
- Augmentation adversarial training for self-supervised speaker recognition☆78Aug 15, 2021Updated 4 years ago