Official implementation of FOP method as described in "Fusion and Orthogonal Projection for Improved Face-Voice Association"
☆22Dec 31, 2025Updated 6 months ago
Alternatives and similar repositories for FOP
Users that are interested in FOP are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆11Nov 5, 2025Updated 7 months ago
- Official implementation of SBNet as described in "Single-branch Network for Multimodal Training".☆13Aug 28, 2023Updated 2 years ago
- Multi-Stage Face-Voice Association Learning with Keynote Speaker Diarization (ACM MM 2024)☆22Jul 25, 2024Updated last year
- [IJCAI2022] Unsupervised Voice-Face Representation Learning by Cross-Modal Prototype Contrast☆21Oct 25, 2023Updated 2 years ago
- Code for "Self-Lifting: A Novel Framework For Unsupervised Voice-Face Association Learning,ICMR,2022"☆15Oct 25, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- SVHF-Net for Cross-modal binary matching☆32Aug 22, 2018Updated 7 years ago
- ☆18Apr 16, 2026Updated 2 months ago
- Voice-Face Association Learning Evaluation☆49Feb 13, 2024Updated 2 years ago
- ☆11Jun 7, 2023Updated 3 years ago
- ☆10Jul 24, 2019Updated 6 years ago
- ☆28Dec 22, 2021Updated 4 years ago
- Improving Recording Device Generalization using Impulse Response Augmentation☆21Apr 24, 2025Updated last year
- ☆19Nov 19, 2021Updated 4 years ago
- A modern html based rich text editor for iOS and macOS (Catalyst or AppKit) written in Swift. You can use Quill (soon) or Froala.☆11Aug 26, 2020Updated 5 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Code for "Understanding Neural Abstractive Summarization Models via Uncertainty" (EMNLP20)☆30Oct 16, 2020Updated 5 years ago
- ☆13Nov 30, 2022Updated 3 years ago
- ☆29Oct 17, 2024Updated last year
- DeepEar: Sound Localization with Binaural Microphones☆15Nov 20, 2025Updated 7 months ago
- Code associated with the paper: "Few-Shot Self-Rationalization with Natural Language Prompts"☆12Apr 27, 2022Updated 4 years ago
- Official repository of NeXt-TDNN for speaker verification☆83Oct 10, 2024Updated last year
- Learning associations between human faces and voices☆12Feb 15, 2019Updated 7 years ago
- [TMI' 23] FedDM: Federated Weakly Supervised Segmentation via Annotation Calibration and Gradient De-conflicting☆14Mar 11, 2023Updated 3 years ago
- Code for the WWW'23 paper "Sanitizing Sentence Embeddings (and Labels) for Local Differential Privacy"☆12Feb 20, 2023Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆20May 30, 2024Updated 2 years ago
- codes for: Modality to Modality Translation: An Adversarial Representation Learning and Graph Fusion Network for Multimodal Fusion☆48Sep 1, 2021Updated 4 years ago
- [TACL/EMNLP'24] Do Vision and Language Models Share Concepts? A Vector Space Alignment Study☆16Nov 22, 2024Updated last year
- Download and preprocess voxceleb datasets.☆40Jun 18, 2025Updated last year
- Scalable framework for comparing metric measure spaces with up to 1M points.☆16Apr 6, 2021Updated 5 years ago
- PyTorch implementation of quantization-aware matrix factorization (QMF) for data compression☆17Jul 14, 2025Updated 11 months ago
- Multi-modal fusion framework based on Transformer Encoder☆16Dec 20, 2020Updated 5 years ago
- This is the release code for CVPR2022 paper "Voice-Face Homogeneity Tells Deepfake".☆15Mar 7, 2022Updated 4 years ago
- logWMSE, an audio quality metric & loss function with support for digital silence target. Useful for training and evaluating audio source…☆48Apr 29, 2026Updated 2 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- PyTorch implementation of the RCSLS cross-lingual word embedding alignment method☆12May 1, 2019Updated 7 years ago
- ☆16Apr 27, 2025Updated last year
- End-to-End binaural sound localization☆17Feb 27, 2020Updated 6 years ago
- Element UI 照片墙增加拖动调整顺序功能☆16Oct 19, 2020Updated 5 years ago
- This branch of Asteroid contains code for the vocal harmony and chamber ensemble separation related papers.☆12Nov 7, 2024Updated last year
- ☆16Mar 18, 2023Updated 3 years ago
- My implement of InstantBooth☆14Sep 11, 2023Updated 2 years ago