msaadsaeed/FOP

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/msaadsaeed/FOP)

msaadsaeed / FOP

Official implementation of FOP method as described in "Fusion and Orthogonal Projection for Improved Face-Voice Association"

☆23

Alternatives and similar repositories for FOP

Users that are interested in FOP are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

msaadsaeed / SBNet
View on GitHub
Official implementation of SBNet as described in "Single-branch Network for Multimodal Training".
☆13Aug 28, 2023Updated 2 years ago
TaoRuijie / MFV-KSD
View on GitHub
Multi-Stage Face-Voice Association Learning with Keynote Speaker Diarization (ACM MM 2024)
☆22Jul 25, 2024Updated last year
Cocoxili / CMPC
View on GitHub
[IJCAI2022] Unsupervised Voice-Face Representation Learning by Cross-Modal Prototype Contrast
☆21Oct 25, 2023Updated 2 years ago
my-yy / sl_icmr2022
View on GitHub
Code for "Self-Lifting: A Novel Framework For Unsupervised Voice-Face Association Learning,ICMR,2022"
☆15Oct 25, 2024Updated last year
KID-7391 / seeking-the-shape-of-sound
View on GitHub
☆19Jun 8, 2021Updated 5 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
my-yy / vfal-eva
View on GitHub
Voice-Face Association Learning Evaluation
☆49Feb 13, 2024Updated 2 years ago
burhanahmed1 / TaskSphere
View on GitHub
Integrated .NET-based desktop framework for dynamic task lifecycle management, featuring relational database connectivity, status trackin…
☆12May 7, 2025Updated last year
kjanjua26 / Git-Loss-For-Deep-Face-Recognition
View on GitHub
This repository contains code for my paper "Git Loss for Deep Face Recognition".
☆35Feb 7, 2021Updated 5 years ago
Wenjun-Peng / GPT4SM
View on GitHub
☆11Jun 7, 2023Updated 3 years ago
ASD0x41 / Assembly-Programming-Package
View on GitHub
Tools in Package: Notepad++, DOSBox, NASM & AFD
☆16Jan 28, 2025Updated last year
b-sigpro / sed-hsmm
View on GitHub
Onset-and-Offset-Aware Sound Event Detection
☆21Feb 10, 2025Updated last year
nttcslab / dcase2025_task4_baseline
View on GitHub
☆18Apr 16, 2026Updated 3 months ago
zye1996 / 3DSSD-torch
View on GitHub
☆19Nov 19, 2021Updated 4 years ago
lvyiwei1 / DIME
View on GitHub
☆11Aug 20, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
ASD0x41 / xide
View on GitHub
An online x86 assembly IDE, containing the Netwide Assembler (NASM), the Advanced Fullscreen Debugger (AFD) and em-dosbox (a WASM port of…
☆24Jan 27, 2025Updated last year
jiacheng-xu / text-sum-uncertainty
View on GitHub
Code for "Understanding Neural Abstractive Summarization Models via Uncertainty" (EMNLP20)
☆30Oct 16, 2020Updated 5 years ago
CPJKU / cpjku_dcase24
View on GitHub
☆29Oct 17, 2024Updated last year
trneedham / QuantizedGromovWasserstein
View on GitHub
Scalable framework for comparing metric measure spaces with up to 1M points.
☆16Apr 6, 2021Updated 5 years ago
princeton-nlp / align-mlm
View on GitHub
☆13Nov 30, 2022Updated 3 years ago
ihp-lab / Speaker-Invariant-Domain-Adversarial-Neural-Networks
View on GitHub
☆11Sep 29, 2020Updated 5 years ago
yikangshen / megablocks
View on GitHub
☆20May 30, 2024Updated 2 years ago
dmlguq456 / NeXt_TDNN_ASV
View on GitHub
Official repository of NeXt-TDNN for speaker verification
☆84Oct 10, 2024Updated last year
changil / facevoice
View on GitHub
Learning associations between human faces and voices
☆12Feb 15, 2019Updated 7 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
CityU-AIM-Group / FedDM
View on GitHub
[TMI' 23] FedDM: Federated Weakly Supervised Segmentation via Annotation Calibration and Gradient De-conflicting
☆14Mar 11, 2023Updated 3 years ago
xiangyue9607 / Sentence-LDP
View on GitHub
Code for the WWW'23 paper "Sanitizing Sentence Embeddings (and Labels) for Local Differential Privacy"
☆12Feb 20, 2023Updated 3 years ago
jiaangli / VILA
View on GitHub
[TACL/EMNLP'24] Do Vision and Language Models Share Concepts? A Vector Space Alignment Study
☆16Nov 22, 2024Updated last year
TmacMai / ARGF_multimodal_fusion
View on GitHub
codes for: Modality to Modality Translation: An Adversarial Representation Learning and Graph Fusion Network for Multimodal Fusion
☆48Sep 1, 2021Updated 4 years ago
StelaBou / voxceleb_preprocessing
View on GitHub
Download and preprocess voxceleb datasets.
☆41Jun 18, 2025Updated last year
Sunner4nwpu / TEMMA
View on GitHub
Multi-modal fusion framework based on Transformer Encoder
☆16Dec 20, 2020Updated 5 years ago
XuezheMax / gecko-llm
View on GitHub
Gecko Architecture
☆16Jan 13, 2026Updated 6 months ago
crlandsc / torch-log-wmse
View on GitHub
logWMSE, an audio quality metric & loss function with support for digital silence target. Useful for training and evaluating audio source…
☆48Apr 29, 2026Updated 2 months ago
BAI-Yeqi / SF2F_PyTorch
View on GitHub
☆16Apr 27, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
plnguyen2908 / UniTalk-ASD-code
View on GitHub
[Interspeech 2026] Revisiting Active Speaker Detection: An In-the-Wild Benchmark for Generalization and Robustness
☆21Jun 25, 2026Updated 3 weeks ago
DeepMIALab / PathoSeg
View on GitHub
☆20Jul 2, 2023Updated 3 years ago
crlandsc / moises-light
View on GitHub
Unofficial PyTorch implementation of "Moises-Light: Resource-efficient Band-split U-Net For Music Source Separation"
☆32May 1, 2026Updated 2 months ago
tuncayka / speech_emotion
View on GitHub
The Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS)
☆19Dec 8, 2022Updated 3 years ago
tdavislab / verb
View on GitHub
☆16Mar 18, 2023Updated 3 years ago
saurjya / EnsembleSep
View on GitHub
This branch of Asteroid contains code for the vocal harmony and chamber ensemble separation related papers.
☆12Nov 7, 2024Updated last year
Pzoom522 / L1-Refinement
View on GitHub
Code for "Cross-Lingual Word Embedding Refinement by ℓ1 Norm Optimisation" (NAACL 2021)
☆17Jun 16, 2022Updated 4 years ago