Cocoxili/CMPC

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Cocoxili/CMPC)

Cocoxili / CMPC

[IJCAI2022] Unsupervised Voice-Face Representation Learning by Cross-Modal Prototype Contrast

☆21

Alternatives and similar repositories for CMPC

Users that are interested in CMPC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

my-yy / vfal_papers
View on GitHub
Voice Face Association Learning Paper List
☆17May 20, 2023Updated 3 years ago
msaadsaeed / SBNet
View on GitHub
Official implementation of SBNet as described in "Single-branch Network for Multimodal Training".
☆13Aug 28, 2023Updated 2 years ago
KID-7391 / seeking-the-shape-of-sound
View on GitHub
☆19Jun 8, 2021Updated 5 years ago
BAI-Yeqi / SF2F_PyTorch
View on GitHub
☆16Apr 27, 2025Updated last year
my-yy / sl_icmr2022
View on GitHub
Code for "Self-Lifting: A Novel Framework For Unsupervised Voice-Face Association Learning,ICMR,2022"
☆15Oct 25, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
CV-IP / VFD
View on GitHub
This is the release code for CVPR2022 paper "Voice-Face Homogeneity Tells Deepfake".
☆15Mar 7, 2022Updated 4 years ago
jamessealesmith / ConStruct-VL
View on GitHub
PyTorch code for the CVPR'23 paper: "ConStruct-VL: Data-Free Continual Structured VL Concepts Learning"
☆13Feb 5, 2024Updated 2 years ago
msaadsaeed / FOP
View on GitHub
Official implementation of FOP method as described in "Fusion and Orthogonal Projection for Improved Face-Voice Association"
☆23Dec 31, 2025Updated 6 months ago
TaoRuijie / MFV-KSD
View on GitHub
Multi-Stage Face-Voice Association Learning with Keynote Speaker Diarization (ACM MM 2024)
☆22Jul 25, 2024Updated 2 years ago
my-yy / vfal-eva
View on GitHub
Voice-Face Association Learning Evaluation
☆49Feb 13, 2024Updated 2 years ago
EvelynChee / LO2LN
View on GitHub
☆10May 16, 2025Updated last year
chester-w-xie / FCAC_datasets
View on GitHub
Details of the datasets for Few-shot class-incremental audio classification
☆10Dec 6, 2023Updated 2 years ago
TakHemlata / T-EER
View on GitHub
Official PyTorch implementation of "t-EER: Parameter-Free Tandem Evaluation Metric of Countermeasures and Biometric Comparators"
☆14Sep 25, 2023Updated 2 years ago
jasonppy / FaST-VGS-Family
View on GitHub
Transformer-based visually grounded speech models
☆19Sep 22, 2022Updated 3 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
sectum1919 / cncvs_data_collector
View on GitHub
☆27Jun 27, 2023Updated 3 years ago
v3tech / YunSDR
View on GitHub
YunSDR Open Source Project
☆14Feb 24, 2017Updated 9 years ago
seongmin-kye / CAP
View on GitHub
Cross attentive pooling for speaker verification (IEEE SLT, 2021)
☆12Dec 14, 2020Updated 5 years ago
TamashaM / NAPA-VQ
View on GitHub
☆11Jul 4, 2024Updated 2 years ago
Liu-Tianchi / Golden-Gemini-for-Speaker-Verification
View on GitHub
Official release of pretrained models and codes for 'Golden Gemini Is All You Need: Finding the Sweet Spots for Speaker Verification'
☆15Jan 20, 2025Updated last year
statusrank / A-Generic-Framework-for-Optimizing-Two-way-Partial-AUC
View on GitHub
This is an official PyTorch code for our accepted paper "When All We Need is a Piece of the Pie: A Generic Framework for Optimizing Two-w…
☆15Jul 7, 2022Updated 4 years ago
PeterouZh / Deep_Generative_Models
View on GitHub
A collection of papers I am interested in.
☆29Apr 3, 2023Updated 3 years ago
pkuschool / intro
View on GitHub
给新生用的 Introduction
☆14Jun 23, 2026Updated last month
StelaBou / voxceleb_preprocessing
View on GitHub
Download and preprocess voxceleb datasets.
☆41Jun 18, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
cyjie429 / RegO
View on GitHub
Region-Based Optimization in Continual Learning for Audio Deepfake Detection
☆14Dec 17, 2024Updated last year
JuanFMontesinos / Acappella-YNet
View on GitHub
Official implementation of A cappella: Audio-visual Singing VoiceSeparation, from BMVC21
☆18May 14, 2022Updated 4 years ago
JaesungHuh / VoxSRC2021
View on GitHub
Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2021
☆19Jul 21, 2021Updated 5 years ago
vinceasvp / meta-sc
View on GitHub
☆11May 30, 2023Updated 3 years ago
tqbl / arca23k-dataset
View on GitHub
The code used to create the ARCA23K and ARCA23K-FSD datasets
☆16Nov 9, 2021Updated 4 years ago
kjw11 / Speaker-Aware-CTC
View on GitHub
Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.
☆22May 26, 2025Updated last year
ChanghwaPark / DANN-tf2
View on GitHub
Tensorflow 2.0 implementation of Domain Adversarial Neural Networks (DANN)
☆12Dec 3, 2019Updated 6 years ago
MacLLL / SELC
View on GitHub
self ensemble label correction
☆17Jul 29, 2022Updated 3 years ago
mogar / uhd_ofdm
View on GitHub
OFDM implementation for GNURadio using UHD
☆12Sep 27, 2011Updated 14 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
chenshen03 / Deepfakes-Detection-Papers
View on GitHub
The papers of Deepfakes Detection.
☆23Feb 3, 2021Updated 5 years ago
tuncayka / speech_emotion
View on GitHub
The Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS)
☆19Dec 8, 2022Updated 3 years ago
natee / el-upload-sortable
View on GitHub
Element UI 照片墙增加拖动调整顺序功能
☆16Oct 19, 2020Updated 5 years ago
ihp-lab / Speaker-Invariant-Domain-Adversarial-Neural-Networks
View on GitHub
☆11Sep 29, 2020Updated 5 years ago
THUsatlab / BERT-LID
View on GitHub
Leveraging BERT to Improve Spoken Language Identification
☆17Nov 22, 2022Updated 3 years ago
UtkMSNL / IEEE802.11-complete
View on GitHub
IEEE 802.11 transceiver design with MAC & rate adaptation
☆18Dec 16, 2017Updated 8 years ago
samuelyu2002 / PACS
View on GitHub
Code and dataset release for "PACS: A Dataset for Physical Audiovisual CommonSense Reasoning" (ECCV 2022)
☆18Dec 20, 2022Updated 3 years ago