my-yy/vfal_papers

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/my-yy/vfal_papers)

my-yy / vfal_papers

Voice Face Association Learning Paper List

☆17

Alternatives and similar repositories for vfal_papers

Users that are interested in vfal_papers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Cocoxili / CMPC
View on GitHub
[IJCAI2022] Unsupervised Voice-Face Representation Learning by Cross-Modal Prototype Contrast
☆21Oct 25, 2023Updated 2 years ago
msaadsaeed / SBNet
View on GitHub
Official implementation of SBNet as described in "Single-branch Network for Multimodal Training".
☆13Aug 28, 2023Updated 2 years ago
qinxiaoyi / Simple-Attention-Module-based-Speaker-Verification-with-Iterative-Noisy-Label-Detection
View on GitHub
☆12Jun 14, 2022Updated 4 years ago
MrChenFeng / Adaptive-Soft-Contrastive-Learning_ICPR2022
View on GitHub
ASCL: adpative Soft Contrastive Learning (ICPR2022)
☆22Mar 22, 2025Updated last year
aispeech-lab / advr-avss
View on GitHub
Pytorch implementation of our paper: Audio-Visual Speech Separation with Visual Features Enhanced by Adversarial Training.
☆18Jul 11, 2022Updated 4 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
mavceleb / mavceleb_baseline
View on GitHub
☆11Nov 5, 2025Updated 8 months ago
TaoRuijie / MFV-KSD
View on GitHub
Multi-Stage Face-Voice Association Learning with Keynote Speaker Diarization (ACM MM 2024)
☆22Jul 25, 2024Updated last year
BAI-Yeqi / SF2F_PyTorch
View on GitHub
☆16Apr 27, 2025Updated last year
KID-7391 / seeking-the-shape-of-sound
View on GitHub
☆19Jun 8, 2021Updated 5 years ago
Levent9 / Zero-shot-FaceVC
View on GitHub
☆19Mar 2, 2024Updated 2 years ago
my-yy / vfal-eva
View on GitHub
Voice-Face Association Learning Evaluation
☆49Feb 13, 2024Updated 2 years ago
walkoncross / voxceleb2-download-zyf
View on GitHub
Tools for downloading VoxCeleb2 dataset
☆35Mar 16, 2024Updated 2 years ago
wangshaonan / Associative-multichannel-autoencoder
View on GitHub
code for EMNLP2018 paper 'Associative-multichannel-autoencoder for multimodal word representation'
☆13Aug 24, 2018Updated 7 years ago
penghu-cs / RONO
View on GitHub
RONO: Robust Discriminative Learning with Noisy Labels for 2D-3D Cross-Modal Retrieval (CVPR 2023, PyTorch Code)
☆23Mar 11, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
zexupan / MuSE
View on GitHub
☆42Nov 22, 2024Updated last year
TaoRuijie / Loss-Gated-Learning
View on GitHub
ICASSP 2022: 'Self-supervised Speaker Recognition with Loss-gated Learning'
☆92May 29, 2023Updated 3 years ago
18573462816 / MEBCRN
View on GitHub
Deep learning network MEBCRN for separation of fat and water magnetic resonance images
☆11Dec 29, 2020Updated 5 years ago
TaoRuijie / SEANet
View on GitHub
Code for Audio-Visual Target Speaker Extraction with Selective Auditory Attention (TASLP)
☆32Feb 28, 2025Updated last year
simonsuthers / IBM-Separation
View on GitHub
Python code to show basic sound separation using Ideal Binary Masks
☆13Oct 13, 2018Updated 7 years ago
liyunlongaaa / AD-TUNING
View on GitHub
AD-TUNING: An Adaptive CHILD-TUNING Approach to Efficient Hyperparameter Optimization of Child Networks for Speech Processing Tasks in th…
☆11Feb 23, 2024Updated 2 years ago
akhilmathurs / collossl
View on GitHub
☆16Jul 2, 2022Updated 4 years ago
joewilaj / nbaGNNs
View on GitHub
Graph neural network models to perform link prediction on the nba and ncaa point differential graph on seasons 2013-2019, 2021.
☆10Jul 4, 2021Updated 5 years ago
lin9x / AV-Sepformer
View on GitHub
☆65Jun 28, 2023Updated 3 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
PunkMale / ECAPA-TDNN-CNCeleb
View on GitHub
针对CN-Celeb数据集的基于ECAPA-TDNN的说话人识别的pytorch实现
☆13Apr 3, 2023Updated 3 years ago
TaoRuijie / AVCleanse
View on GitHub
ICASSP 2023: 'Speaker recognition with two-step multi-modal deep cleansing'
☆44Oct 31, 2022Updated 3 years ago
nazmul-karim170 / CNLL
View on GitHub
[CVPR'22] Official Implementation of "CNLL: A Semi-supervised Approach for Continual Noisy Label Learning"
☆18Oct 8, 2024Updated last year
fchest / Speech-Transformer-multi-GPUs
View on GitHub
A PyTorch implementation of Speech Transformer with multi-GPUs, an End-to-End ASR with Transformer network on Mandarin Chinese. This code…
☆10Dec 25, 2019Updated 6 years ago
Zhang-xie / CSI-HAR-dataset-survey
View on GitHub
A Survey on Wi-Fi Channel State Information Datasets for Human Activity Recognition
☆14Aug 3, 2022Updated 3 years ago
CV-IP / VFD
View on GitHub
This is the release code for CVPR2022 paper "Voice-Face Homogeneity Tells Deepfake".
☆15Mar 7, 2022Updated 4 years ago
StelaBou / voxceleb_preprocessing
View on GitHub
Download and preprocess voxceleb datasets.
☆41Jun 18, 2025Updated last year
dcaulley / av_diarization
View on GitHub
AudioVisual Diarization - Supervised and Unsupervised
☆15Nov 22, 2022Updated 3 years ago
xiaoxiaomiao323 / MSA
View on GitHub
☆16Feb 19, 2026Updated 5 months ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
marcsous / pdff
View on GitHub
Proton density fat fraction calculation for MRI
☆12Jul 2, 2025Updated last year
pierrefdz / stable_signature
View on GitHub
Please go to https://github.com/facebookresearch/stable_signature
☆14Jul 26, 2023Updated 2 years ago
cyjie429 / RegO
View on GitHub
Region-Based Optimization in Continual Learning for Audio Deepfake Detection
☆14Dec 17, 2024Updated last year
LASP-UCL / Graph-RL
View on GitHub
Graph-based Reinforcement Learning
☆16Jul 9, 2018Updated 8 years ago
FilippoMB / Variational-Graph-Auto-encoders-Tensorflow-2-Spektral-
View on GitHub
Implementation of the Variational Graph Auto-encoder in Spektral (Tensorflow-Keras)
☆14Mar 15, 2025Updated last year
zexupan / reentry
View on GitHub
☆18Nov 22, 2024Updated last year
welcheb / FattyRiot
View on GitHub
FattyRiot algorithm for separation of fat and water magnetic resonance images
☆14Nov 5, 2015Updated 10 years ago