my-yy / sl_icmr2022Links

Code for "Self-Lifting: A Novel Framework For Unsupervised Voice-Face Association Learning,ICMR,2022"

☆13

Alternatives and similar repositories for sl_icmr2022

Users that are interested in sl_icmr2022 are comparing it to the libraries listed below

Sorting:

ms-dot-k / Visual-Audio-Memory
PyTorch implementation of "Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video" (ICCV2021)
☆20Updated 3 years ago
praveena2j / Cross-Attentional-AV-Fusion
FG2021: Cross Attentional AV Fusion for Dimensional Emotion Recognition
☆32Updated 8 months ago
katerynaCh / MMA-DFER
This repository provides the codes for MMA-DFER: multimodal (audiovisual) emotion recognition method. This is an official implementation …
☆38Updated 10 months ago
Exgc / OpenSR
The official implementation of OpenSR (ACL2023 Oral)
☆15Updated last year
Cocoxili / CMPC
[IJCAI2022] Unsupervised Voice-Face Representation Learning by Cross-Modal Prototype Contrast
☆20Updated last year
shanwangshan / TAU-urban-audio-visual-scenes
☆10Updated 3 years ago
KID-7391 / seeking-the-shape-of-sound
☆18Updated 4 years ago
ms-dot-k / Multi-head-Visual-Audio-Memory
PyTorch implementation of "Distinguishing Homophenes using Multi-Head Visual-Audio Memory" (AAAI2022)
☆27Updated last year
Vincent-ZHQ / DMER
A survey of deep multimodal emotion recognition.
☆53Updated 3 years ago
yunyikristy / global_local
☆14Updated 3 years ago
praveena2j / JointCrossAttentional-AV-Fusion
ABAW3 (CVPRW): A Joint Cross-Attention Model for Audio-Visual Fusion in Dimensional Emotion Recognition
☆46Updated last year
YapengTian / CCOL-CVPR21
Cyclic Co-Learning of Sounding Object Visual Grounding and Sound Separation
☆25Updated 3 years ago
scutcsq / DWFormer
DWFormer: Dynamic Window Transformer for Speech Emotion Recognition(ICASSP 2023 Oral)
☆61Updated last year
HappyColor / SpeechFormer2
SpeechFormer++ in PyTorch
☆48Updated 2 years ago
Janie1996 / AV4SER
PyTorch implementation for Audio-Visual Domain Adaptation Feature Fusion for Speech Emotion Recognition
☆12Updated 3 years ago
LUMIA-Group / Leveraging-Self-Supervised-Learning-for-AVSR
Official PyTorch implementation of paper Leveraging Unimodal Self Supervised Learning for Multimodal Audio-Visual Speech Recognition (ACL…
☆66Updated 3 years ago
ms-dot-k / LRW_ID
The speaker-labeled information of LRW dataset, which is the outcome of the paper "Speaker-adaptive Lip Reading with User-dependent Paddi…
☆10Updated last year
DanielMengLiu / AudioVisualLip
☆23Updated last year
msaadsaeed / FOP
Official implementation of FOP method as described in "Fusion and Orthogonal Projection for Improved Face-Voice Association"
☆19Updated last year
ahaliassos / raven
Official implementation of RAVEn (ICLR 2023) and BRAVEn (ICASSP 2024)
☆67Updated 5 months ago
praveena2j / RecurrentJointAttentionwithLSTMs
ICASSP 2023: "Recursive Joint Attention for Audio-Visual Fusion in Regression Based Emotion Recognition"
☆14Updated 8 months ago
mispchallenge / MISP-ICME-AVSR
☆16Updated last year
stoneMo / EZ-VSL
Official Codebase of "Localizing Visual Sounds the Easy Way" (ECCV 2022)
☆35Updated 2 years ago
zcxu-eric / AVA-AVD
☆45Updated 2 years ago
fuankarion / active-speakers-context
Code for the Active Speakers in Context Paper (CVPR2020)
☆54Updated 4 years ago
Jiang-Yidi / TS-TalkNet
INTERSPEECH2023: Target Active Speaker Detection with Audio-visual Cues
☆52Updated 2 years ago
Strong-AI-Lab / emotion
Emotion Recognition ToolKit (ERTK): tools for emotion recognition. Dataset processing, feature extraction, experiments,
☆56Updated 9 months ago
joannahong / AV-RelScore
Audio-Visual Corruption Modeling of our paper "Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling an…
☆34Updated 2 years ago
prajwalkr / vtp
Official Implementation of Visual Transformer Pooling for Lip reading
☆40Updated 3 years ago
GeWu-Lab / MMCosine_ICASSP23
The code repo for ICASSP 2023 Paper "MMCosine: Multi-Modal Cosine Loss Towards Balanced Audio-Visual Fine-Grained Learning"
☆21Updated 2 years ago