DmitryRyumin / FG-2024-PapersLinks
FG 2024 Papers: Explore a comprehensive collection of research papers presented at one of the premier conferences on automatic face and gesture recognition. Seamlessly integrate code implementations for better understanding. ⭐ Experience the cutting edge of progress in facial analysis, gesture recognition, and biometrics with this repository!
☆14Updated last year
Alternatives and similar repositories for FG-2024-Papers
Users that are interested in FG-2024-Papers are comparing it to the libraries listed below
Sorting:
- Challenges in Video-Based Infant Action Recognition: A Critical Examination of the State of the Art (WACVW'24)☆14Updated last year
- [Interspeech 2024] SyncVSR: Data-Efficient Visual Speech Recognition with End-to-End Crossmodal Audio Token Synchronization☆54Updated 3 months ago
- code repo for LoCoNet: Long-Short Context Network for Active Speaker Detection☆36Updated 2 years ago
- [NeurIPS 2022] The official repository of Expression Learning with Identity Matching for Facial Expression Recognition☆42Updated last year
- python scripts for crawling original image from Google Images☆22Updated 3 years ago
- Official implementation of the CVPR 2024 paper "FSRT: Facial Scene Representation Transformer for Face Reenactment from Factorized Appear…☆113Updated 9 months ago
- Implementation of the paper: "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning" in pytorch☆13Updated 2 weeks ago
- GANalyzer: Analysis and Manipulation of GANs Latent Space for Controllable Face Synthesis☆38Updated last year
- Graph learning framework for long-term video understanding☆63Updated 2 weeks ago
- Multimodal Empathetic Chatbot☆39Updated 11 months ago
- [ECCV 2024] Dyadic Interaction Modeling for Social Behavior Generation☆56Updated 2 months ago
- Edge Weight Prediction For Category-Agnostic Pose Estimation☆41Updated last month
- Whisper-Flamingo [Interspeech 2024] and mWhisper-Flamingo [IEEE SPL 2025] for Audio-Visual Speech Recognition and Translation☆171Updated last month
- Preprocessing Scipts for Talking Face Generation☆88Updated 5 months ago
- The official implementation of the paper "Affective Faces for Goal-Driven Dyadic Communication."☆14Updated 2 years ago
- This repo contains the official PyTorch implementation of AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image …☆84Updated last year
- 📖 A curated list of resources dedicated to avatar.☆59Updated 7 months ago
- [CVPR 2024] Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners☆148Updated 11 months ago
- Pre-trained model weights of MAE-Face.☆32Updated last year
- A curated list of video-text datasets in a variety of languages. These datasets can be used for video captioning (video description) or v…☆36Updated last year
- A cleaned version of IMDB-WIKI dataset for facial age estimation.☆48Updated last year
- [ICIAP 2023] Learning Landmarks Motion from Speech for Speaker-Agnostic 3D Talking Heads Generation☆62Updated last year
- [AAAI 2025] VQTalker: Towards Multilingual Talking Avatars through Facial Motion Tokenization☆49Updated 6 months ago
- Offical code for the CVPR 2024 Paper: Separating the "Chirp" from the "Chat": Self-supervised Visual Grounding of Sound and Language☆80Updated last year
- GPT-4V with Emotion☆93Updated last year
- Official code for the paper "GestSync: Determining who is speaking without a talking head" published at BMVC 2023☆46Updated 9 months ago
- A Versatile Face Encoder for Zero-Shot Diffusion Model Personalization☆24Updated 3 weeks ago
- ☆32Updated 2 years ago
- This repo is used for recording and tracking some Multi-modal Body Language researchs,In this work, we present the first detailed survey …☆19Updated last year
- [AAAI 2024] stle2talker - Official PyTorch Implementation☆43Updated last year