Expression Snippet Transformer for Robust Video-based Facial Expression Recognition
☆17Jan 27, 2024Updated 2 years ago
Alternatives and similar repositories for EST
Users that are interested in EST are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Pose-disentangled Contrastive Learning☆14Jan 27, 2024Updated 2 years ago
- a fully open-source implementation of a GPT-4o-like speech-to-speech video understanding model.☆38Apr 7, 2025Updated last year
- ☆65Sep 26, 2022Updated 3 years ago
- PyTorch Implementation of "Divide, Conquer and Combine: A Training-Free Framework for High-Resolution Image Perception in Multimodal Larg…☆48Mar 2, 2026Updated 3 months ago
- MAE-DFER: Efficient Masked Autoencoder for Self-supervised Dynamic Facial Expression Recognition (ACM MM 2023)☆148Nov 16, 2025Updated 6 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- CASME II: An Improved Spontaneous Micro-Expression Database and the Baseline Evaluation☆10Oct 19, 2018Updated 7 years ago
- Norface: Improving Facial Expression Analysis by Identity Normalization, ECCV 2024☆37Dec 20, 2024Updated last year
- [BMVC'23] Prompting Visual-Language Models for Dynamic Facial Expression Recognition☆145Nov 21, 2024Updated last year
- Code for the BEEU challenge winning paper.☆21Sep 5, 2022Updated 3 years ago
- ☆18Mar 30, 2026Updated 2 months ago
- ☆13Jan 2, 2023Updated 3 years ago
- The implementation codes of paper: Multimodal Sentiment Analysis with Mutual Information-based Disentangled Representation Learning☆21May 8, 2025Updated last year
- Pytorch implementation for codes in Noise Imitation Based Adversarial Training for Robust Multimodal Sentiment Analysis (Accepted by IEEE…☆15Feb 2, 2024Updated 2 years ago
- [NeurIPS 2025] Watch and Listen: Understanding Audio-Visual-Speech Moments with Multimodal LLM☆25Feb 10, 2026Updated 4 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [CVPR 2023] Code for "Learning Emotion Representations from Verbal and Nonverbal Communication"☆54Feb 19, 2025Updated last year
- EmojiCrypt: Prompt Encryption for Secure Communication with Large Language Models☆25Feb 21, 2024Updated 2 years ago
- Code for Retrieval-Augmented Perception (ICML 2025)☆71Apr 22, 2026Updated last month
- PyTorch使用技巧和教程☆12Apr 17, 2023Updated 3 years ago
- Generating optical flow frame by using TVL1 algorithm in organized way is not so difficult. But for a newbie it is toilsome to find this …☆13Dec 25, 2020Updated 5 years ago
- Coronary Artery segmentation using different CNN models☆15Apr 8, 2019Updated 7 years ago
- The official implementation of InfoRM [NeurIPS 2024].☆15Oct 25, 2025Updated 7 months ago
- Building a real-time environment using webcam frame division in OpenCV and classify cropped images using a fine-tuned vision transformers…☆20Apr 24, 2025Updated last year
- Dateset Reset Policy Optimization☆31Apr 12, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆14Oct 10, 2024Updated last year
- ☆23Dec 2, 2025Updated 6 months ago
- Code for paper "MIR-GAN: Refining Frame-Level Modality-Invariant Representations with Adversarial Network for Audio-Visual Speech Recogni…☆16Jun 21, 2023Updated 2 years ago
- [Information Fusion 2024] HiCMAE: Hierarchical Contrastive Masked Autoencoder for Self-Supervised Audio-Visual Emotion Recognition☆121Aug 29, 2025Updated 9 months ago
- ☆17Jun 11, 2024Updated 2 years ago
- Code for MInD: Multimodal Information Disentanglement☆20Jun 3, 2026Updated last week
- StressID - a Multimodal Dataset for Stress Identification☆27Dec 5, 2025Updated 6 months ago
- [CVPR'24] Official implementation of our paper "Self-Supervised Facial Representation Learning with Facial Region Awareness"☆15Mar 8, 2024Updated 2 years ago
- Official repo and evaluation implementation of KnowRecall and VisRecall☆10May 22, 2025Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Codebase for TF-Mamba: Text-enhanced Fusion Mamba with Missing Modalities for Robust Multimodal Sentiment Analysis. The code has been reo…☆35May 27, 2025Updated last year
- Official implementation for paper "CEPrompt: Cross-Modal Emotion-Aware Prompting for Facial Expression Recognition" (accepted to IEEE TC…☆17Oct 20, 2025Updated 7 months ago
- Spatial Aptitude Training for Multimodal Langauge Models☆33Feb 8, 2026Updated 4 months ago
- Code for MARU-Net: Multi-Scale Attention Gated Residual U-Net With Contrastive Loss for SAR-Optical Image Matching, published in https://…☆20May 23, 2023Updated 3 years ago
- [NeurIPS 2025] TopoPoint: Enhance Topology Reasoning via Endpoint Detection in Autonomous Driving☆37Dec 13, 2025Updated 5 months ago
- Toolkits for Multimodal Emotion Recognition☆318Jun 5, 2026Updated last week
- a novel two-branch MER paradigm☆64Nov 10, 2023Updated 2 years ago