The repo contains an audio emotion detection model, a facial emotion detection model, and a model that combines the two to predict emotions from a video.
☆95 · Sep 13, 2023 · Updated 2 years ago
Alternatives and similar repositories for Video-Audio-Face-Emotion-Recognition
Users that are interested in Video-Audio-Face-Emotion-Recognition are comparing it to the libraries listed below.
- This repository provides the code for MMA-DFER, a multimodal (audiovisual) emotion recognition method. This is an official implementation … ☆57 · Sep 16, 2024 · Updated last year
- PyTorch implementation for Audio-Visual Domain Adaptation Feature Fusion for Speech Emotion Recognition ☆12 · Mar 20, 2022 · Updated 4 years ago
- Explore the world of non-verbal communication like never before with our Body Language Detection solution. Utilizing the advanced capabil… ☆17 · Sep 24, 2023 · Updated 2 years ago
- Multimodal sentiment analysis ☆26 · Jul 17, 2023 · Updated 2 years ago
- Multimodal Emotion Recognition in Conversation Challenge (CCAC 2023) ☆38 · May 10, 2024 · Updated 2 years ago
- IEEE T-BIOM: "Audio-Visual Fusion for Emotion Recognition in the Valence-Arousal Space Using Joint Cross-Attention" ☆47 · Nov 29, 2024 · Updated last year
- [ECCV2022] The official repository of Emotion-aware Multi-view Contrastive Learning for Facial Emotion Recognition ☆25 · Aug 21, 2023 · Updated 2 years ago
- This API utilizes a pre-trained model for emotion recognition from audio files. It accepts audio files as input, processes them using the… ☆15 · Apr 23, 2024 · Updated 2 years ago
- Automatically generate a subtitle file for any type of audio or video, using Python and the Google Speech-to-Text API. ☆13 · May 15, 2020 · Updated 6 years ago
- We present a study of a neural network based method for speech emotion recognition, using audio-only features. In the studied scheme, the… ☆11 · Jul 24, 2024 · Updated last year
- Expressive TTS Dataset for Assamese, Bengali, and Tamil. ☆15 · Mar 6, 2025 · Updated last year
- A Fully End2End Multimodal System for Fast Yet Effective Video Emotion Recognition ☆39 · Aug 12, 2024 · Updated last year
- Natural language processing in TensorFlow ☆15 · Apr 11, 2022 · Updated 4 years ago
- Natural Language to SQL using Google's Gemini Pro Model ☆12 · Dec 27, 2023 · Updated 2 years ago
- Audio-Visual Speech Recognition ☆26 · Jul 7, 2025 · Updated 11 months ago
- ☆13 · Dec 28, 2024 · Updated last year
- Official repo of the paper Deep Regression Unlearning accepted in ICML 2023 ☆16 · Jun 14, 2023 · Updated 3 years ago
- Top-tier conference papers on out-of-distribution detection ☆11 · Jun 22, 2023 · Updated 2 years ago
- Huggingface Implementation of AV-HuBERT on the MuAViC Dataset ☆19 · Mar 6, 2025 · Updated last year
- Defect detection is a widely applied problem in image processing. Based on a research project, this work uses an image sensor mounted on a drone to capture pictures of shelving inside a factory, then identifies defects such as loose screws through image registration and comparison, so that hazards can be prevented. Public datasets may also be used; the focus is on the algorithm, and the software platform it runs on is not restricted. ☆15 · Feb 20, 2024 · Updated 2 years ago
- This repo contains a list of questions to practice SQL with the Sakila Database. ☆10 · Jul 29, 2022 · Updated 3 years ago
- Alzheimer's prediction system deployed on streamlit that employs Logistic Regression to classify whether a person is prone to having Alzh… ☆15 · Dec 5, 2023 · Updated 2 years ago
- Classification of Fundus Images into 5 stages of Diabetic Retinopathy, and segmentation of blood vessels in fundus images ☆19 · Sep 18, 2023 · Updated 2 years ago
- Fashion Clothes Generation Using GANs ☆13 · Jun 9, 2021 · Updated 5 years ago
- Anuj's Portfolio ☆14 · Aug 25, 2025 · Updated 9 months ago
- MAE-DFER: Efficient Masked Autoencoder for Self-supervised Dynamic Facial Expression Recognition (ACM MM 2023) ☆149 · Nov 16, 2025 · Updated 7 months ago
- Floating piano keys in the air. Bring your finger to the keys you want to play, click them, and make your own beautiful melodies. ☆13 · Jan 22, 2022 · Updated 4 years ago
- ☆224 · Apr 26, 2026 · Updated last month
- A pipeline to read lips and generate speech for the read content, i.e. Lip to Speech Synthesis. ☆93 · Jul 23, 2025 · Updated 10 months ago
- ☆30 · Jun 2, 2025 · Updated last year
- [IEEE ICPRS 2024 Oral] TensorFlow code implementation of "MultiMAE-DER: Multimodal Masked Autoencoder for Dynamic Emotion Recognition" ☆19 · Mar 13, 2026 · Updated 3 months ago
- An interactive visualization platform for learning data structures. ☆10 · May 7, 2023 · Updated 3 years ago
- This branch of Asteroid contains code for the vocal harmony and chamber ensemble separation related papers. ☆12 · Nov 7, 2024 · Updated last year
- PyTorch and NNsight implementation of AtP* (Kramar et al 2024, DeepMind) ☆20 · Jan 19, 2025 · Updated last year
- A project between Anyscale and deepsense.ai implementing a cross-modal search application for e-commerce ☆13 · Jun 5, 2024 · Updated 2 years ago
- ☆14 · Sep 29, 2025 · Updated 8 months ago
- A FastAPI-based API example that integrates user management and real-time multi-user chat. ☆14 · Feb 14, 2023 · Updated 3 years ago
- Custom FastAPI boilerplate with Piccolo ORM, JWT auth, config file and Hashicorp Vault support ☆19 · Sep 13, 2022 · Updated 3 years ago
- Adaptive Multimodal Reasoning via Reinforcement Learning ☆23 · Jan 11, 2026 · Updated 5 months ago