rishiswethan / Video-Audio-Face-Emotion-Recognition
The repo contains an audio emotion detection model, a facial emotion detection model, and a model that combines the two to predict emotions from a video.
☆92Sep 13, 2023Updated 2 years ago
Alternatives and similar repositories for Video-Audio-Face-Emotion-Recognition
Users that are interested in Video-Audio-Face-Emotion-Recognition are comparing it to the libraries listed below
- This repository provides the code for MMA-DFER, a multimodal (audiovisual) emotion recognition method. This is an official implementation …☆50Sep 16, 2024Updated last year
- This repository provides an implementation for the paper "Self-attention fusion for audiovisual emotion recognition with incomplete data".☆158Sep 16, 2024Updated last year
- [ECCV2022] The official repository of Emotion-aware Multi-view Contrastive Learning for Facial Emotion Recognition☆24Aug 21, 2023Updated 2 years ago
- Anuj's Portfolio☆11Aug 25, 2025Updated 5 months ago
- Welcome to the repository for my personal portfolio website! This website is where I showcase my best work…☆15Mar 13, 2025Updated 11 months ago
- This is a plugin for Premiere Pro that provides an automated way to update timecodes / start times of media (clips) in your projects.☆10Jul 1, 2024Updated last year
- Photorealism model using RealVisXL v4.0☆12Feb 20, 2024Updated last year
- Frequency tracking in time-frequency representations☆13Jan 19, 2021Updated 5 years ago
- This Flask web application extracts text data from photos of documents such as Aadhaar card, PAN card, driving licence, bank cheque and other…☆11Dec 2, 2020Updated 5 years ago
- Official Codebase of "Localizing Visual Sounds the Easy Way" (ECCV 2022)☆39Oct 2, 2022Updated 3 years ago
- [NeurIPS 2022] The official repository of Expression Learning with Identity Matching for Facial Expression Recognition☆44Nov 29, 2023Updated 2 years ago
- Exploring advanced prompting tools to query SQL database with multiple tables in natural language using LLMs☆16Aug 23, 2024Updated last year
- We present a study of a neural network based method for speech emotion recognition, using audio-only features. In the studied scheme, the…☆11Jul 24, 2024Updated last year
- Batch document loader into Quivr (https://github.com/StanGirard/quivr)☆14Jun 25, 2023Updated 2 years ago
- ☆15Apr 7, 2024Updated last year
- Explore from keyword search to dense retrieval and reranking, which injects the intelligence of LLMs into your search system, making it f…☆14Aug 27, 2023Updated 2 years ago
- LLM-guided hyperparameter tuning☆10Oct 7, 2023Updated 2 years ago
- WebXR hand input in Three.JS example☆15Mar 22, 2025Updated 10 months ago
- ☆12Nov 22, 2022Updated 3 years ago
- This repository contains the official code for "Flexible Biometrics Recognition: Bridging the Multimodality Gap through Attention, Alignm…☆12Oct 9, 2024Updated last year
- Guided meditation assistant, using scheduled messages with LLaMA☆10Nov 28, 2024Updated last year
- The Qur'an packaged in the form of a chatbot☆15Dec 1, 2020Updated 5 years ago
- Python FastApi "Circuit Breaker" implementation☆12Mar 14, 2025Updated 10 months ago
- Using multimodal RAG to query text, images, and tables from PDFs for QA☆14Jan 21, 2024Updated 2 years ago
- ☆13Apr 25, 2025Updated 9 months ago
- Change Detection towards Bitemporal Quality Difference via Hierarchical Correlation Distillation☆10Apr 30, 2024Updated last year
- A web based ADB command menu☆12May 29, 2024Updated last year
- Official implementation of "Diffusion models meet image counter-forensics"☆11Jan 22, 2024Updated 2 years ago
- Dynamic Neural Representational Decoders for High-Resolution Semantic Segmentation☆19Nov 28, 2022Updated 3 years ago
- Delving into the Continuous Domain Adaptation (ACM MM22)☆12Jul 10, 2022Updated 3 years ago
- Graph Convolutional Module for Temporal Action Localization in Videos☆10Jul 4, 2020Updated 5 years ago
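- This repository mainly describes the implementation of the 3rd-place entry in the CCAC2023 multimodal conversational emotion recognition evaluation☆11Aug 11, 2024Updated last year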
- Official Vegh Repo☆11Apr 28, 2022Updated 3 years ago
- The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️☆10Dec 3, 2023Updated 2 years ago
- Flask app for OCR and parsing of a photo of a restaurant receipt.☆13Dec 7, 2022Updated 3 years ago
- This repository contains the speaker labeled information of VoxCeleb2 and LRS3 audio-visual datasets. (AAAI 2025)☆12Sep 6, 2024Updated last year
- MOVIO - Online Virtual Exhibitions☆15Nov 23, 2020Updated 5 years ago
- Calculation of the entropy of a batch of images (whole image or patches)☆10Oct 15, 2021Updated 4 years ago
- Multimodal SER Model meant to be trained on recognising emotions from speech (text + acoustic data). Fine-tuned the DeBERTaV3 model, resp…☆11Jun 19, 2024Updated last year