Dynamic and static models for real-time facial emotion recognition
☆180Aug 2, 2024Updated last year
Alternatives and similar repositories for EMO-AffectNetModel
Users that are interested in EMO-AffectNetModel are comparing it to the libraries listed below
Sorting:
- Read articles, explore effectiveness metrics for speech enhancement methodologies. Seamlessly integrate code implementations for better u…☆26Apr 19, 2024Updated last year
- Official implementation of the NeurIPS2023 paper: Leave No Stone Unturned: Mine Extra Knowledge for Imbalanced Facial Expression Recognit…☆32Oct 30, 2023Updated 2 years ago
- Cross-Speaker Encoding Network for Multi-talker Speech Recognition☆11Mar 14, 2025Updated 11 months ago
- FG 2024 Papers: Explore a comprehensive collection of research papers presented at one of the premier conferences on automatic face and g…☆16May 18, 2024Updated last year
- ☆14May 22, 2018Updated 7 years ago
- Algorithms for Intelligent Assessment of Human Personality Traits based on His Multimodal Data for ranking potential candidates to perfo…☆57Dec 5, 2025Updated 3 months ago
- arxiv daily for speech translation, legal. Ref: Vincentqyw/cv-arxiv-daily☆15Jan 6, 2025Updated last year
- Efficient face emotion recognition in photos and videos☆983Sep 26, 2025Updated 5 months ago
- This repository provides the codes for MMA-DFER: multimodal (audiovisual) emotion recognition method. This is an official implementation …☆50Sep 16, 2024Updated last year
- Russian phonetical transcription☆11Nov 19, 2025Updated 3 months ago
- Learning an Interpretable End-to-End Network for Real-Time Acoustic Beamforming☆15Aug 20, 2024Updated last year
- We propose C2SER, a novel audio-language model designed to enhance the stability and accuracy of speech emotion recognition through conte…☆43Mar 3, 2025Updated last year
- MSP-Podcast Challenge Baseline Code for Interspeech 2025☆28Dec 4, 2024Updated last year
- Project to infere emotional expressions and benchmark datasets by Niklas Wagner, Felix Mätzler, Samed R. Vossberg, Helen Schneider and Sv…☆28Mar 7, 2025Updated 11 months ago
- ☆13Nov 22, 2022Updated 3 years ago
- ☆11Nov 7, 2024Updated last year
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …☆28Nov 7, 2025Updated 3 months ago
- [ACM-MM 2025 Workshop] More Is Better: A MoE-Based Emotion Recognition Framework with Human Preference Alignment.☆25Nov 25, 2025Updated 3 months ago
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder☆12Mar 11, 2025Updated 11 months ago
- YSC 2023 Papers: A complete collection of research papers, code and data from the International Young Scientists Conference 2023 for youn…☆12Jan 17, 2024Updated 2 years ago
- ☆40Jul 15, 2025Updated 7 months ago
- Neural model for prediction of stress position in Russian words☆13Jun 22, 2025Updated 8 months ago
- ☆18Mar 17, 2025Updated 11 months ago
- ☆15Nov 11, 2024Updated last year
- ☆14Jun 16, 2023Updated 2 years ago
- The implementation for "Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System".☆30Aug 2, 2025Updated 7 months ago
- A unified framework for Low-resource Audio Processing and Evaluation (SSL Pre-training and Downstream Fine-tuning)☆29Jul 9, 2024Updated last year
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model☆13Mar 30, 2025Updated 11 months ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆18Oct 2, 2024Updated last year
- A simple command line tool to calculate WER for ASR.☆14Oct 14, 2024Updated last year
- MAE-DFER: Efficient Masked Autoencoder for Self-supervised Dynamic Facial Expression Recognition (ACM MM 2023)☆144Nov 16, 2025Updated 3 months ago
- A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation☆36Jan 17, 2024Updated 2 years ago
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"☆21Jun 7, 2025Updated 8 months ago
- A Weakly Supervised Forced Alignment for disluent speech☆15Nov 12, 2023Updated 2 years ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆15May 16, 2025Updated 9 months ago
- Source code and speech samples for the DSU-AVO paper accepted to INTERSPEECH 2023☆12May 13, 2024Updated last year
- ICASSP 2023: "Recursive Joint Attention for Audio-Visual Fusion in Regression Based Emotion Recognition"☆14Nov 29, 2024Updated last year
- ☆16Nov 9, 2023Updated 2 years ago
- Guided Interpretable Facial Expression Recognition via Spatial Action Unit Cues☆16Sep 16, 2024Updated last year