Paper List
☆18Jul 2, 2025Updated 11 months ago
Alternatives and similar repositories for Emotion-Recognition
Users that are interested in Emotion-Recognition are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICANN 2023] Anomaly-Based Insider Threat Detection via Hierarchical Information Fusion☆18Nov 20, 2023Updated 2 years ago
- [NAACL 2024] Better Zero-Shot Reasoning with Role-Play Prompting☆36Nov 14, 2023Updated 2 years ago
- [IEEE TASLP] Retrieval-Augmented MOS Prediction with Prior Knowledge Integration☆33Mar 23, 2025Updated last year
- Dataset [ACL 2026]☆33Jul 31, 2025Updated 10 months ago
- ☆45Apr 2, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [NCMMSC]☆16Feb 19, 2025Updated last year
- [AAAI 2026 & ACL 2026] The official implementation of the DIFFA series for dLLM-based large audio language model☆81Apr 7, 2026Updated 2 months ago
- VisionGRU: A Linear-Complexity RNN Model for Efficient Image Analysis☆13Dec 26, 2024Updated last year
- 【CVPR 2026 Finding】Official Repo for Paper ‘’Heartcare Suite: A Unified Multimodal ECG Suite for Dual Signal-Image Modeling and Understan…☆32Feb 24, 2026Updated 3 months ago
- The codes for the paper of "A particle swarm optimization-based flexible convolutional auto-encoder for image classification" published b…☆10Jul 21, 2020Updated 5 years ago
- An official implementation of "Distribution-Consistent Modal Recovering for Incomplete Multimodal Learning" in PyTorch. (ICCV 2023)☆37Sep 28, 2023Updated 2 years ago
- ☆37Updated this week
- Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition"☆10Mar 15, 2023Updated 3 years ago
- A repository used to organize content related to Large Speech(Audio) Model, including paper, data, applications, tools and so on.☆28Nov 8, 2025Updated 7 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [JMLR] Gradual Domain Adaptation: Theory and Algorithms☆11Jan 14, 2025Updated last year
- CoNeTTE: An efficient Audio Captioning system leveraging multiple datasets with Task Embedding☆23Dec 17, 2025Updated 5 months ago
- ☆22Dec 17, 2024Updated last year
- Accompanying code for our paper "Point Cloud Audio Processing"☆18Jul 1, 2021Updated 4 years ago
- Speech emotion recognition using LSTM, SVM and MLP | 语音情感识别☆10Jul 1, 2019Updated 6 years ago
- ☆13Apr 2, 2025Updated last year
- This project is the official implementation of ``Self-Supervised Graph Neural Network for Multi-Source Domain Adaptation'' in PyTorch, wh…☆12Nov 4, 2022Updated 3 years ago
- Code release for Unsupervised Domain Adaptation via Distilled Discriminative Clustering published by Pattern Recognition in 2022☆11May 19, 2023Updated 3 years ago
- Audio Processing & Visualization Concepts☆12Jun 20, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ATTENTION AGGREGATION NETWORK FOR AUDIO-VISUAL EMOTION RECOGNITION☆13Sep 25, 2023Updated 2 years ago
- A Collection of Papers on Diffusion Large Language Models☆47May 12, 2026Updated 3 weeks ago
- ☆26Nov 6, 2025Updated 7 months ago
- The official implementation of the paper "Affective Faces for Goal-Driven Dyadic Communication."☆15Jan 27, 2023Updated 3 years ago
- python爬虫☆16Jan 10, 2024Updated 2 years ago
- Inference code for Interspeech 2025 paper, "LSCodec: Low-Bitrate and Speaker-Decoupled Discrete Speech Codec"☆36Oct 23, 2025Updated 7 months ago
- Multi-Task Speech classification of accent and gender of an english speaker on Mozilla's common voice dataset☆28May 30, 2025Updated last year
- Code for paper "Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition"