π Pre-process, annotate, evaluate, and train your Affect Computing (e.g., Multimodal Emotion Recognition, Sentiment Analysis) datasets ALL within MER-Factory! (LangGraph Based Agent Workflow)
β99Mar 13, 2026Updated 2 months ago
Alternatives and similar repositories for MER-Factory
Users that are interested in MER-Factory are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- β21Jun 26, 2025Updated 10 months ago
- β27Oct 16, 2025Updated 7 months ago
- Toolkits for Multimodal Emotion Recognitionβ309Apr 23, 2026Updated last month
- we propose FlexEdit, an end-to-end image editing method that leverages both free-shape masks and language instructions for Flexible Editiβ¦β32Aug 22, 2024Updated last year
- (NeXD @ CVPR 2025) Why We Feel: Breaking Boundaries in Emotional Reasoning with Multimodal Large Language Modelsβ29Sep 30, 2025Updated 7 months ago
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Awesome papers for affective computing with llm and mllmβ24Nov 26, 2025Updated 5 months ago
- source code for "Towards Speaker-Unknown Emotion Recognition in Conversation via Progressive Contrastive Deep Supervision"β11Nov 22, 2024Updated last year
- Source code for EAC-Net in Theano/Pytorch/Tensorflowβ20Jan 16, 2018Updated 8 years ago
- β30Jun 9, 2025Updated 11 months ago
- The official code of paper "Multi-to-Single: Reducing Multimodal Dependency in Emotion Recognition through Contrastive Learning" (AAAI 20β¦β33Sep 30, 2025Updated 7 months ago
- Text Proxy: Decomposing Retrieval from a 1-to-N Relationship into N 1-to-1 Relationships for Text-Video Retrieval -- AAAI2025β19May 8, 2026Updated 2 weeks ago
- The first comprehensive multimodal language analysis benchmark for evaluating foundation modelsβ31Sep 22, 2025Updated 8 months ago
- This paper presents our winning submission to Subtask 2 of SemEval 2024 Task 3 on multimodal emotion cause analysis in conversations.β24Aug 2, 2024Updated last year
- [CVPR2024] Official implementation of the paper: Skeleton-in-Context: Unified Skeleton Sequence Modeling with In-Context Learningβ40Aug 15, 2025Updated 9 months ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- β45Jun 27, 2022Updated 3 years ago
- β15Nov 11, 2024Updated last year
- Goodness of Pronunciation algorithm using PyKaldiβ18Jun 12, 2022Updated 3 years ago
- [Communications Medicine] "Efficient deep learning-based automated diagnosis from echocardiography with contrastive self-supervised learnβ¦β20Jul 8, 2024Updated last year
- Bilateral Cross-Modality Graph Matching Attention for Feature Fusion in Visual Question Answeringβ11Feb 16, 2023Updated 3 years ago
- Accompany code to reproduce the baselines of the International Multimodal Sentiment Analysis Challenge (MuSe 2020).β16Dec 8, 2022Updated 3 years ago
- Auto-KWS 2021 Challenge 1st place solution.β11Jul 20, 2021Updated 4 years ago
- An original package of the dynamic compressive gammachirp filterbank (dcGC-FB)β14Oct 27, 2024Updated last year
- β14Oct 12, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- This repository contains code for AAAI2025 paper "Dense Audio-Visual Event Localization under Cross-Modal Consistency and Multi-Temporal β¦β24Aug 18, 2025Updated 9 months ago
- Add Rain Streak Mask On Unparied Image Using GANβ10Sep 12, 2020Updated 5 years ago
- ICML-2024 highlight paper "Realistic Unsupervised CLIP Fine-tuning with Universal Entropy Optimization"β19Jul 18, 2024Updated last year
- Csenet: Complex Squeeze-and-Excitation Network for Speech Depression Level Prediction (ICASSP 2022)β14Jun 23, 2022Updated 3 years ago
- 2020εΉ΄δΊθη½+ζΉθ¨θ½¬ζ’β14Nov 2, 2020Updated 5 years ago
- We propose MMAD, a novel automated pipeline for precise AD generation. MMAD introduces ambient music alongside visual and linguistic, enhβ¦β17Dec 31, 2024Updated last year
- Thermal Indoor Motion Datasetβ17Apr 27, 2023Updated 3 years ago
- A python package of robust and effective defogging/dehazing methodβ15Dec 30, 2018Updated 7 years ago
- [NeurIPS2023] LoRA: A Logical Reasoning Augmented Dataset for Visual Question Answeringβ13Jan 5, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean β’ AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- β19Jun 28, 2022Updated 3 years ago
- Code and dataset for NAACL 2022 paper "CoSIm: Commonsense Reasoning for Counterfactual Scene Imagination" Hyounghun Kim, Abhay Zala, Mohiβ¦β16Nov 26, 2022Updated 3 years ago
- β15Apr 4, 2025Updated last year
- A summarization of zero-shot image recognition methods, in the perspective of element-wise representation and reasoning , covering publicβ¦β21Oct 12, 2024Updated last year
- β13Mar 25, 2021Updated 5 years ago
- Multimodal Genuine Emotion and Expression Detection databaseβ12Jul 15, 2024Updated last year
- Python (pip) package for fitting mixtures of Student's t-distributions using either maximum likelihood (EM) or Bayesian methodology (variβ¦β11Sep 23, 2025Updated 8 months ago