☆28Dec 14, 2025Updated 3 months ago
Alternatives and similar repositories for MPDD
Users that are interested in MPDD are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆35Mar 18, 2026Updated last week
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model☆13Mar 30, 2025Updated 11 months ago
- [ACM-MM 2025 Workshop] More Is Better: A MoE-Based Emotion Recognition Framework with Human Preference Alignment.☆25Nov 25, 2025Updated 4 months ago
- ☆23Jan 29, 2026Updated last month
- MMER☆16Jan 8, 2026Updated 2 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆11Feb 14, 2025Updated last year
- [ICASSP2024] Code for paper "SDIF-DA: A Shallow-to-Deep Interaction Framework with Data Augmentation for Multi-modal Intent Detection"☆15Jul 6, 2024Updated last year
- Example of application of genetic algorithm for evolution kart navigation.☆11Nov 21, 2019Updated 6 years ago
- The final coursework for AI in Mental Health @ PKU.☆19Jan 5, 2024Updated 2 years ago
- Dimensional estimation of emotions (Arousal, Valence, Intensity) from facial landmarks extracted by DLIB.☆28Jan 14, 2026Updated 2 months ago
- This GitHub provides the source code for the paper "Exploring Facial Expression and Action Units in Parkinson Disease"☆10Dec 21, 2022Updated 3 years ago
- The offical realization of InstructERC☆148May 25, 2025Updated 10 months ago
- [ICASSP 2026] Task Vector in TTS: Toward Emotionally Expressive Dialectal Speech Synthesis☆38Dec 24, 2025Updated 3 months ago
- Demo for DART, Audio Imagination workshop submission in NeurIPS 2024☆13Apr 15, 2025Updated 11 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- EmoLLM: Multimodal Emotional Understanding Meets Large Language Models☆19Jun 24, 2024Updated last year
- ☆17Mar 21, 2024Updated 2 years ago
- Preprocessing and analysis for training SNOMED-CT concept embeddings from CORD-19 corpus☆16Aug 4, 2023Updated 2 years ago
- decision support system for robotic sampling in precision agriculture☆14Dec 13, 2018Updated 7 years ago
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…☆24Oct 8, 2025Updated 5 months ago
- Random collection of code snippets used to create Deep Fakes☆15Jun 30, 2019Updated 6 years ago
- ☆12Nov 10, 2024Updated last year
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder☆13Mar 11, 2025Updated last year
- Python3, NetworkX, Java, MLlib, Spark, Cassandra, Neo4j 3.0, Gephi, Docker☆11Jul 18, 2017Updated 8 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- A multimodal fine-grained correlation fusion network with attention mechanisms for visual-textual sentiment analysis☆10Jan 13, 2024Updated 2 years ago
- Datasets of Neuropsychological Language Tests in Brazilian Portuguese☆13Oct 14, 2025Updated 5 months ago
- A set of examples for basic audio data handling☆13Aug 15, 2020Updated 5 years ago
- ☆30May 7, 2024Updated last year
- ☆14Oct 12, 2024Updated last year
- Source code for the DeepViral paper☆10Mar 17, 2021Updated 5 years ago
- An optimized pipeline for working with Whole Slide Image (WSI) data in Tensorflow☆14Apr 30, 2021Updated 4 years ago
- Modality-Invariant Temporal Representation Learning☆22Apr 21, 2023Updated 2 years ago
- 基于PC-DDSP和nsf-HiFiGAN的声码器☆18Jul 17, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Fork of "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆17Nov 27, 2024Updated last year
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification☆21Aug 24, 2023Updated 2 years ago
- Functional NodeJS Application Example☆16Feb 28, 2024Updated 2 years ago
- Code of the paper https://arxiv.org/abs/2009.11939. A defocus blur estimation method.☆10Jan 13, 2022Updated 4 years ago
- NILC-Metrix gathers the metrics developed over more than a decade in NILC Lab.☆15Feb 23, 2026Updated last month
- ☆23Oct 23, 2024Updated last year
- ☆30Feb 14, 2026Updated last month