☆30Dec 14, 2025Updated 4 months ago
Alternatives and similar repositories for MPDD
Users that are interested in MPDD are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆35Mar 18, 2026Updated 3 weeks ago
- [ACL 20205] Official respository for EvoPatient: LLMs Can Simulate Standardized Patients via Agent Coevolution☆20Jan 23, 2026Updated 2 months ago
- Official Implement of ECCV 2024 paper "Multi-modal Crowd Counting via a Broker Modality"☆17Mar 19, 2026Updated 3 weeks ago
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model☆13Mar 30, 2025Updated last year
- [ACM-MM 2025 Workshop] More Is Better: A MoE-Based Emotion Recognition Framework with Human Preference Alignment.☆25Nov 25, 2025Updated 4 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆23Jan 29, 2026Updated 2 months ago
- MMER☆17Jan 8, 2026Updated 3 months ago
- ☆11Feb 14, 2025Updated last year
- [ICASSP2024] Code for paper "SDIF-DA: A Shallow-to-Deep Interaction Framework with Data Augmentation for Multi-modal Intent Detection"☆15Jul 6, 2024Updated last year
- Publically available AI generated faces☆13Sep 22, 2022Updated 3 years ago
- The final coursework for AI in Mental Health @ PKU.☆19Jan 5, 2024Updated 2 years ago
- Alignment-Free RGBT Salient Object Detection: Semantics-guided Asymmetric Correlation Network and A Unified Benchmark☆26Oct 17, 2025Updated 5 months ago
- Dimensional estimation of emotions (Arousal, Valence, Intensity) from facial landmarks extracted by DLIB.☆29Jan 14, 2026Updated 3 months ago
- This GitHub provides the source code for the paper "Exploring Facial Expression and Action Units in Parkinson Disease"☆10Dec 21, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- The offical realization of InstructERC☆148May 25, 2025Updated 10 months ago
- Supporting code for the paper "Portuguese Language Models and Word Embeddings: Evaluating on Semantic Similarity Tasks".☆11Dec 8, 2022Updated 3 years ago
- [ICASSP 2026] Task Vector in TTS: Toward Emotionally Expressive Dialectal Speech Synthesis☆38Dec 24, 2025Updated 3 months ago
- Demo for DART, Audio Imagination workshop submission in NeurIPS 2024☆13Apr 15, 2025Updated last year
- EmoLLM: Multimodal Emotional Understanding Meets Large Language Models☆19Jun 24, 2024Updated last year
- ☆17Mar 21, 2024Updated 2 years ago
- Cross-Modality Attentive Feature Fusion for Object Detection in Multispectral Remote Sensing Imagery☆16Oct 7, 2022Updated 3 years ago
- Preprocessing and analysis for training SNOMED-CT concept embeddings from CORD-19 corpus☆16Aug 4, 2023Updated 2 years ago
- code related to submission of "Accurate diagnosis of lymphoma on whole slide histopathology images using deep learning"☆10May 12, 2020Updated 5 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…☆24Oct 8, 2025Updated 6 months ago
- Random collection of code snippets used to create Deep Fakes☆15Jun 30, 2019Updated 6 years ago
- ☆12Nov 10, 2024Updated last year
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder☆13Mar 11, 2025Updated last year
- Python3, NetworkX, Java, MLlib, Spark, Cassandra, Neo4j 3.0, Gephi, Docker☆11Jul 18, 2017Updated 8 years ago
- Datasets of Neuropsychological Language Tests in Brazilian Portuguese☆13Oct 14, 2025Updated 6 months ago
- A set of examples for basic audio data handling☆13Aug 15, 2020Updated 5 years ago
- ☆31May 7, 2024Updated last year
- Spatio-channel Attention Blocks for Cross-modal Crowd Counting -- Official Pytorch Implementation (ACCV'22, Oral)☆27Dec 4, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆14Oct 12, 2024Updated last year
- Source code for the DeepViral paper☆11Mar 17, 2021Updated 5 years ago
- An optimized pipeline for working with Whole Slide Image (WSI) data in Tensorflow☆14Apr 30, 2021Updated 4 years ago
- Modality-Invariant Temporal Representation Learning☆22Apr 21, 2023Updated 2 years ago
- 基于PC-DDSP和nsf-HiFiGAN的声码器☆18Jul 17, 2023Updated 2 years ago
- Codes available of a paper: An Efficient Cervical Whole Slide Image Analysis Framework Based on Multi-scale Semantic and Location Deep Fe…☆16Jul 26, 2022Updated 3 years ago
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification☆21Aug 24, 2023Updated 2 years ago