The first comprehensive multimodal language analysis benchmark for evaluating foundation models
☆31Sep 22, 2025Updated 7 months ago
Alternatives and similar repositories for MMLA
Users that are interested in MMLA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- MIntRec2.0 is the first large-scale dataset for multimodal intent recognition and out-of-scope detection in multi-party conversations (IC…☆78Aug 13, 2025Updated 9 months ago
- ☆16May 30, 2025Updated 11 months ago
- [ACM MM2024] The code for HMLLM.☆11Oct 27, 2024Updated last year
- Awesome papers for affective computing with llm and mllm☆24Nov 26, 2025Updated 5 months ago
- Resa: Transparent Reasoning Models via SAEs☆49Sep 23, 2025Updated 7 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆50Nov 9, 2025Updated 6 months ago
- [ICCV 2025] Official Implementation of "Shot-by-Shot: Film-Grammar-Aware Training-Free Audio Description Generation". Junyu Xie, Tengda H…☆22Updated this week
- ☆14Jul 6, 2025Updated 10 months ago
- DMM: Building a Versatile Image Generation Model via Distillation-Based Model Merging☆47Apr 27, 2025Updated last year
- Berkeley Function Calling Leaderboard (BFCL) with Chinese-Language Evaluation☆25Apr 6, 2025Updated last year
- ☆24Dec 23, 2024Updated last year
- instruction-following benchmark for large reasoning models☆48Apr 19, 2026Updated last month
- XL-VLMs: General Repository for eXplainable Large Vision Language Models☆49Sep 8, 2025Updated 8 months ago
- Quick Long Video Understanding [TMLR2025]☆78Oct 27, 2025Updated 6 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆17Feb 26, 2024Updated 2 years ago
- [CVPR 2025 GMCV] Test-Time Frequency Scaling: Instant Frequency Control for Any Diffusion Model☆56May 31, 2025Updated 11 months ago
- [NeurIPS 2025] First SFT, Second RL, Third UPT: Continual Improving Multi-Modal LLM Reasoning via Unsupervised Post-Training☆86Oct 29, 2025Updated 6 months ago
- The official code repository for the FullFront benchmark☆27May 16, 2025Updated last year
- [ICLR 25] A novel framework for building intrinsically interpretable LLMs with human-understandable concepts to ensure safety, reliabilit…☆33Feb 5, 2026Updated 3 months ago
- [AAAI 2025] Empowering LLMs with Pseudo-Untrimmed Videos for Audio-Visual Temporal Understanding☆34Mar 21, 2025Updated last year
- ☆39Mar 24, 2025Updated last year
- 🚀 Pre-process, annotate, evaluate, and train your Affect Computing (e.g., Multimodal Emotion Recognition, Sentiment Analysis) datasets A…☆97Mar 13, 2026Updated 2 months ago
- Heterogenous, Task- and Domain-Specific Benchmark for Unsupervised Sentence Embeddings used in the TSDAE paper: https://arxiv.org/abs/210…☆29Jan 4, 2022Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆43Nov 20, 2023Updated 2 years ago
- Official repo of "MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents". It can be used to evaluate a GUI agent w…☆110Sep 8, 2025Updated 8 months ago
- ☆101Jun 23, 2025Updated 10 months ago
- ☆12Jan 26, 2023Updated 3 years ago
- [NeurIPS 2025] IEAP: Image Editing As Programs with Diffusion Models☆116Sep 27, 2025Updated 7 months ago
- [AAAI 2024] DTF-AT: Decoupled Time-Frequency Audio Transformer for Event Classification☆12Mar 10, 2025Updated last year
- Implementation of Qformer from BLIP2 in Zeta Lego blocks.☆49Nov 11, 2024Updated last year
- The implementation for FREE-Merging: Fourier Transform for Model Merging with Lightweight Experts (ICCV25)☆15Jun 26, 2025Updated 10 months ago
- A reading list for research topics in multimodal deception detection.☆45Aug 29, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework☆71Jun 1, 2025Updated 11 months ago
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆12Jun 28, 2025Updated 10 months ago
- LIMI: Less is More for Agency☆161Oct 14, 2025Updated 7 months ago
- [Official Implementation] Improving Editability in Image Generation with Layer-wise Memory, CVPR 2025☆38Mar 2, 2026Updated 2 months ago
- Official code of paper "Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models"☆88May 27, 2025Updated 11 months ago
- ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning☆118Oct 28, 2025Updated 6 months ago
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year