The first comprehensive multimodal language analysis benchmark for evaluating foundation models
☆31Sep 22, 2025Updated 7 months ago
Alternatives and similar repositories for MMLA
Users that are interested in MMLA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- TCL-MAP is a powerful method for multimodal intent recognition (AAAI 2024)☆59Jan 25, 2024Updated 2 years ago
- MIntRec: A New Dataset for Multimodal Intent Recognition (ACM MM 2022)☆135May 2, 2025Updated 11 months ago
- MIntRec2.0 is the first large-scale dataset for multimodal intent recognition and out-of-scope detection in multi-party conversations (IC…☆77Aug 13, 2025Updated 8 months ago
- On Path to Multimodal Generalist: General-Level and General-Bench☆18Jul 11, 2025Updated 9 months ago
- ☆16May 30, 2025Updated 11 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Awesome papers for affective computing with llm and mllm☆22Nov 26, 2025Updated 5 months ago
- Resa: Transparent Reasoning Models via SAEs☆48Sep 23, 2025Updated 7 months ago
- Koishi's Day 2024 Paper (NeurIPS 2024): An advanced persona-driven role-playing system with global faithfulness quantification and optimi…☆11Oct 19, 2025Updated 6 months ago
- ☆16Nov 11, 2025Updated 5 months ago
- This script uses an ensemble of multiple methods: RAKE, TF-IDF and Automatic Keyword Extraction to obtain top keywords in Reddit posts. P…☆12Jul 1, 2017Updated 8 years ago
- [ICCV 2025] Official Implementation of "Shot-by-Shot: Film-Grammar-Aware Training-Free Audio Description Generation". Junyu Xie, Tengda H…☆22Jul 26, 2025Updated 9 months ago
- [Preprint] Efficient Generative Model Training via Embedded Representation Warmup☆36Oct 15, 2025Updated 6 months ago
- DMM: Building a Versatile Image Generation Model via Distillation-Based Model Merging☆47Apr 27, 2025Updated last year
- Berkeley Function Calling Leaderboard (BFCL) with Chinese-Language Evaluation☆25Apr 6, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆24Dec 23, 2024Updated last year
- instruction-following benchmark for large reasoning models☆48Apr 19, 2026Updated last week
- XL-VLMs: General Repository for eXplainable Large Vision Language Models☆48Sep 8, 2025Updated 7 months ago
- ☆19Nov 8, 2019Updated 6 years ago
- Utilities to parse type information and JSDoc annotations from TypeScript source files, and render Markdown documentation☆12Jun 24, 2023Updated 2 years ago
- Quick Long Video Understanding [TMLR2025]☆77Oct 27, 2025Updated 6 months ago
- [CVPR 2025 GMCV] Test-Time Frequency Scaling: Instant Frequency Control for Any Diffusion Model☆55May 31, 2025Updated 11 months ago
- [NeurIPS 2025] Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO☆83Oct 29, 2025Updated 6 months ago
- [ICLR 25] A novel framework for building intrinsically interpretable LLMs with human-understandable concepts to ensure safety, reliabilit…☆32Feb 5, 2026Updated 2 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [AAAI 2025] Empowering LLMs with Pseudo-Untrimmed Videos for Audio-Visual Temporal Understanding☆34Mar 21, 2025Updated last year
- Discovering New Intents with Deep Aligned Clustering (AAAI 2021)☆131Jul 1, 2022Updated 3 years ago
- ☆38Mar 24, 2025Updated last year
- ☆27Apr 29, 2025Updated last year
- ☆43Nov 20, 2023Updated 2 years ago
- Official repo of "MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents". It can be used to evaluate a GUI agent w…☆106Sep 8, 2025Updated 7 months ago
- Deep Unknown Intent Detection with Margin Loss (ACL2019)☆35Dec 8, 2022Updated 3 years ago
- ☆20Jun 12, 2020Updated 5 years ago
- ☆100Jun 23, 2025Updated 10 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [NeurIPS 2025] IEAP: Image Editing As Programs with Diffusion Models☆115Sep 27, 2025Updated 7 months ago
- Implementation of Qformer from BLIP2 in Zeta Lego blocks.☆49Nov 11, 2024Updated last year
- Train a LSTM neural networks on Vox Forge public audio data set to recognize speaker's gender☆13Mar 26, 2026Updated last month
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework☆71Jun 1, 2025Updated 11 months ago
- ☆15Feb 18, 2024Updated 2 years ago
- code for our EMNLP 2017 paper "DOC: Deep Open Classification of Text Documents"☆30Apr 18, 2019Updated 7 years ago
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆12Jun 28, 2025Updated 10 months ago