[CVPR 2026 (Findings) π₯π₯] Self Evolving Large Multimodal Models with Continuous Rewards
β20Mar 5, 2026Updated this week
Alternatives and similar repositories for EvoLMM
Users that are interested in EvoLMM are comparing it to the libraries listed below
Sorting:
- VideoMathQA is a benchmark designed to evaluate mathematical reasoning in real-world educational videosβ22Jan 26, 2026Updated last month
- β33Jul 8, 2025Updated 8 months ago
- β56Nov 12, 2025Updated 3 months ago
- Official code for "pi-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation", ICML 2023.β33Jul 21, 2023Updated 2 years ago
- Holistic Evaluation of Multimodal LLMs on Spatial Intelligenceβ87Feb 25, 2026Updated last week
- [CVPR 2026] A training-free, mask-free framework for 3D shape editing.β25Dec 12, 2025Updated 2 months ago
- β55Updated this week
- [CVPR 2026] Official Code for "ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning"β84Feb 13, 2026Updated 3 weeks ago
- Official implementation of "Imaginarium: Vision-guided High-quality 3D Scene Layout Generation"β43Dec 30, 2025Updated 2 months ago
- [ACM MM 2023] The released code of paper "Deconfounded Visual Question Generation with Causal Inference"β11Sep 3, 2024Updated last year
- Official Implementation of MDK12-Bench: A Multi-Discipline Benchmark for Evaluating Reasoning in Multimodal Large Language Modelsβ12Nov 1, 2025Updated 4 months ago
- β29Jan 15, 2026Updated last month
- A framework aiming to bridge fast robot prototyping, predefined motion primitives, heterogeneous teleoperation, data collection, and flexβ¦β22Mar 2, 2026Updated last week
- β13Nov 5, 2024Updated last year
- β15Feb 11, 2025Updated last year
- [ICLR2025] Are Large Vision Language Models Good Game Players?β12Mar 3, 2025Updated last year
- β35Nov 17, 2025Updated 3 months ago
- β16Sep 1, 2025Updated 6 months ago
- Scripting Multi-Scene Videos with Time-Aware and Structural Audio-Visual Captionsβ23Feb 11, 2026Updated 3 weeks ago
- [ICML 2025 Oral] This is the official repository of the paper "What Limits Virtual Agent Application? OmniBench: A Scalable Multi-Dimensiβ¦β21Jun 12, 2025Updated 8 months ago
- The PyTorch implementation of DSM (EMNLP 2022).β10Mar 26, 2024Updated last year
- Internal utility libraries for Pklβ15Mar 2, 2026Updated last week
- β12Jun 20, 2023Updated 2 years ago
- This repository collects awesome representative papers and resources for "From Pre-training to Post-training: A Survey on Time Series Fouβ¦β31Feb 1, 2026Updated last month
- Generating Summaries with Controllable Readability Levels (EMNLP 2023)β15Aug 6, 2025Updated 7 months ago
- β20Updated this week
- β34Jan 9, 2026Updated 2 months ago
- an incremental learning frameworkβ47Sep 9, 2023Updated 2 years ago
- [CVPR 2025] PACT: Pruning and Clustering-Based Token Reduction for Faster Visual Language Modelsβ56Jan 30, 2026Updated last month
- The source code of paper: Learning Disentangled Semantic Representations for Zero-Shot Cross-Lingual Transfer in Multilingual Machine Reaβ¦β12Apr 6, 2022Updated 3 years ago
- β64Feb 4, 2026Updated last month
- Create your own 3D scene with words anywhere.β32Updated this week
- The first toolkit for MLRM safety evaluation, providing unified interface for mainstream models, datasets, and jailbreaking methods!β15Apr 8, 2025Updated 11 months ago
- β20Dec 3, 2025Updated 3 months ago
- β11Jul 31, 2022Updated 3 years ago
- β11Mar 5, 2025Updated last year
- β18Apr 10, 2025Updated 10 months ago
- PyTorch code for the CVPR'23 paper: "ConStruct-VL: Data-Free Continual Structured VL Concepts Learning"β14Feb 5, 2024Updated 2 years ago
- β31Oct 21, 2025Updated 4 months ago