[CVPR 2026 (Findings) π₯π₯] Self Evolving Large Multimodal Models with Continuous Rewards
β21Mar 5, 2026Updated last month
Alternatives and similar repositories for EvoLMM
Users that are interested in EvoLMM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- VideoMathQA is a benchmark designed to evaluate mathematical reasoning in real-world educational videosβ23Jan 26, 2026Updated 2 months ago
- β12Jun 20, 2023Updated 2 years ago
- β11Mar 5, 2025Updated last year
- [WACV 2025] Efficient Video Object Segmentation via Modulated Cross-Attention Memoryβ61Feb 28, 2025Updated last year
- β62Nov 12, 2025Updated 5 months ago
- Deploy open-source AI quickly and easily - Bonus Offer β’ AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- β12Mar 20, 2023Updated 3 years ago
- [CVPR 2025] DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document Understandingβ29Dec 18, 2025Updated 4 months ago
- A huge dataset for Document Visual Question Answeringβ22Jul 29, 2024Updated last year
- β39Jul 8, 2025Updated 9 months ago
- A framework aiming to bridge fast robot prototyping, predefined motion primitives, heterogeneous teleoperation, data collection, and flexβ¦β26Apr 4, 2026Updated 2 weeks ago
- Internal utility libraries for Pklβ16Updated this week
- Official code for "pi-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation", ICML 2023.β33Jul 21, 2023Updated 2 years ago
- Holistic Evaluation of Multimodal LLMs on Spatial Intelligenceβ104Updated this week
- Multilingual and Multiculture Benchmark and LLMβ32Apr 10, 2026Updated last week
- GPUs on demand by Runpod - Special Offer Available β’ AdRun AI, ML, and HPC workloads on powerful cloud GPUsβwithout limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- This repository contains PyTorch implementation of the paper ''LFighter: Defending against Label-flipping Attacks in Federated Learning''β¦β19Mar 6, 2026Updated last month
- An official implementation of Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewardsβ37Oct 3, 2025Updated 6 months ago
- v1: Learning to Point Visual Tokens for Multimodal Grounded Reasoningβ19Oct 6, 2025Updated 6 months ago
- Rethinking the Trust Region in LLM Reinforcement Learningβ52Mar 2, 2026Updated last month
- β35Nov 5, 2024Updated last year
- β21Dec 3, 2025Updated 4 months ago
- [CVPR 2025] PACT: Pruning and Clustering-Based Token Reduction for Faster Visual Language Modelsβ59Jan 30, 2026Updated 2 months ago
- β35Mar 31, 2026Updated 2 weeks ago
- β35Mar 13, 2026Updated last month
- Wordpress hosting with auto-scaling - Free Trial β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Official Implementation of MDK12-Bench: A Multi-Discipline Benchmark for Evaluating Reasoning in Multimodal Large Language Modelsβ13Nov 1, 2025Updated 5 months ago
- [CVPR -2025] GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Modelβ129Mar 22, 2025Updated last year
- Repo for the paper: Towards Few-shot Entity Recognition in Document Images:A Label-aware Sequence-to-Sequence Frameworkβ14May 31, 2023Updated 2 years ago
- HOCR Specification Python Parserβ12Sep 23, 2015Updated 10 years ago
- The training codes of Jasper-Token-Compression-600Mβ19Nov 19, 2025Updated 5 months ago
- β33Oct 23, 2025Updated 5 months ago
- [CVPR 2025] Recurrence-Enhanced Vision-and-Language Transformers for Robust Multimodal Document Retrievalβ36Sep 12, 2025Updated 7 months ago
- An unofficial Implementation of DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documentsβ37Sep 9, 2023Updated 2 years ago
- β13Nov 5, 2024Updated last year
- Virtual machines for every use case on DigitalOcean β’ AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ROSA+: RWKV's ROSA implementation with fallback statistical predictorβ34Oct 13, 2025Updated 6 months ago
- β148Apr 8, 2026Updated last week
- β36Jan 9, 2026Updated 3 months ago
- RACE is a multi-dimensional benchmark for code generation that focuses on Readability, mAintainability, Correctness, and Efficiency.β14Oct 12, 2024Updated last year
- Graph Masked Autoencodersβ27Aug 28, 2022Updated 3 years ago
- The official repository of the first version of ACE-Brain foundation model.β74Mar 13, 2026Updated last month
- Scripting Multi-Scene Videos with Time-Aware and Structural Audio-Visual Captionsβ30Feb 11, 2026Updated 2 months ago