Doodling our way to AGI βοΈ πΌοΈ π§
β124May 29, 2025Updated 11 months ago
Alternatives and similar repositories for thinking-with-generated-images
Users that are interested in thinking-with-generated-images are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICCV 2025] A Benchmark for Multi-Step Reasoning in Long Narrative Videosβ27Aug 8, 2025Updated 9 months ago
- Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual inβ¦β1,440Mar 9, 2026Updated 2 months ago
- Math-VR Benchmark & CodePlot-CoT: Mathematical Visual Reasoning by Thinking with Code-Driven Imagesβ59Nov 4, 2025Updated 6 months ago
- Video-Holmes: Can MLLM Think Like Holmes for Complex Video Reasoning?β92Jul 13, 2025Updated 9 months ago
- The official implementation of COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence.β36Dec 30, 2025Updated 4 months ago
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- https://huggingface.co/datasets/multimodal-reasoning-lab/Zebra-CoTβ134Jan 30, 2026Updated 3 months ago
- β19Jan 26, 2025Updated last year
- Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Schemeβ148Apr 9, 2025Updated last year
- [NeurIPS 2025 DB] OneIG-Bench is a meticulously designed comprehensive benchmark framework for fine-grained evaluation of T2I models acroβ¦β119Feb 10, 2026Updated 2 months ago
- β1,204Nov 20, 2025Updated 5 months ago
- More reliable Video Understanding Evaluationβ15Sep 23, 2025Updated 7 months ago
- β51Oct 29, 2023Updated 2 years ago
- β45Mar 24, 2026Updated last month
- Official PyTorch implementation of RACRO (https://www.arxiv.org/abs/2506.04559)β19Jul 1, 2025Updated 10 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Official code repository of Shuffle-R1β25Feb 23, 2026Updated 2 months ago
- DeepSeek-V3.2-Exp DSA Warmup Lightning Indexer training operator based on tilelangβ44Nov 19, 2025Updated 5 months ago
- [ICLR 2026] The official repository for paper "ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning"β183May 1, 2026Updated last week
- KnowRL: Exploring Knowledgeable Reinforcement Learning for Factualityβ44Dec 1, 2025Updated 5 months ago
- β123Jul 22, 2025Updated 9 months ago
- [ICLR 2026] Official Implementation of Muddit [Meissonic II]: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusioβ¦β113Apr 13, 2026Updated 3 weeks ago
- EARL: Editing with Autoregression and RLβ42Nov 21, 2025Updated 5 months ago
- E-GRPO: High Entropy Steps Drive Effective Reinforcement Learning for Flow Modelsβ41Jan 5, 2026Updated 4 months ago
- [ICLR 2026] ParallelBench: Understanding the Tradeoffs of Parallel Decoding in Diffusion LLMsβ45Mar 27, 2026Updated last month
- Serverless GPU API endpoints on Runpod - Get Bonus Credits β’ AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [Extended verision ICLR 2025 Blog Track] Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generatioβ¦β837Jun 16, 2025Updated 10 months ago
- β30Jul 2, 2024Updated last year
- GenEval: An object-focused framework for evaluating text-to-image alignmentβ448Mar 3, 2025Updated last year
- SFT+RL boosts multimodal reasoningβ48Jun 27, 2025Updated 10 months ago
- SophiaVL-R1: Reinforcing MLLMs Reasoning with Thinking Rewardβ94Aug 8, 2025Updated 9 months ago
- [ICLR26] GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learningβ105Jan 27, 2026Updated 3 months ago
- [NeurIPS 2025 D&B Track] MLR-Bench: Evaluating AI Agents on Open-Ended Machine Learning Researchβ28Sep 23, 2025Updated 7 months ago
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Schedulingβ42Dec 29, 2025Updated 4 months ago
- [NIPS 25'] Evaluation code of paper "KRIS-Bench: Benchmarking Next-Level Intelligent Image Editing Models"β45Oct 19, 2025Updated 6 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits β’ AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [CVPR 2026] UnicEdit-10M and UnicBench projectβ41Mar 3, 2026Updated 2 months ago
- β22Apr 15, 2025Updated last year
- [ICCV 23] A Simple Vision Transformer for Weakly Semi-supervised 3D Object Detectionβ13Apr 12, 2024Updated 2 years ago
- Selftok: Discrete Visual Tokens of Autoregression, by Diffusion, and for Reasoningβ237May 30, 2025Updated 11 months ago
- β21Jun 16, 2025Updated 10 months ago
- [CVPR 2025] π₯ Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".β459Aug 8, 2025Updated 9 months ago
- β16Nov 18, 2023Updated 2 years ago