GAIR-NLP / thinking-with-generated-imagesView external linksLinks
Doodling our way to AGI โ๏ธ ๐ผ๏ธ ๐ง
โ122May 29, 2025Updated 8 months ago
Alternatives and similar repositories for thinking-with-generated-images
Users that are interested in thinking-with-generated-images are comparing it to the libraries listed below
Sorting:
- [ICCV 2025] A Benchmark for Multi-Step Reasoning in Long Narrative Videosโ24Aug 8, 2025Updated 6 months ago
- More reliable Video Understanding Evaluationโ14Sep 23, 2025Updated 4 months ago
- Math-VR Benchmark & CodePlot-CoT: Mathematical Visual Reasoning by Thinking with Code-Driven Imagesโ52Nov 4, 2025Updated 3 months ago
- Official PyTorch implementation of RACRO (https://www.arxiv.org/abs/2506.04559)โ19Jul 1, 2025Updated 7 months ago
- Thinking with Programming Vision: Towards a Unified View for Thinking with Imagesโ54Jan 23, 2026Updated 3 weeks ago
- E-GRPO: High Entropy Steps Drive Effective Reinforcement Learning for Flow Modelsโ38Jan 5, 2026Updated last month
- https://huggingface.co/datasets/multimodal-reasoning-lab/Zebra-CoTโ123Jan 30, 2026Updated 2 weeks ago
- Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual inโฆโ1,329Feb 3, 2026Updated last week
- [ICLR 26] The official code repository for the paper "Mirage or Method? How ModelโTask Alignment Induces Divergent RL Conclusions".โ15Updated this week
- ChineseCLIP using online learningโ13Nov 7, 2022Updated 3 years ago
- UnicEdit-10M and UnicBench projectโ23Updated this week
- [ACM MM25] Official Pytorch implementation of [Decoupled Global-Local Alignment for Improving Compositional Understanding]โ15Jul 15, 2025Updated 6 months ago
- SFT+RL boosts multimodal reasoningโ45Jun 27, 2025Updated 7 months ago
- [AAAI 2026] ReCode: Reinforced Code Knowledge Editing for API Updatesโ22Jul 1, 2025Updated 7 months ago
- โ32Jan 25, 2026Updated 2 weeks ago
- โ16May 21, 2025Updated 8 months ago
- MARSHAL: Incentivizing Multi-Agent Reasoning via Self-Play with Strategic LLMsโ35Jan 18, 2026Updated 3 weeks ago
- โ13Jan 22, 2025Updated last year
- This is the official repository for the paper "MathCanvas: Intrinsic Visual Chain-of-Thought for Multimodal Mathematical Reasoning"โ59Dec 29, 2025Updated last month
- โ117Jul 22, 2025Updated 6 months ago
- โ50Oct 29, 2023Updated 2 years ago
- LoPA: Scaling dLLM Inference via Lookahead Parallel Decodingโ34Jan 16, 2026Updated 3 weeks ago
- B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasonersโ86May 21, 2025Updated 8 months ago
- โ1,122Nov 20, 2025Updated 2 months ago
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Schedulingโ42Dec 29, 2025Updated last month
- Codes for ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding [ICML 2025]]โ45Jul 22, 2025Updated 6 months ago
- PICABench: How Far Are We from Physically Realistic Image Editing?โ36Nov 5, 2025Updated 3 months ago
- Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Schemeโ147Apr 9, 2025Updated 10 months ago
- SophiaVL-R1: Reinforcing MLLMs Reasoning with Thinking Rewardโ91Aug 8, 2025Updated 6 months ago
- โ20Jun 16, 2025Updated 7 months ago
- OneEdit: A Neural-Symbolic Collaboratively Knowledge Editing System.โ19Oct 14, 2024Updated last year
- Official code repository of Shuffle-R1โ25Jan 27, 2026Updated 2 weeks ago
- โ42Jul 9, 2025Updated 7 months ago
- [NeurIPS 2025 DB] OneIG-Bench is a meticulously designed comprehensive benchmark framework for fine-grained evaluation of T2I models acroโฆโ106Jan 9, 2026Updated last month
- โ64Feb 1, 2026Updated last week
- DeepSeek-V3.2-Exp DSA Warmup Lightning Indexer training operator based on tilelangโ43Nov 19, 2025Updated 2 months ago
- A Comprehensive Dataset for Advanced Image Generation and Editing}โ31Oct 2, 2025Updated 4 months ago
- โ19Mar 25, 2025Updated 10 months ago
- [ICLR 2026] An official implementation of "SIM-CoT: Supervised Implicit Chain-of-Thought"โ165Feb 4, 2026Updated last week