Interleaving Reasoning: Next-Generation Reasoning Systems for AGI
☆268Oct 17, 2025Updated 6 months ago
Alternatives and similar repositories for Awesome-Interleaving-Reasoning
Users that are interested in Awesome-Interleaving-Reasoning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICLR 2026] This is an early exploration to introduce Interleaving Reasoning to Text-to-image Generation field and achieve the SoTA bench…☆93Jan 26, 2026Updated 3 months ago
- This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-bas…☆1,408Apr 19, 2026Updated 2 weeks ago
- Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual in…☆1,440Mar 9, 2026Updated 2 months ago
- Latest open-source "Thinking with images" (O3/O4-mini) papers, covering training-free, SFT-based, and RL-enhanced methods for "fine-grain…☆113Aug 21, 2025Updated 8 months ago
- [ICLR2026] This is the first paper to explore how to effectively use R1-like RL for MLLMs and introduce Vision-R1, a reasoning MLLM that…☆975Mar 20, 2026Updated last month
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning☆36Aug 28, 2025Updated 8 months ago
- 🔥🔥🔥Latest Papers, Codes on Uncertainty-based RL☆59Aug 24, 2025Updated 8 months ago
- 🔍 Awesome Agentic Search is a curated list of papers, tools, and resources on agentic search—where AI agents plan, search, and reason to…☆56Aug 28, 2025Updated 8 months ago
- [CVPR 2026] An official implementation of "Think Visually, Reason Textually: Vision-Language Synergy in ARC"☆41Nov 26, 2025Updated 5 months ago
- A video question answering dataset that focuses on the dynamics properties of objects (velocity, acceleration) and their collisions withi…☆19Apr 23, 2025Updated last year
- FeatureAlignment = Alignment + Mechanistic Interpretability☆35Mar 8, 2025Updated last year
- Geometric Problem Solving Integrating FormalGeo Symbolic System and Hypergraph Neural Network.☆15Sep 23, 2025Updated 7 months ago
- ☆1,204Nov 20, 2025Updated 5 months ago
- This repository will continuously update the latest papers, technical reports, benchmarks about multimodal reasoning!☆55Mar 21, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Multi-step reasoning MLLM☆22Mar 8, 2026Updated 2 months ago
- Co-Reinforcement Learning for Unified Multimodal Understanding and Generation☆47Jul 22, 2025Updated 9 months ago
- ZJU毛概资料汇总☆10Mar 16, 2024Updated 2 years ago
- [ICLR 2026] An official implementation of "SIM-CoT: Supervised Implicit Chain-of-Thought"☆202Apr 13, 2026Updated 3 weeks ago
- Awesome latest models, datasets and benchmarks on streaming/online video understanding.☆27Oct 19, 2025Updated 6 months ago
- ☆123Jul 22, 2025Updated 9 months ago
- Code for the paper "Self-Detoxifying Language Models via Toxification Reversal" (EMNLP 2023)☆18Oct 17, 2023Updated 2 years ago
- ☆32Sep 14, 2025Updated 7 months ago
- Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective (ACL 2024)☆57Oct 28, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Official Implementation of "Visual-ERM: Reward Modeling for Visual Equivalence"☆63Mar 23, 2026Updated last month
- [TMLR 2025] Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models☆765Feb 28, 2026Updated 2 months ago
- Paper list for Efficient Reasoning.☆882Updated this week
- [NIPS2025] VideoChat-R1 & R1.5: Enhancing Spatio-Temporal Perception and Reasoning via Reinforcement Fine-Tuning☆266Oct 18, 2025Updated 6 months ago
- [ACL 2025 Findings] Training Turn-by-Turn Verifiers for Dialogue Tutoring Agents: The Curious Case of LLMs as Your Coding Tutors☆87Jun 2, 2025Updated 11 months ago
- OpenThinkIMG is an end-to-end open-source framework that empowers LVLMs to think with images.☆365Jun 1, 2025Updated 11 months ago
- [ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning☆74Jul 13, 2025Updated 9 months ago
- [CVPR 2026] Thinking with Programming Vision: Towards a Unified View for Thinking with Images☆69Jan 23, 2026Updated 3 months ago
- Echos is a headless, API-driven DAW engine. It’s the backend for building AI tools that automate the entire music production lifecycle.☆56Nov 10, 2025Updated 5 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.☆844May 14, 2025Updated 11 months ago
- Offline implementation of UniREditBench: A Unified Reasoning-based Image Editing Benchmark.☆56Mar 31, 2026Updated last month
- PyTorch implementation of StableMask (ICML'24)☆15Jun 27, 2024Updated last year
- [CVPR2025 Highlight] Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models☆240Nov 7, 2025Updated 6 months ago
- [ACL 2024] Instruct Once, Chat Consistently in Multiple Rounds: An Efficient Tuning Framework for Dialogue☆26Oct 18, 2025Updated 6 months ago
- ☆112Jan 8, 2025Updated last year
- ☆28Oct 28, 2024Updated last year