Osilly / Awesome-Interleaving-Reasoning
Interleaving Reasoning: Next-Generation Reasoning Systems for AGI
☆251 · Oct 17, 2025 · Updated 3 months ago
Alternatives and similar repositories for Awesome-Interleaving-Reasoning
Users who are interested in Awesome-Interleaving-Reasoning are comparing it to the repositories listed below.
- [ICLR 2026] This is an early exploration to introduce Interleaving Reasoning to Text-to-image Generation field and achieve the SoTA bench… ☆87 · Jan 26, 2026 · Updated 2 weeks ago
- This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-bas… ☆1,349 · Dec 7, 2025 · Updated 2 months ago
- Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual in… ☆1,329 · Feb 3, 2026 · Updated last week
- VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection ☆22 · May 31, 2025 · Updated 8 months ago
- [ICLR2026] This is the first paper to explore how to effectively use R1-like RL for MLLMs and introduce Vision-R1, a reasoning MLLM that… ☆760 · Jan 26, 2026 · Updated 2 weeks ago
- 🔥🔥🔥Latest Papers, Codes on Uncertainty-based RL ☆60 · Aug 24, 2025 · Updated 5 months ago
- [Up-To-Date] Awesome Agent Memory Paper Resource ☆50 · Updated this week
- This repository will continuously update the latest papers, technical reports, benchmarks about multimodal reasoning! ☆54 · Mar 21, 2025 · Updated 10 months ago
- [ICLR 2026] An official implementation of "SIM-CoT: Supervised Implicit Chain-of-Thought" ☆165 · Feb 4, 2026 · Updated last week
- Code for the paper "Self-Detoxifying Language Models via Toxification Reversal" (EMNLP 2023) ☆18 · Oct 17, 2023 · Updated 2 years ago
- FeatureAlignment = Alignment + Mechanistic Interpretability ☆34 · Mar 8, 2025 · Updated 11 months ago
- CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning ☆34 · Aug 28, 2025 · Updated 5 months ago
- Co-Reinforcement Learning for Unified Multimodal Understanding and Generation ☆39 · Jul 22, 2025 · Updated 6 months ago
- Echos is a headless, API-driven DAW engine. It’s the backend for building AI tools that automate the entire music production lifecycle. ☆55 · Nov 10, 2025 · Updated 3 months ago
- ☆37 · Nov 26, 2025 · Updated 2 months ago
- Resources and paper list for 'Scaling Environments for Agents'. This repository accompanies our survey on how environments contribute to … ☆60 · Jan 28, 2026 · Updated 2 weeks ago
- Offline implementation of UniREditBench: A Unified Reasoning-based Image Editing Benchmark. ☆52 · Jan 7, 2026 · Updated last month
- The official implementation of COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence. ☆28 · Dec 30, 2025 · Updated last month
- Paper list for Efficient Reasoning. ☆822 · Jan 31, 2026 · Updated 2 weeks ago
- [TMLR 2025] Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models ☆736 · Oct 20, 2025 · Updated 3 months ago
- ☆1,122 · Nov 20, 2025 · Updated 2 months ago
- ☆19 · Aug 7, 2025 · Updated 6 months ago
- Instruct Once, Chat Consistently in Multiple Rounds: An Efficient Tuning Framework for Dialogue (ACL 2024) ☆25 · Oct 18, 2025 · Updated 3 months ago
- Doodling our way to AGI ✏️ 🖼️ 🧠 ☆122 · May 29, 2025 · Updated 8 months ago
- Training Turn-by-Turn Verifiers for Dialogue Tutoring Agents: The Curious Case of LLMs as Your Coding Tutors (ACL Findings 2025) ☆88 · Jun 2, 2025 · Updated 8 months ago
- Stable-DiffCoder is a family of lightweight open-source code DLLMs (diffusion large language models) comprising base and instruct models, … ☆68 · Jan 23, 2026 · Updated 3 weeks ago
- The official implementation of HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization ☆18 · Mar 7, 2025 · Updated 11 months ago
- Collected materials for the ZJU "Introduction to Mao Zedong Thought" (毛概) course ☆10 · Mar 16, 2024 · Updated last year
- [ICCV 2025] A Benchmark for Multi-Step Reasoning in Long Narrative Videos ☆24 · Aug 8, 2025 · Updated 6 months ago
- MARSHAL: Incentivizing Multi-Agent Reasoning via Self-Play with Strategic LLMs ☆36 · Updated this week
- A video question answering dataset that focuses on the dynamics properties of objects (velocity, acceleration) and their collisions withi… ☆18 · Apr 23, 2025 · Updated 9 months ago
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch ☆28 · Updated this week
- [KernelGYM & Dr. Kernel] A distributed GPU environment and a collection of RL training methods to support RL for Kernel Generations ☆83 · Feb 6, 2026 · Updated last week
- More reliable Video Understanding Evaluation ☆14 · Sep 23, 2025 · Updated 4 months ago
- Holistic Evaluation of Multimodal LLMs on Spatial Intelligence ☆79 · Updated this week
- code for promptCSE, emnlp 2022 ☆11 · Apr 10, 2023 · Updated 2 years ago
- LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding ☆34 · Jan 16, 2026 · Updated 3 weeks ago
- Target-constrained Bidirectional Planning for Generation of Target-oriented Proactive Dialogue (TOIS) ☆13 · Oct 18, 2025 · Updated 3 months ago
- Awesome Unified Multimodal Models ☆1,108 · Feb 6, 2026 · Updated last week