tianyi-lab / CoSTAR
Cost-Sensitive Toolpath Agent for Multi-turn Image Editing
☆25 · Updated 10 months ago
Alternatives and similar repositories for CoSTAR
Users who are interested in CoSTAR are comparing it to the repositories listed below.
- [ICLR 2025] Source code for paper "A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegr… ☆79 · Updated last year
- [ICLR 2026] Fast-Slow Toolpath Agent with Subroutine Mining for Efficient Multi-turn Image Editing ☆29 · Updated last week
- [ICML 2025] Official Repo for Stability-guided Adaptive Diffusion Acceleration. 🚀🌙Accelerating off-the-shelf diffusion model with a uni… ☆35 · Updated 6 months ago
- ☆21 · Updated 4 months ago
- official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning" ☆37 · Updated last year
- [Preprint] Efficient Generative Model Training via Embedded Representation Warmup ☆36 · Updated 3 months ago
- ☆13 · Updated last year
- [NeurIPS 2025] VeriThinker: Learning to Verify Makes Reasoning Model Efficient ☆64 · Updated 4 months ago
- ☆41 · Updated last year
- ☆39 · Updated 8 months ago
- ☆64 · Updated last week
- [ICLR 2025] Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia… ☆29 · Updated 6 months ago
- ☆63 · Updated 7 months ago
- ☆107 · Updated 8 months ago
- OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement ☆129 · Updated 6 months ago
- The official implementation of our paper "CoRe^2: Collect, Reflect and Refine to Generate Better and Faster". ☆30 · Updated 10 months ago
- A Comprehensive Dataset for Advanced Image Generation and Editing ☆31 · Updated 4 months ago
- Dimple, the first Discrete Diffusion Multimodal Large Language Model ☆114 · Updated 7 months ago
- ☆81 · Updated 7 months ago
- Official Implementation for "Editing Massive Concepts in Text-to-Image Diffusion Models" ☆19 · Updated last year
- (NeurIPS 2025) Official implementation for "MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?" ☆47 · Updated 8 months ago
- Official Implementation for *PaCo-RL: Advancing Reinforcement Learning for Consistent Image Generation with Pairwise Reward Modeling* ☆31 · Updated last month
- ☆17 · Updated last year
- Official Code for "ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning" ☆79 · Updated 2 months ago
- [NeurIPS 2024] The official implementation of "Image Copy Detection for Diffusion Models" ☆18 · Updated last year
- Emergent Hierarchical Reasoning in LLMs/VLMs through Reinforcement Learning ☆60 · Updated 3 months ago
- [ICLR 2026] Official Implementation of Muddit [Meissonic II]: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusio… ☆98 · Updated last week
- [ICLR 2026] Uni-CoT: Towards Unified Chain-of-Thought Reasoning Across Text and Vision ☆207 · Updated 2 weeks ago
- ☆37 · Updated 2 months ago
- [ICLR 2025] Adaptive prompt tailored pruning of T2I diffusion models. ☆15 · Updated last year