tianyi-lab / CoSTARLinks
Cost-Sensitive Toolpath Agent for Multi-turn Image Editing
☆25Updated 9 months ago
Alternatives and similar repositories for CoSTAR
Users that are interested in CoSTAR are comparing it to the libraries listed below
Sorting:
- Fast-Slow Toolpath Agent with Subroutine Mining for Efficient Multi-turn Image Editing☆28Updated 6 months ago
- ☆13Updated 11 months ago
- [ICLR 2025] Source code for paper "A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegr…☆79Updated last year
- [ICML 2025] Official Repo for Stability-guided Adaptive Diffusion Acceleration. 🚀🌙Accelerating off-the-shelf diffusion model with a uni…☆34Updated 5 months ago
- The official implementation of our paper "CoRe^2: Collect, Reflect and Refine to Generate Better and Faster".☆29Updated 9 months ago
- Official Implementation for *PaCo-RL: Advancing Reinforcement Learning for Consistent Image Generation with Pairwise Reward Modeling*☆27Updated 2 weeks ago
- ☆21Updated 3 months ago
- ☆20Updated 5 months ago
- [ICLR 2025] Adaptive prompt tailored pruning of T2I diffusion models.☆15Updated 10 months ago
- A Comprehensive Dataset for Advanced Image Generation and Editing}☆30Updated 2 months ago
- [NeurIPS 2024] The official implementation of "Image Copy Detection for Diffusion Models"☆17Updated last year
- official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"☆36Updated 11 months ago
- ☆17Updated 11 months ago
- [Preprint] Efficient Generative Model Training via Embedded Representation Warmup☆36Updated 2 months ago
- [NeurIPS 2025] ScaleKV: Memory-Efficient Visual Autoregressive Modeling with Scale-Aware KV Cache Compression☆51Updated last month
- UNCAGE: Contrastive Attention Guidance for Masked Generative Transformers in Text-to-Image Generation☆17Updated 4 months ago
- Quick Long Video Understanding☆71Updated 2 months ago
- Official Implementation for "Editing Massive Concepts in Text-to-Image Diffusion Models"☆19Updated last year
- More reliable Video Understanding Evaluation☆13Updated 3 months ago
- This repo contains code for the paper "Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM"☆16Updated 2 months ago
- 🔥 [NeurIPS 2025] Official implementation of "Generate, but Verify: Reducing Visual Hallucination in Vision-Language Models with Retrospe…☆50Updated 3 months ago
- [Arxiv 2025] SparseD: Sparse Attention for Diffusion Language Models☆53Updated 2 months ago
- ☆23Updated 7 months ago
- Official Repository of Personalized Visual Instruct Tuning☆33Updated 9 months ago
- [Arxiv 2025] In-Video Instructions: Visual Signals as Generative Control☆46Updated last month
- ☆15Updated 6 months ago
- Official Code for "ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning"☆72Updated 3 weeks ago
- The official repo of continuous speculative decoding☆31Updated 9 months ago
- ☆80Updated 6 months ago
- [ICLR 2025] Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia…☆27Updated 5 months ago