tianyi-lab / CoSTARLinks
Cost-Sensitive Toolpath Agent for Multi-turn Image Editing
☆23Updated 5 months ago
Alternatives and similar repositories for CoSTAR
Users that are interested in CoSTAR are comparing it to the libraries listed below
Sorting:
- ☆12Updated 8 months ago
- Fast-Slow Toolpath Agent with Subroutine Mining for Efficient Multi-turn Image Editing☆27Updated 2 months ago
- [ICLR 2025] Source code for paper "A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegr…☆77Updated 9 months ago
- [NeurIPS 2024] The official implementation of "Image Copy Detection for Diffusion Models"☆16Updated 11 months ago
- ☆57Updated 3 months ago
- UNCAGE: Contrastive Attention Guidance for Masked Generative Transformers in Text-to-Image Generation☆17Updated last month
- [Preprint] Efficient Generative Model Training via Embedded Representation Warmup☆35Updated 5 months ago
- [ICLR 2025] Weighted-Reward Preference Optimization for Implicit Model Fusion☆13Updated 6 months ago
- This repo contains code for the paper "Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM"☆16Updated last month
- ☆17Updated 8 months ago
- Official Implementation for "Editing Massive Concepts in Text-to-Image Diffusion Models"☆19Updated last year
- Test-time Scaling for VAR models☆23Updated this week
- official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"☆36Updated 8 months ago
- ☆75Updated 3 months ago
- ☆11Updated 11 months ago
- [NAACL 2025 Oral] Multimodal Needle in a Haystack (MMNeedle): Benchmarking Long-Context Capability of Multimodal Large Language Models☆48Updated 4 months ago
- ☆41Updated last year
- ☆19Updated 2 months ago
- Distilling Diversity and Control in Diffusion Models☆45Updated 4 months ago
- Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening☆66Updated 4 months ago
- Code and data for the paper: Learning Action and Reasoning-Centric Image Editing from Videos and Simulation☆30Updated 2 months ago
- Quick Long Video Understanding☆64Updated 3 months ago
- ☆18Updated 11 months ago
- Official Implementation of Muddit [Meissonic II]: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusion Model.☆83Updated last month
- [ICLR 2025] Adaptive prompt tailored pruning of T2I diffusion models.☆13Updated 7 months ago
- Optimizing Anytime Reasoning via Budget Relative Policy Optimization☆46Updated 2 months ago
- [NeurIPS 2024] A task generation and model evaluation system for multimodal language models.☆73Updated 9 months ago
- [ICML 2025] Official Repo for Stability-guided Adaptive Diffusion Acceleration. 🚀🌙Accelerating off-the-shelf diffusion model with a uni…☆28Updated 2 months ago
- Multimodal RewardBench☆46Updated 7 months ago
- [NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models.☆49Updated 11 months ago