mti-lab / SVGEditBench
A benchmark dataset for evaluating LLMs' SVG editing capabilities
☆36 · Updated last year
Alternatives and similar repositories for SVGEditBench
Users who are interested in SVGEditBench are comparing it to the repositories listed below.
- ☆27 · Updated last year
- ☆19 · Updated last year
- OpenCOLE: Towards Reproducible Automatic Graphic Design Generation [Inoue+, CVPRW2024 (GDUG)] ☆85 · Updated 10 months ago
- ☆70 · Updated 4 months ago
- [ICML 2024] On Discrete Prompt Optimization for Diffusion Models - Google ☆63 · Updated last year
- [ICLR 2025] Source code for paper "A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegr… ☆79 · Updated last year
- LayoutFlow: Flow Matching for Layout Generation [Andrade Guerreiro et al., ECCV 2024] ☆35 · Updated 4 months ago
- ☆81 · Updated 7 months ago
- [CVPR 2025] Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis ☆130 · Updated 8 months ago
- ☆79 · Updated last year
- Code and data for the paper: Learning Action and Reasoning-Centric Image Editing from Videos and Simulation ☆33 · Updated 7 months ago
- Diffusion Layout Transformer implementation. ☆63 · Updated 2 years ago
- Code release for our NeurIPS 2024 Spotlight paper "GenArtist: Multimodal LLM as an Agent for Unified Image Generation and Editing" ☆160 · Updated last year
- This repo contains evaluation code for the paper "BLINK: Multimodal Large Language Models Can See but Not Perceive". https://arxiv.or… ☆159 · Updated 4 months ago
- ☆53 · Updated 2 years ago
- Official Implementation for "MyVLM: Personalizing VLMs for User-Specific Queries" (ECCV 2024) ☆185 · Updated last year
- Official Implementation of ICLR'24: Kosmos-G: Generating Images in Context with Multimodal Large Language Models ☆74 · Updated last year
- Dataset splits and evaluation code for the paper "Benchmark for Compositional Text-to-Image Synthesis" (NeurIPS 2021) ☆45 · Updated 3 years ago
- ☆41 · Updated last year
- A Video Tokenizer Evaluation Dataset ☆150 · Updated last year
- Code for the paper "AutoPresent: Designing Structured Visuals From Scratch" (CVPR 2025) ☆153 · Updated 8 months ago
- RichHF-18K dataset contains rich human feedback labels we collected for our CVPR'24 paper: https://arxiv.org/pdf/2312.10240, along with t… ☆153 · Updated last year
- the official repo for "D-AR: Diffusion via Autoregressive Models" ☆133 · Updated last week
- (ICCV 2025) "Principal Components" Enable A New Language of Images ☆78 · Updated 6 months ago
- T2I-Copilot: A Training-Free Multi-Agent Text-to-Image System for Enhanced Prompt Interpretation and Interactive Generation (ICCV'25) ☆41 · Updated 4 months ago
- [ICCV 2023] Unsupervised Compositional Concepts Discovery with Text-to-Image Generative Models ☆85 · Updated 2 years ago
- UniDisc: A discrete diffusion model for joint multimodal generation, enabling controllable and efficient text-image synthesis, editing, a… ☆134 · Updated 10 months ago
- TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering ☆181 · Updated last year
- Code for ACM MM'23 paper: LayoutLLM-T2I: Eliciting Layout Guidance from LLM for Text-to-Image Generation ☆50 · Updated last year
- The official implementation for "MonoFormer: One Transformer for Both Diffusion and Autoregression" ☆89 · Updated last year