mti-lab / SVGEditBench
A benchmark dataset for evaluating LLMs' SVG editing capabilities
☆34 Updated last year
Alternatives and similar repositories for SVGEditBench
Users who are interested in SVGEditBench are comparing it to the repositories listed below
- ☆27 Updated last year
- ☆19 Updated last year
- OpenCOLE: Towards Reproducible Automatic Graphic Design Generation [Inoue+, CVPRW2024 (GDUG)] ☆84 Updated 9 months ago
- ☆78 Updated last year
- ☆68 Updated 3 months ago
- LayoutFlow: Flow Matching for Layout Generation [Andrade Guerreiro et al., ECCV 2024] ☆34 Updated 3 months ago
- [ICLR 2025] Source code for paper "A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegr… ☆79 Updated last year
- ☆80 Updated 6 months ago
- Dataset splits and evaluation code for the paper "Benchmark for Compositional Text-to-Image Synthesis" (NeurIPS 2021) ☆46 Updated 3 years ago
- [CVPR 2025] Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis ☆129 Updated 7 months ago
- Code and data for the paper: Learning Action and Reasoning-Centric Image Editing from Videos and Simulation ☆33 Updated 6 months ago
- (ICCV 2025) "Principal Components" Enable A New Language of Images ☆76 Updated 5 months ago
- [ICML 2024] On Discrete Prompt Optimization for Diffusion Models - Google ☆63 Updated last year
- A Video Tokenizer Evaluation Dataset ☆147 Updated 11 months ago
- [ICML 2025 Tokshop] One-D-Piece: Image Tokenizer Meets Quality-Controllable Compression ☆72 Updated 5 months ago
- [ICCV 2023] Unsupervised Compositional Concepts Discovery with Text-to-Image Generative Models ☆85 Updated 2 years ago
- 🕹️ Explore cutting-edge techniques in game generation ☆52 Updated 4 months ago
- Diffusion Layout Transformer implementation. ☆63 Updated 2 years ago
- T2I-Copilot: A Training-Free Multi-Agent Text-to-Image System for Enhanced Prompt Interpretation and Interactive Generation (ICCV'25) ☆39 Updated 3 months ago
- Code and Data for Paper: SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data ☆35 Updated last year
- Code release for our NeurIPS 2024 Spotlight paper "GenArtist: Multimodal LLM as an Agent for Unified Image Generation and Editing" ☆158 Updated last year
- 🦾 EvalGIM (pronounced as "EvalGym") is an evaluation library for generative image models. It enables easy-to-use, reproducible automatic… ☆90 Updated last year
- T2VScore: Towards A Better Metric for Text-to-Video Generation ☆80 Updated last year
- Official Implementation for "MyVLM: Personalizing VLMs for User-Specific Queries" (ECCV 2024) ☆185 Updated last year
- Official implementation of the paper The Hidden Language of Diffusion Models ☆77 Updated last year
- Code for Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? [COLM 2024] ☆24 Updated last year
- UniDisc: A discrete diffusion model for joint multimodal generation, enabling controllable and efficient text-image synthesis, editing, a… ☆133 Updated 9 months ago
- The official implementation for "MonoFormer: One Transformer for Both Diffusion and Autoregression" ☆89 Updated last year
- VPEval Codebase from Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023) ☆45 Updated 2 years ago
- This repo contains evaluation code for the paper "BLINK: Multimodal Large Language Models Can See but Not Perceive". https://arxiv.or… ☆153 Updated 3 months ago