ZJU-REAL / SVGeniusLinks
SVGenius: Benchmarking LLMs in SVG Understanding, Editing and Generation. https://arxiv.org/abs/2506.03139
☆65Updated 4 months ago
Alternatives and similar repositories for SVGenius
Users that are interested in SVGenius are comparing it to the libraries listed below
Sorting:
- A curated collection of resources, tools, and frameworks for developing GUI Agents.☆160Updated this week
- [NeurIPS 2025] Mind the Gap: Bridging Thought Leap for Improved CoT Tuning https://arxiv.org/abs/2505.14684☆41Updated 3 weeks ago
- [ICCV 2025] MM-IFEngine: Towards Multimodal Instruction Following☆106Updated last month
- ☆22Updated last month
- This is a project about visual spatial reasoning.☆73Updated last week
- About Official repo of paper "SRUM: Fine-Grained Self-Rewarding for Unified Multimodal Models". A post-training framework that creates a …☆38Updated this week
- An official implementation of "SIM-CoT: Supervised Implicit Chain-of-Thought"☆91Updated 3 weeks ago
- A Unified Framework for High-Performance and Extensible LLM Steering☆79Updated this week
- Official implementation of GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents☆192Updated 5 months ago
- This repository contains the code for our ICML 2025 paper——LENSLLM: Unveiling Fine-Tuning Dynamics for LLM Selection🎉☆24Updated 4 months ago
- Official repository of "Beyond Fixed: Training-Free Variable-Length Denoising for Diffusion Large Language Models"☆136Updated last month
- ✨✨R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning☆265Updated 5 months ago
- A benchmark evaluates LLMs' performance in automating drawing revision tasks.☆56Updated last month
- This repository is the official implementation of TimeHC-RL (Distilabel (Data Generation) + TRL (SFT) + VeRL (GRPO)).☆49Updated 4 months ago
- Official Repository of ACL 2025 paper OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference☆143Updated 7 months ago
- (ArXiv25) Vision Matters: Simple Visual Perturbations Can Boost Multimodal Math Reasoning☆58Updated 2 weeks ago
- Official implementation of "SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience"☆198Updated 2 months ago
- ☆51Updated 3 months ago
- Assessing Context-Aware Creative Intelligence in MLLMs☆23Updated 2 months ago
- A framework for unified personalized model, achieving mutual enhancement between personalized understanding and generation. Demonstrating…☆121Updated 2 weeks ago
- Tree Search for LLM Agent Reinforcement Learning☆161Updated 3 weeks ago
- SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks☆96Updated last month
- Official implementation of MC-LLaVA.☆140Updated last month
- ViewSpatial-Bench:Evaluating Multi-perspective Spatial Localization in Vision-Language Models☆63Updated 4 months ago
- [ICCV 2025] FonTS: Text Rendering with Typography and Style Controls☆30Updated last month
- VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model☆344Updated 6 months ago
- Code for "The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs"☆65Updated 2 weeks ago
- ☆36Updated last week
- [NeurIPS 2025] Efficient Reasoning Vision Language Models☆405Updated last month
- A Gaussian dense reward framework for GUI grounding training☆227Updated last month