ZJU-REAL / SVGeniusLinks
SVGenius: Benchmarking LLMs in SVG Understanding, Editing and Generation. https://arxiv.org/abs/2506.03139
☆62Updated 2 months ago
Alternatives and similar repositories for SVGenius
Users that are interested in SVGenius are comparing it to the libraries listed below
Sorting:
- A curated collection of resources, tools, and frameworks for developing GUI Agents.☆134Updated this week
- Official Code for PosterGen☆48Updated this week
- Mind the Gap: Bridging Thought Leap for Improved CoT Tuning https://arxiv.org/abs/2505.14684☆40Updated 3 months ago
- ☆23Updated last week
- [ICML 2025] This is the official PyTorch implementation of "🎵 HarmoniCa: Harmonizing Training and Inference for Better Feature Caching i…☆42Updated last month
- This repository contains the code for our ICML 2025 paper——LENSLLM: Unveiling Fine-Tuning Dynamics for LLM Selection🎉☆25Updated 3 months ago
- This is a project about visual spatial reasoning.☆53Updated last week
- Official repository of "Beyond Fixed: Training-Free Variable-Length Denoising for Diffusion Large Language Models"☆132Updated last week
- A collection of multimodal reasoning papers, codes, datasets, benchmarks and resources.☆292Updated last week
- Smoothed Preference Optimization via ReNoise Inversion for Aligning Diffusion Models with Varied Human Preferences (ICML 2025)☆24Updated 2 months ago
- Official implementation of MC-LLaVA.☆139Updated last week
- ✨✨R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning☆251Updated 3 months ago
- TinyLLaVA-Video-R1: Towards Smaller LMMs for Video Reasoning☆101Updated 3 months ago
- official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"☆36Updated 7 months ago
- Official implementation of "SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience"☆181Updated 3 weeks ago
- ☆42Updated 2 months ago
- A framework for unified personalized model, achieving mutual enhancement between personalized understanding and generation. Demonstrating…☆121Updated 3 weeks ago
- ☆30Updated last month
- OpenThinkIMG is an end-to-end open-source framework that empowers LVLMs to think with images.☆296Updated 3 months ago
- [ICCV 2025] MM-IFEngine: Towards Multimodal Instruction Following☆101Updated 4 months ago
- [ICML 2025 Oral] An official implementation of VideoRoPE & VideoRoPE++☆184Updated last month
- [ICCV 2025] FonTS: Text Rendering with Typography and Style Controls☆23Updated last week
- Official implementation of GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents☆172Updated 3 months ago
- Efficient Reasoning Vision Language Models☆366Updated last week
- [CVPR2025 Highlight] Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models☆219Updated last month
- The Next Step Forward in Multimodal LLM Alignment☆176Updated 4 months ago
- ☆104Updated last month
- MM-PRM: Enhancing Multimodal Mathematical Reasoning with Scalable Step-Level Supervision☆25Updated 3 months ago
- SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks☆90Updated 2 months ago
- CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning☆25Updated this week