ZJU-REAL / SVGeniusLinks
[ACM MM 2025] SVGenius: Benchmarking LLMs in SVG Understanding, Editing and Generation. https://arxiv.org/abs/2506.03139
☆75Updated 3 months ago
Alternatives and similar repositories for SVGenius
Users that are interested in SVGenius are comparing it to the libraries listed below
Sorting:
- [ICCV 2025] MM-IFEngine: Towards Multimodal Instruction Following☆116Updated 2 months ago
- Official repo of paper "SRUM: Fine-Grained Self-Rewarding for Unified Multimodal Models". A post-training framework that creates a cost-e…☆92Updated 2 months ago
- ☆23Updated last month
- ☆59Updated 7 months ago
- Official Repository of ACL 2025 paper OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference☆145Updated 11 months ago
- This is a project about visual spatial reasoning.☆89Updated last month
- [ICLR 2026] An official implementation of "SIM-CoT: Supervised Implicit Chain-of-Thought"☆165Updated last week
- [NeurIPS 2025] Mind the Gap: Bridging Thought Leap for Improved CoT Tuning https://arxiv.org/abs/2505.14684☆45Updated 3 months ago
- [ICML 2025] This is the official PyTorch implementation of "🎵 HarmoniCa: Harmonizing Training and Inference for Better Feature Caching i…☆44Updated 7 months ago
- ☆169Updated 2 months ago
- Official code release for paper "Robo-Imagine: A Robotic Video Generation Model, For Autoregressive Long-Term Task Video Generation With …☆28Updated 7 months ago
- [ICML 2025] Smoothed Preference Optimization via ReNoise Inversion for Aligning Diffusion Models with Varied Human Preferences☆29Updated 7 months ago
- This repository contains the code for our ICML 2025 paper——LENSLLM: Unveiling Fine-Tuning Dynamics for LLM Selection🎉☆25Updated 8 months ago
- [ICLR 2026] The official repository for paper "ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning"☆143Updated 2 weeks ago
- [ICLR 2026] Official repository of "Beyond Fixed: Training-Free Variable-Length Denoising for Diffusion Large Language Models"☆162Updated 5 months ago
- OneCAT: Decoder-Only Auto-Regressive Model for Unified Understanding and Generation☆255Updated 4 months ago
- VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model☆342Updated 9 months ago
- Interleaving Reasoning: Next-Generation Reasoning Systems for AGI☆251Updated 3 months ago
- ☆174Updated 3 weeks ago
- A benchmark evaluates LLMs' performance in automating drawing revision tasks.☆57Updated last month
- [FSE'2026] SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks☆149Updated this week
- Step-DeepResearch☆499Updated last week
- We introduce 'Thinking with Video', a new paradigm leveraging video generation for multimodal reasoning. Our VideoThinkBench shows that S…☆237Updated last week
- Assessing Context-Aware Creative Intelligence in MLLMs☆23Updated 6 months ago
- Official implementation of GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization☆374Updated last month
- [AAAI 2026] ✨ TSPO: Temporal Sampling Policy Optimization for Long-form Video Language Understanding☆117Updated 3 months ago
- ✨✨ [ICLR 2026] R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning☆281Updated 9 months ago
- LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling☆187Updated 2 weeks ago
- [ICLR 2026] The official repository for the paper "AdaReasoner: Dynamic Tool Orchestration for Iterative Visual Reasoning".☆63Updated 2 weeks ago
- A curated collection of resources, tools, and frameworks for developing GUI Agents.☆303Updated last week