ZJU-REAL / SVGenius
[ACM MM 2025] SVGenius: Benchmarking LLMs in SVG Understanding, Editing and Generation. https://arxiv.org/abs/2506.03139
☆67Updated 4 months ago
Alternatives and similar repositories for SVGenius
Users who are interested in SVGenius are comparing it to the repositories listed below.
- A curated collection of resources, tools, and frameworks for developing GUI Agents.☆168Updated last week
- ☆22Updated 2 months ago
- [ICCV 2025] MM-IFEngine: Towards Multimodal Instruction Following☆109Updated last month
- [NeurIPS 2025] Mind the Gap: Bridging Thought Leap for Improved CoT Tuning https://arxiv.org/abs/2505.14684☆44Updated 2 weeks ago
- Official repository of "Beyond Fixed: Training-Free Variable-Length Denoising for Diffusion Large Language Models"☆146Updated last month
- [NeurIPS 2025] Efficient Reasoning Vision Language Models☆412Updated last month
- [ICML 2025] This is the official PyTorch implementation of "🎵 HarmoniCa: Harmonizing Training and Inference for Better Feature Caching i…☆43Updated 3 months ago
- An official implementation of "SIM-CoT: Supervised Implicit Chain-of-Thought"☆97Updated last month
- This is a project about visual spatial reasoning.☆76Updated last week
- SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks☆102Updated last month
- Official implementation of "SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience"☆207Updated 3 months ago
- This repository contains the code for our ICML 2025 paper, LENSLLM: Unveiling Fine-Tuning Dynamics for LLM Selection🎉☆24Updated 5 months ago
- ☆49Updated this week
- ✨✨R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning☆267Updated 6 months ago
- ☆53Updated 4 months ago
- Smoothed Preference Optimization via ReNoise Inversion for Aligning Diffusion Models with Varied Human Preferences (ICML 2025)☆25Updated 4 months ago
- Official implementation of MC-LLaVA.☆140Updated 2 months ago
- Official repo of paper "SRUM: Fine-Grained Self-Rewarding for Unified Multimodal Models". A post-training framework that creates a …☆81Updated 3 weeks ago
- A framework for unified personalized model, achieving mutual enhancement between personalized understanding and generation. Demonstrating…☆123Updated last month
- ☆32Updated 3 months ago
- [Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics]: VisuoThink: Empowering LVLM Reasoning with Mul…☆68Updated 3 months ago
- Official Code for "Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search"☆357Updated last month
- Official Repository of ACL 2025 paper OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference☆143Updated 8 months ago
- VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model☆345Updated 6 months ago
- Official code release for paper "Robo-Imagine: A Robotic Video Generation Model, For Autoregressive Long-Term Task Video Generation With …☆27Updated 3 months ago
- (ICCV 2025) Enhance CLIP and MLLM's fine-grained visual representations with generative models.☆73Updated 4 months ago
- A Python script for downloading Hugging Face datasets and models.☆20Updated 7 months ago
- A collection of multimodal reasoning papers, codes, datasets, benchmarks and resources.☆326Updated 2 weeks ago
- Interleaving Reasoning: Next-Generation Reasoning Systems for AGI☆193Updated 3 weeks ago
- OpenThinkIMG is an end-to-end open-source framework that empowers LVLMs to think with images.☆322Updated 5 months ago