ZJU-REAL / SVGeniusLinks
[ACM MM 2025] SVGenius: Benchmarking LLMs in SVG Understanding, Editing and Generation. https://arxiv.org/abs/2506.03139
☆68Updated 2 weeks ago
Alternatives and similar repositories for SVGenius
Users that are interested in SVGenius are comparing it to the libraries listed below
Sorting:
- ☆23Updated 2 weeks ago
- [ICCV 2025] MM-IFEngine: Towards Multimodal Instruction Following☆114Updated 2 months ago
- [ICML 2025] This is the official PyTorch implementation of "🎵 HarmoniCa: Harmonizing Training and Inference for Better Feature Caching i…☆44Updated 4 months ago
- [NeurIPS 2025] Mind the Gap: Bridging Thought Leap for Improved CoT Tuning https://arxiv.org/abs/2505.14684☆45Updated last month
- A curated collection of resources, tools, and frameworks for developing GUI Agents.☆198Updated this week
- An official implementation of "SIM-CoT: Supervised Implicit Chain-of-Thought"☆101Updated 2 months ago
- ☆54Updated 5 months ago
- We introduce 'Thinking with Video', a new paradigm leveraging video generation for multimodal reasoning. Our VideoThinkBench shows that S…☆212Updated this week
- [ICCV 2025] FonTS: Text Rendering with Typography and Style Controls☆33Updated 3 weeks ago
- Smoothed Preference Optimization via ReNoise Inversion for Aligning Diffusion Models with Varied Human Preferences (ICML 2025)☆26Updated 5 months ago
- This is a project about visual spatial reasoning.☆79Updated last month
- This repository contains the code for our ICML 2025 paper——LENSLLM: Unveiling Fine-Tuning Dynamics for LLM Selection🎉☆24Updated 6 months ago
- ViewSpatial-Bench:Evaluating Multi-perspective Spatial Localization in Vision-Language Models☆66Updated 6 months ago
- Official implementation of MC-LLaVA.☆139Updated 2 weeks ago
- Official repository of "Beyond Fixed: Training-Free Variable-Length Denoising for Diffusion Large Language Models"☆151Updated 2 months ago
- About Official repo of paper "SRUM: Fine-Grained Self-Rewarding for Unified Multimodal Models". A post-training framework that creates a …☆83Updated this week
- A framework for unified personalized model, achieving mutual enhancement between personalized understanding and generation. Demonstrating…☆125Updated last month
- Interleaving Reasoning: Next-Generation Reasoning Systems for AGI☆209Updated last month
- Official code release for paper "Robo-Imagine: A Robotic Video Generation Model, For Autoregressive Long-Term Task Video Generation With …☆28Updated 4 months ago
- This repository is the official implementation of TimeHC-RL (Distilabel (Data Generation) + TRL (SFT) + VeRL (GRPO)).☆47Updated 5 months ago
- OneCAT: Decoder-Only Auto-Regressive Model for Unified Understanding and Generation☆239Updated 2 months ago
- A python script for downloading huggingface datasets and models.☆20Updated 7 months ago
- SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks☆107Updated last week
- [AAAI 2026] Test-Time Reinforcement Learning for GUI Grounding via Region Consistency https://arxiv.org/abs/2508.05615☆51Updated 3 weeks ago
- Official release of "Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning"☆75Updated this week
- Official Code for "Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search"☆368Updated 2 months ago
- [NeurIPS 2025] Efficient Reasoning Vision Language Models☆419Updated 2 months ago
- Official implementation of "SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience"☆210Updated 3 months ago
- ☆32Updated 4 months ago
- [NeurIPS 2025] Let LRMs Break Free from Overthinking via Self-Braking Tuning. https://arxiv.org/abs/2505.14604☆51Updated 3 weeks ago