ZJU-REAL / SVGenius
[ACM MM 2025] SVGenius: Benchmarking LLMs in SVG Understanding, Editing and Generation. https://arxiv.org/abs/2506.03139
☆74 Updated last month
Alternatives and similar repositories for SVGenius
Users who are interested in SVGenius are comparing it to the libraries listed below.
- [ICCV 2025] MM-IFEngine: Towards Multimodal Instruction Following ☆114 Updated last week
- [NeurIPS 2025] Mind the Gap: Bridging Thought Leap for Improved CoT Tuning https://arxiv.org/abs/2505.14684 ☆45 Updated 2 months ago
- ☆57 Updated 5 months ago
- ☆23 Updated last month
- Official repo of paper "SRUM: Fine-Grained Self-Rewarding for Unified Multimodal Models". A post-training framework that creates a cost-e… ☆88 Updated 3 weeks ago
- This is a project about visual spatial reasoning. ☆81 Updated 2 weeks ago
- An official implementation of "SIM-CoT: Supervised Implicit Chain-of-Thought" ☆112 Updated 2 months ago
- Official Repository of ACL 2025 paper OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference ☆144 Updated 9 months ago
- VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model ☆343 Updated 8 months ago
- This repository contains the code for our ICML 2025 paper, LENSLLM: Unveiling Fine-Tuning Dynamics for LLM Selection 🎉 ☆25 Updated 6 months ago
- A curated collection of resources, tools, and frameworks for developing GUI Agents. ☆205 Updated last week
- A benchmark that evaluates LLMs' performance in automating drawing revision tasks. ☆56 Updated 3 months ago
- This repository is the official implementation of TimeHC-RL (Distilabel (Data Generation) + TRL (SFT) + VeRL (GRPO)). ☆47 Updated 6 months ago
- Official code release for paper "Robo-Imagine: A Robotic Video Generation Model, For Autoregressive Long-Term Task Video Generation With … ☆28 Updated 5 months ago
- ☆152 Updated 3 weeks ago
- Official repository of "Beyond Fixed: Training-Free Variable-Length Denoising for Diffusion Large Language Models" ☆155 Updated 3 months ago
- [ICCV 2025] FonTS: Text Rendering with Typography and Style Controls ☆35 Updated last month
- [NeurIPS 2025 Spotlight] Official PyTorch implementation of Vgent ☆32 Updated 2 weeks ago
- 🚀 Daily AI Research Digest: Tracking breakthroughs in AI/NLP/CV/Robotics with dynamic updates and paper navigation. ☆51 Updated this week
- [AAAI 2026] ✨ TSPO: Temporal Sampling Policy Optimization for Long-form Video Language Understanding ☆107 Updated last month
- [ICML 2025] This is the official PyTorch implementation of "🎵 HarmoniCa: Harmonizing Training and Inference for Better Feature Caching i… ☆44 Updated 5 months ago
- Official release of "Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning" ☆91 Updated 3 weeks ago
- Official implementation of MC-LLaVA. ☆139 Updated last month
- ViewSpatial-Bench: Evaluating Multi-perspective Spatial Localization in Vision-Language Models ☆66 Updated 6 months ago
- (ArXiv25) Vision Matters: Simple Visual Perturbations Can Boost Multimodal Math Reasoning ☆58 Updated 2 months ago
- Incentivizing "Thinking with Long Videos" via Native Tool Calling ☆142 Updated this week
- Agentic MLLMs ☆111 Updated last month
- [ICML 2025] Smoothed Preference Optimization via ReNoise Inversion for Aligning Diffusion Models with Varied Human Preferences ☆29 Updated 5 months ago
- A framework for a unified personalized model, achieving mutual enhancement between personalized understanding and generation. Demonstrating… ☆127 Updated 2 months ago
- The paper list of "Memory in the Age of AI Agents: A Survey" ☆243 Updated this week