[ACM MM 2025] SVGenius: Benchmarking LLMs in SVG Understanding, Editing and Generation. https://arxiv.org/abs/2506.03139
☆75Nov 10, 2025Updated 3 months ago
Alternatives and similar repositories for SVGenius
Users that are interested in SVGenius are comparing it to the libraries listed below
Sorting:
- This repository is the official implementation of TimeHC-RL (Distilabel (Data Generation) + TRL (SFT) + VeRL (GRPO)).☆48Jun 4, 2025Updated 9 months ago
- [NeurIPS 2025] Mind the Gap: Bridging Thought Leap for Improved CoT Tuning https://arxiv.org/abs/2505.14684☆45Oct 20, 2025Updated 4 months ago
- [NeurIPS 2025] Let LRMs Break Free from Overthinking via Self-Braking Tuning. https://arxiv.org/abs/2505.14604☆55Nov 4, 2025Updated 4 months ago
- GSM8K-V: Can Vision Language Models Solve Grade School Math Word Problems in Visual Contexts☆39Sep 30, 2025Updated 5 months ago
- [ICLR 2026] InftyThink: Breaking the Length Limits of Long-Context Reasoning in Large Language Models☆47Feb 12, 2026Updated 3 weeks ago
- ☆36Oct 9, 2025Updated 4 months ago
- ☆18May 15, 2025Updated 9 months ago
- [AAAI 2026] Test-Time Reinforcement Learning for GUI Grounding via Region Consistency https://arxiv.org/abs/2508.05615☆61Nov 8, 2025Updated 3 months ago
- A curated collection of resources, tools, and frameworks for developing GUI Agents.☆306Feb 11, 2026Updated 3 weeks ago
- A Top-Down Approach for Image Vectorization☆22Dec 17, 2023Updated 2 years ago
- This repository contains the code for the paper - "Aligning Text, Images, and 3D Structure Token-by-Token" (CVPR 2026)☆44Jun 11, 2025Updated 8 months ago
- [AAAI 2026] Minute-Long Videos with Dual Parallelisms☆46Nov 12, 2025Updated 3 months ago
- CoDi:Subject-Consistent and Pose-Diverse Text-to-Image Generation☆37Aug 1, 2025Updated 7 months ago
- (CVPR 2025) Code of "Chat2SVG: Vector Graphics Generation with Large Language Models and Image Diffusion Models"☆212Apr 2, 2025Updated 11 months ago
- [MM 2024 Oral] Refiner for AIGC☆29Jul 29, 2024Updated last year
- [AAAI 2026] GUI-G²: Gaussian Reward Modeling for GUI Grounding☆303Feb 2, 2026Updated last month
- My version of an LLM Websearch Agent using a local SearXNG server because SearXNG is great.☆41Jan 27, 2026Updated last month
- Generate SVG schematics and block diagrams without a mouse.☆31Jul 5, 2025Updated 8 months ago
- [CVPR 2025] MEAT: Multiview Diffusion Model for Human Generation on Megapixels with Mesh Attention☆40Mar 12, 2025Updated 11 months ago
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year
- ☆17Aug 1, 2025Updated 7 months ago
- ☆43May 30, 2025Updated 9 months ago
- [ICLR 2026] SpatialLadder: Progressive Training for Spatial Reasoning in Vision-Language Models☆74Jan 29, 2026Updated last month
- ☆11Oct 24, 2024Updated last year
- 3D Editing via Propagation of Image Prompts to Multi-View☆18Nov 30, 2025Updated 3 months ago
- An Advanced Basic Math Reasoning and Overthinking Evaluation Framework for LLMs☆12Jul 8, 2025Updated 7 months ago
- High Scale Digital Asset Exchange☆10Jul 17, 2020Updated 5 years ago
- 图像工程课程设计 基于 OpenCV 、 Qt 库实现的图像处理软件 大学编程作业(TUST 天津科技大学 2023 年)☆13Aug 3, 2023Updated 2 years ago
- ☆60Updated this week
- ☆15Jul 26, 2025Updated 7 months ago
- ☆17Aug 5, 2025Updated 7 months ago
- Official PyTorch implementation of The Linear Attention Resurrection in Vision Transformer☆16Sep 7, 2024Updated last year
- A connector to integrate FreeSWITCH and OpenERP/Odoo☆14May 16, 2016Updated 9 years ago
- ☆72Oct 13, 2025Updated 4 months ago
- Exposing Text-Image Inconsistency Using Diffusion Models (ICLR 2024)☆10Jun 15, 2024Updated last year
- KiCAD plugin written in Python for programatically placing clusters of components onto a PCB from a layout file.☆10Jun 30, 2021Updated 4 years ago
- Background Subtraction for complex scenes such as intersections from surveillance cameras☆10Jul 15, 2022Updated 3 years ago
- Utility to convert a KiCad netlist into a PCBNEW .kicad_pcb file.☆14Nov 4, 2025Updated 4 months ago
- 一个 Windows 端高效 3D 重建平台,集成 OpenMVG 和 OpenMVS,覆盖从稀疏点云到完整贴图 Mesh 的完整流程;基于 Qt5,自动可视化重建,界面直观,操作简单。☆21Jan 16, 2025Updated last year