joanrod / star-vectorLinks
StarVector is a foundation model for SVG generation that transforms vectorization into a code generation task. Using a vision-language modeling architecture, StarVector processes both visual and textual inputs to produce high-quality SVG code with remarkable precision.
☆4,173Updated last month
Alternatives and similar repositories for star-vector
Users that are interested in star-vector are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2025] OmniSVG is the first family of end-to-end multimodal SVG generators that leverage pre-trained Vision-Language Models (VLMs…☆2,279Updated last week
- A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speec…☆3,153Updated this week
- Embedding Atlas is a tool that provides interactive visualizations for large embeddings. It allows you to visualize, cross-filter, and se…☆4,487Updated last week
- Fully local web research and report writing assistant☆8,426Updated 4 months ago
- The world's first open-source multimodal creative assistant This is a substitute for Canva and Manus that prioritizes privacy and is usa…☆5,590Updated last month
- The python library for real-time communication☆4,469Updated last month
- Toolkit for linearizing PDFs for LLM datasets/training☆16,483Updated last week
- OmniGen2: Exploration to Advanced Multimodal Generation. https://arxiv.org/abs/2506.18871☆3,979Updated 3 weeks ago
- Agent S: an open agentic framework that uses computers like a human☆9,211Updated 2 weeks ago
- ⚙️ Create and run workflows (RPA 2.0)☆3,829Updated last week
- 🪄 Create rich visualizations with AI☆14,602Updated 3 weeks ago
- SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer☆4,847Updated last week
- Official repository for LTX-Video☆8,946Updated 2 months ago
- SkyReels-V2: Infinite-length Film Generative model☆5,375Updated 4 months ago
- Pioneering Automated GUI Interaction with Native Agents☆8,710Updated this week
- Driving all platforms UI automation with vision-based model☆11,076Updated this week
- ☆10,022Updated 4 months ago
- Kortix – build, manage and train AI Agents.☆18,902Updated this week
- Official Repo for "TheoremExplainAgent: Towards Video-based Multimodal Explanations for LLM Theorem Understanding" [ACL 2025 oral]☆1,443Updated 5 months ago
- Wan: Open and Advanced Large-Scale Video Generative Models☆15,022Updated 2 weeks ago
- Like Manus, Computer Use Agent(CUA) and Omniparser, we are computer-using agents.AI-driven local automation assistant that uses natural l…☆3,791Updated 7 months ago
- Multilingual Document Layout Parsing in a Single Vision-Language Model☆5,936Updated this week
- Stay on top of trending topics on social media and the web with AI☆3,920Updated 10 months ago
- Video translation and dubbing tool powered by LLMs. The video translator offers 100 language translations and one-click full-process depl…☆9,128Updated 3 weeks ago
- Local-first, open-source tools for automating everyday work.☆4,299Updated this week
- Simultaneous speech-to-text model☆9,373Updated last week
- A visual playground for agentic workflows: Iterate over your agents 10x faster☆5,628Updated 5 months ago
- [CVPR'25] Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System☆3,651Updated 3 weeks ago
- [CVPR 2025] Magma: A Foundation Model for Multimodal AI Agents☆1,884Updated 2 months ago
- An open source deep research clone. AI Agent that reasons large amounts of web data extracted with Firecrawl☆6,131Updated 7 months ago