joanrod / star-vectorLinks
StarVector is a foundation model for SVG generation that transforms vectorization into a code generation task. Using a vision-language modeling architecture, StarVector processes both visual and textual inputs to produce high-quality SVG code with remarkable precision.
☆4,208Updated 2 months ago
Alternatives and similar repositories for star-vector
Users that are interested in star-vector are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2025] OmniSVG is the first family of end-to-end multimodal SVG generators that leverage pre-trained Vision-Language Models (VLMs…☆2,312Updated 3 weeks ago
- The world's first open-source multimodal creative assistant This is a substitute for Canva and Manus that prioritizes privacy and is usa…☆5,792Updated 2 months ago
- Pioneering Automated GUI Interaction with Native Agents☆9,060Updated last week
- OmniGen2: Exploration to Advanced Multimodal Generation. https://arxiv.org/abs/2506.18871☆4,007Updated last month
- SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer☆4,916Updated last week
- Agent S: an open agentic framework that uses computers like a human☆9,515Updated last week
- Official implementation of NerualSVG☆1,401Updated last month
- MAGI-1: Autoregressive Video Generation at Scale☆3,635Updated 7 months ago
- Vibe Workflow Platform for Non-technical Creators.☆6,000Updated this week
- A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speec…☆3,595Updated this week
- Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation☆4,453Updated 7 months ago
- Official repository for LTX-Video☆9,174Updated 3 weeks ago
- Driving all platforms UI automation with vision-based model☆11,309Updated last week
- A research prototype of a human-centered web agent☆9,608Updated this week
- ⚙️ Create and run workflows (RPA 2.0)☆3,855Updated last week
- OCR & Document Extraction using vision models☆12,032Updated 8 months ago
- Local-first AI coworker, with memory☆4,329Updated this week
- Fully local web research and report writing assistant☆8,477Updated 5 months ago
- Kortix – build, manage and train AI Agents.☆19,227Updated this week
- Open-source unified multimodal model☆5,577Updated 2 months ago
- Embedding Atlas is a tool that provides interactive visualizations for large embeddings. It allows you to visualize, cross-filter, and se…☆4,551Updated this week
- SkyReels-V2: Infinite-length Film Generative model☆5,910Updated 5 months ago
- Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.☆7,071Updated 3 weeks ago
- [NeurIPS 2025] SpatialLM: Training Large Language Models for Structured Indoor Modeling☆4,204Updated 4 months ago
- The python library for real-time communication☆4,487Updated 2 weeks ago
- [CVPR'25] Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System☆3,662Updated last month
- Video translation and dubbing tool powered by LLMs. The video translator offers 100 language translations and one-click full-process depl…☆9,292Updated last month
- OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340☆4,299Updated last month
- A free + OSS logo generator powered by Flux on Together AI☆6,198Updated last month
- Lets make video diffusion practical!☆16,553Updated 3 months ago