joanrod / star-vectorLinks
StarVector is a foundation model for SVG generation that transforms vectorization into a code generation task. Using a vision-language modeling architecture, StarVector processes both visual and textual inputs to produce high-quality SVG code with remarkable precision.
☆4,234Updated 3 months ago
Alternatives and similar repositories for star-vector
Users that are interested in star-vector are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2025] OmniSVG is the first family of end-to-end multimodal SVG generators that leverage pre-trained Vision-Language Models (VLMs…☆2,348Updated last week
- ⚙️ Create and run workflows (RPA 2.0)☆3,876Updated last week
- The world's first open-source multimodal creative assistant This is a substitute for Canva and Manus that prioritizes privacy and is usa…☆5,849Updated 3 months ago
- OmniGen2: Exploration to Advanced Multimodal Generation. https://arxiv.org/abs/2506.18871☆4,020Updated 2 months ago
- A research prototype of a human-centered web agent☆9,632Updated 2 weeks ago
- Toolkit for linearizing PDFs for LLM datasets/training☆16,860Updated this week
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.☆7,275Updated 11 months ago
- The python library for real-time communication☆4,519Updated 3 weeks ago
- Fully local web research and report writing assistant☆8,494Updated 6 months ago
- The first open-source agent skills builder. Define skills by vibe workflow, run on Claude Code, Cursor, Codex & more. Build Clawdbot 🦞· …☆6,486Updated this week
- Local-first AI coworker, with memory☆4,351Updated this week
- SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer☆4,950Updated 2 weeks ago
- Train your AI self, amplify you, bridge the world☆15,087Updated 4 months ago
- A visual playground for agentic workflows: Iterate over your agents 10x faster☆5,666Updated 6 months ago
- Official repository for LTX-Video☆9,235Updated last month
- [CVPR'25] Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System☆3,664Updated 2 months ago
- A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speec…☆5,842Updated this week
- Driving all platforms UI automation with vision-based model☆11,647Updated this week
- Open-source infrastructure for Computer-Use Agents. Sandboxes, SDKs, and benchmarks to train and evaluate AI agents that can control full…☆12,404Updated this week
- A simple screen parsing tool towards pure vision based GUI agent☆24,344Updated 4 months ago
- Dive is an open-source MCP Host Desktop Application that seamlessly integrates with any LLMs supporting function calling capabilities. ✨☆1,726Updated this week
- Embedding Atlas is a tool that provides interactive visualizations for large embeddings. It allows you to visualize, cross-filter, and se…☆4,582Updated last week
- LLM-powered multiagent persona simulation for imagination enhancement and business insights.☆7,217Updated last week
- Scira (Formerly MiniPerplx) is a minimalistic AI-powered search engine that helps you find information on the internet and cites it too. …☆11,397Updated last month
- Fully Local Manus AI. No APIs, No $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and code for the sole cost …☆24,939Updated 2 months ago
- 🔥 Official Firecrawl MCP Server - Adds powerful web scraping and search to Cursor, Claude and any other LLM clients.☆5,415Updated this week
- Prompt Orchestration Markup Language☆4,846Updated 3 weeks ago
- Official implementation of NerualSVG☆1,401Updated last month
- Privacy first, AI meeting assistant with 4x faster Parakeet/Whisper live transcription, speaker diarization, and Ollama summarization bui…☆9,649Updated this week
- Agent S: an open agentic framework that uses computers like a human☆9,713Updated 3 weeks ago