joanrod / star-vector
StarVector is a foundation model for SVG generation that transforms vectorization into a code generation task. Using a vision-language modeling architecture, StarVector processes both visual and textual inputs to produce high-quality SVG code with remarkable precision.
☆4,045 Updated 5 months ago
Alternatives and similar repositories for star-vector
Users who are interested in star-vector are comparing it to the libraries listed below.
- [NeurIPS 2025] OmniSVG is the first family of end-to-end multimodal SVG generators that leverage pre-trained Vision-Language Models (VLMs… ☆2,143 Updated 3 weeks ago
- The world's first open-source multimodal creative assistant. This is a substitute for Canva and Manus that prioritizes privacy and is usa… ☆4,895 Updated 2 weeks ago
- Embedding Atlas is a tool that provides interactive visualizations for large embeddings. It allows you to visualize, cross-filter, and se… ☆3,906 Updated 2 weeks ago
- SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer ☆4,554 Updated this week
- A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speec… ☆2,723 Updated 2 weeks ago
- Fully local web research and report writing assistant ☆8,182 Updated 2 months ago
- 🪄 Create rich visualizations with AI ☆13,731 Updated last week
- Agent S: an open agentic framework that uses computers like a human ☆6,886 Updated this week
- A visual playground for agentic workflows: Iterate over your agents 10x faster ☆5,529 Updated 2 months ago
- ☆13,719 Updated last month
- The Open-Source Agentic Workspace for Human-AI Collaboration. ☆4,714 Updated last week
- Multilingual Document Layout Parsing in a Single Vision-Language Model ☆4,850 Updated last month
- Bytebot is a self-hosted AI desktop agent that automates computer tasks through natural language commands, operating within a containeriz… ☆8,619 Updated 3 weeks ago
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs. ☆7,192 Updated 7 months ago
- SkyReels-V2: Infinite-length Film Generative model ☆4,633 Updated last month
- A free and open source, self-hosted, AI-based live meeting note taker and minutes summary generator that can completely run in your Local … ☆7,647 Updated last week
- MAGI-1: Autoregressive Video Generation at Scale ☆3,499 Updated 3 months ago
- ⚙️ Create and run workflows (RPA 2.0) ☆3,714 Updated last week
- 🔥 Official Firecrawl MCP Server - Adds powerful web scraping and search to Cursor, Claude and any other LLM clients. ☆4,648 Updated last week
- OmniGen2: Exploration to Advanced Multimodal Generation. ☆3,883 Updated last week
- OCR & Document Extraction using vision models ☆11,866 Updated 4 months ago
- An open source deep research clone. AI Agent that reasons over large amounts of web data extracted with Firecrawl ☆6,051 Updated 5 months ago
- Task-Aware Agent-driven Prompt Optimization Framework ☆3,629 Updated last week
- Open-source unified multimodal model ☆5,118 Updated last month
- Toolkit for linearizing PDFs for LLM datasets/training ☆14,208 Updated this week
- A research prototype of a human-centered web agent ☆7,739 Updated this week
- Put an end to code hallucinations! GitMCP is a free, open-source, remote MCP server for any GitHub project ☆6,566 Updated last month
- Yet Another Document Translator ☆5,353 Updated this week
- Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing. ☆5,583 Updated last week
- Local-first AI Notepad for Private Meetings ☆6,286 Updated this week