joanrod / star-vectorLinks
StarVector is a foundation model for SVG generation that transforms vectorization into a code generation task. Using a vision-language modeling architecture, StarVector processes both visual and textual inputs to produce high-quality SVG code with remarkable precision.
☆4,095Updated 2 weeks ago
Alternatives and similar repositories for star-vector
Users that are interested in star-vector are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2025] OmniSVG is the first family of end-to-end multimodal SVG generators that leverage pre-trained Vision-Language Models (VLMs…☆2,225Updated 2 months ago
- A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speec…☆2,924Updated this week
- Embedding Atlas is a tool that provides interactive visualizations for large embeddings. It allows you to visualize, cross-filter, and se…☆4,126Updated this week
- The world's first open-source multimodal creative assistant This is a substitute for Canva and Manus that prioritizes privacy and is usa…☆5,141Updated last week
- Agent S: an open agentic framework that uses computers like a human☆8,292Updated 3 weeks ago
- The Open-Source Agentic Workspace for Human-AI Collaboration.☆4,793Updated this week
- A visual playground for agentic workflows: Iterate over your agents 10x faster☆5,596Updated 4 months ago
- Fully local web research and report writing assistant☆8,328Updated 3 months ago
- Simultaneous speech-to-text model☆8,512Updated this week
- Official repository for LTX-Video☆8,779Updated 3 weeks ago
- OmniGen2: Exploration to Advanced Multimodal Generation.☆3,940Updated last month
- Video-based AI memory library. Store millions of text chunks in MP4 files with lightning-fast semantic search. No database needed.☆10,381Updated last month
- ⚙️ Create and run workflows (RPA 2.0)☆3,776Updated this week
- Convert a Docker image to an executable☆2,016Updated 6 months ago
- Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.☆6,117Updated last week
- A free + OSS logo generator powered by Flux on Together AI☆6,096Updated 10 months ago
- 🪐 Markdown with superpowers — from ideas to papers, presentations and books.☆9,264Updated 2 weeks ago
- Open-Source Chrome extension for AI-powered web automation. Run multi-agent workflows using your own LLM API key. Alternative to OpenAI O…☆11,345Updated this week
- A free and open source, self hosted Ai based live meeting note taker and minutes summary generator that can completely run in your Local …☆8,254Updated last week
- SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer☆4,712Updated last week
- Scira (Formerly MiniPerplx) is a minimalistic AI-powered search engine that helps you find information on the internet and cites it too. …☆10,959Updated this week
- Turn any webpage into structured data using LLMs☆6,103Updated 2 weeks ago
- Open Source Application for Advanced LLM + Diffusion Engineering: interact, train, fine-tune, and evaluate large language models on your …☆4,517Updated last week
- ☆14,164Updated last month
- Multilingual Document Layout Parsing in a Single Vision-Language Model☆5,684Updated 3 weeks ago
- This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025☆6,891Updated 6 months ago
- The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.☆7,787Updated 2 weeks ago
- "AutoAgent: Fully-Automated and Zero-Code LLM Agent Framework"☆7,808Updated last month
- The python library for real-time communication☆4,403Updated 2 months ago
- 🪄 Create rich visualizations with AI☆14,169Updated 2 weeks ago