roboflow / supervision
We write your reusable computer vision tools. 💜
☆36,270 Updated this week
Alternatives and similar repositories for supervision
Users who are interested in supervision are comparing it to the libraries listed below:
- Python scraper based on AI ☆22,184 Updated this week
- Automate browser based workflows with AI ☆20,054 Updated this week
- The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface. ☆99,647 Updated this week
- Open-Sora: Democratizing Efficient Video Production for All ☆28,236 Updated 8 months ago
- Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work! ☆41,250 Updated this week
- OCR, layout analysis, reading order, table recognition in 90+ languages ☆19,089 Updated 2 months ago
- Build AI Agents, Visually ☆47,880 Updated this week
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing a… ☆33,588 Updated this week
- 🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN ☆58,302 Updated last week
- Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your p… ☆56,720 Updated this week
- Making large AI models cheaper, faster and more accessible ☆41,311 Updated 2 weeks ago
- computer vision and sports ☆4,822 Updated 2 months ago
- 🌐 Make websites accessible for AI agents. Automate tasks online with ease. ☆75,163 Updated this week
- 🪄 Create rich visualizations with AI ☆14,682 Updated this week
- Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models. ☆159,175 Updated this week
- A natural language interface for computers ☆61,528 Updated last month
- A collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures l… ☆9,057 Updated this week
- AI app store powered by 24/7 desktop history. open source | 100% local | dev friendly | 24/7 screen, mic recording ☆16,370 Updated last month
- 🙌 OpenHands: AI-Driven Development ☆66,383 Updated this week
- streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL ☆2,652 Updated 2 weeks ago
- Python tool for converting files and office documents to Markdown. ☆84,994 Updated last month
- 🔊 Text-Prompted Generative Audio Model ☆38,895 Updated last year
- Private AI platform for agents, assistants and enterprise search. Built-in Agent Builder, Deep research, Document analysis, Multi-model s… ☆17,586 Updated this week
- OpenUI lets you describe UI using your imagination, then see it rendered live. ☆21,963 Updated last month
- 🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using Agentic Retrieval 🔄. ☆22,152 Updated last month
- 21 Lessons, Get Started Building with Generative AI ☆104,889 Updated this week
- A simple screen parsing tool towards pure vision based GUI agent ☆24,175 Updated 3 months ago
- Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG. ☆22,955 Updated 2 months ago
- The unified stack for running systems of agents: framework, runtime and control plane. ☆36,666 Updated this week
- RF-DETR is a real-time object detection and segmentation model architecture developed by Roboflow, SOTA on COCO and designed for fine-tun… ☆5,047 Updated last month