roboflow / supervision
We write your reusable computer vision tools.
★34,592, updated this week
Alternatives and similar repositories for supervision
Users who are interested in supervision are comparing it to the libraries listed below.
- OCR, layout analysis, reading order, table recognition in 90+ languages (★18,457, updated this week)
- RF-DETR is a real-time object detection model architecture developed by Roboflow, SOTA on COCO and designed for fine-tuning. (★2,908, updated this week)
- Convert PDF to markdown + JSON quickly with high accuracy (★28,444, updated this week)
- OpenHands: Code Less, Make More (★62,981, updated this week)
- Ultralytics YOLO (★45,209, updated this week)
- OCR & Document Extraction using vision models (★11,806, updated 3 months ago)
- A unified library for object tracking featuring clean room re-implementations of leading multi-object tracking algorithms (★2,083, updated this week)
- Crawl a site to generate knowledge files to create your own custom GPT from a URL (★21,852, updated 2 months ago)
- Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN (★51,866, updated this week)
- Run your own AI cluster at home with everyday devices (★30,736, updated 5 months ago)
- MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone (★21,115, updated this week)
- Official inference framework for 1-bit LLMs (★21,185, updated 3 months ago)
- AI app store powered by 24/7 desktop history. open source | 100% local | dev friendly | 24/7 screen, mic recording (★15,558, updated this week)
- CoTracker is a model for tracking any point (pixel) on a video. (★4,540, updated 7 months ago)
- Python tool for converting files and office documents to Markdown. (★72,874, updated last week)
- A simple screen parsing tool towards pure vision based GUI agent (★23,430, updated 2 weeks ago)
- Create rich visualizations with AI (★13,596, updated last week)
- Instant voice cloning by MIT and MyShell. Audio foundation model. (★34,291, updated 4 months ago)
- ⏩ Ship faster with Continuous AI. Build and run custom agents across your IDE, terminal, and CI (★28,700, updated this week)
- A collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures l… (★8,313, updated last week)
- An open-source RAG-based tool for chatting with your documents. (★23,006, updated 2 months ago)
- Build and share delightful machine learning apps, all in Python. Star to support our work! (★39,719, updated this week)
- Open-Sora: Democratizing Efficient Video Production for All (★27,135, updated 4 months ago)
- Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. … (★30,863, updated last week)
- Streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL (★2,628, updated this week)
- Universal memory layer for AI Agents; Announcing OpenMemory MCP - local and secure memory management. (★39,140, updated last week)
- Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ in… (★134,712, updated this week)
- A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool that converts PDF to Markdown and JSON formats. (★43,287, updated this week)
- The first real AI developer (★33,332, updated 6 months ago)
- Lobe Chat - an open-source, modern design AI chat framework. Supports multiple AI providers (OpenAI / Claude 4 / Gemini / DeepSeek / O… (★65,249, updated this week)