roboflow / supervisionLinks
We write your reusable computer vision tools. π
β36,436Updated this week
Alternatives and similar repositories for supervision
Users that are interested in supervision are comparing it to the libraries listed below
Sorting:
- An open-source RAG-based tool for chatting with your documents.β24,990Updated 7 months ago
- tiny vision language modelβ9,303Updated 2 months ago
- computer vision and sportsβ4,873Updated 3 months ago
- The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.β54,105Updated this week
- Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"β7,042Updated 10 months ago
- A collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures lβ¦β9,142Updated this week
- A natural language interface for computersβ62,041Updated 2 months ago
- Build multi-agent systems that learn and improve with every interaction.β37,479Updated this week
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.β27,861Updated 4 months ago
- Trackers gives you clean, modular re-implementations of leading multi-object tracking algorithms released under the permissive Apache 2.0β¦β2,389Updated this week
- Turn any computer or edge device into a command center for your computer vision projects.β2,183Updated this week
- CoTracker is a model for tracking any point (pixel) on a video.β4,806Updated last year
- real time face swap and one-click video deepfake with only a single imageβ79,252Updated this week
- screenpipe turns your computer into a personal AI that knows everything you've done. record. search. automate. all local, all private, alβ¦β16,679Updated this week
- [ICLR 2026] RF-DETR is a real-time object detection and segmentation model architecture developed by Roboflow, SOTA on COCO, designed forβ¦β5,527Updated this week
- OCR, layout analysis, reading order, table recognition in 90+ languagesβ19,228Updated this week
- An autonomous agent that conducts deep research on any data using any LLM providers.β25,156Updated last week
- π€ Chat with your SQL database π. Accurate Text-to-SQL Generation via LLMs using Agentic Retrieval π.β22,496Updated this week
- SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecβ¦β18,430Updated this week
- The first real AI developerβ33,761Updated 2 months ago
- A framework to enable multimodal models to operate a computer.β10,131Updated 4 months ago
- The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement, running on consumer-gβ¦β42,616Updated this week
- StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generationβ10,592Updated last year
- OCR & Document Extraction using vision modelsβ12,070Updated 8 months ago
- Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your pβ¦β57,533Updated this week
- π The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programmingβ63,818Updated 2 weeks ago
- Official inference repo for FLUX.1 modelsβ25,187Updated 6 months ago
- Self-hosted AI coding assistantβ32,849Updated this week
- AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file convertβ¦β24,076Updated this week
- streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VLβ2,659Updated last week