roboflow / supervision
We write your reusable computer vision tools.
⭐ 35,876 · Updated this week
Alternatives and similar repositories for supervision
Users who are interested in supervision are comparing it to the libraries listed below.
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations. ⭐ 27,601 · Updated last month
- An open-source RAG-based tool for chatting with your documents. ⭐ 24,621 · Updated 4 months ago
- OCR, layout analysis, reading order, table recognition in 90+ languages ⭐ 18,857 · Updated 3 weeks ago
- Convert PDF to markdown + JSON quickly with high accuracy ⭐ 29,799 · Updated last week
- Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models. ⭐ 155,718 · Updated this week
- MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone ⭐ 22,200 · Updated last month
- Jan is an open source alternative to ChatGPT that runs 100% offline on your computer. ⭐ 39,213 · Updated this week
- computer vision and sports ⭐ 4,695 · Updated last week
- Universal memory layer for AI Agents; Announcing OpenMemory MCP - local and secure memory management. ⭐ 42,816 · Updated last week
- The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more. ⭐ 50,993 · Updated last week
- Instant voice cloning by MIT and MyShell. Audio foundation model. ⭐ 35,432 · Updated 6 months ago
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM. ⭐ 48,036 · Updated last week
- tiny vision language model ⭐ 8,880 · Updated last month
- A natural language interface for computers ⭐ 60,800 · Updated last week
- Chat with your SQL database. Accurate Text-to-SQL Generation via LLMs using Agentic Retrieval. ⭐ 21,583 · Updated this week
- The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming ⭐ 59,419 · Updated last month
- aider is AI pair programming in your terminal ⭐ 38,350 · Updated last week
- AI app store powered by 24/7 desktop history. open source | 100% local | dev friendly | 24/7 screen, mic recording ⭐ 15,953 · Updated 2 months ago
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag… ⭐ 30,866 · Updated this week
- Multi-agent framework, runtime and control plane. Built for speed, privacy, and scale. ⭐ 35,065 · Updated this week
- OCR & Document Extraction using vision models ⭐ 11,948 · Updated 5 months ago
- Create rich visualizations with AI ⭐ 14,081 · Updated last week
- Implement a ChatGPT-like LLM in PyTorch from scratch, step by step ⭐ 78,286 · Updated this week
- Ollama Python library ⭐ 8,827 · Updated last month
- 21 Lessons, Get Started Building with Generative AI ⭐ 101,499 · Updated last week
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...) ⭐ 115,188 · Updated this week
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als… ⭐ 18,022 · Updated last week
- Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents. ⭐ 66,208 · Updated last week
- RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to creat… ⭐ 67,476 · Updated this week
- Turn any computer or edge device into a command center for your computer vision projects. ⭐ 2,036 · Updated this week