This tool has been deprecated. Use Agentic Document Extraction instead.
☆5,275 Jan 29, 2026 Updated 3 months ago
Alternatives and similar repositories for vision-agent
Users that are interested in vision-agent are comparing it to the libraries listed below.
- The python library for real-time communication ☆4,587 Jan 12, 2026 Updated 3 months ago
- This tool has been deprecated. Use Agentic Document Extraction instead. ☆48 Feb 5, 2026 Updated 3 months ago
- Run agents as production software. ☆39,835 Updated this week
- Toolkit for linearizing PDFs for LLM datasets/training ☆17,231 Mar 25, 2026 Updated last month
- 🪄 Create rich visualizations with AI ☆15,247 Updated this week
- A simple screen parsing tool towards pure vision based GUI agent ☆24,714 Apr 13, 2026 Updated 3 weeks ago
- An autonomous agent that conducts deep research on any data using any LLM providers ☆26,806 Apr 16, 2026 Updated 2 weeks ago
- Universal memory layer for AI Agents ☆54,714 Updated this week
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations. ☆28,150 Sep 30, 2025 Updated 7 months ago
- LLM-Driven Extraction of Unstructured Data — Built for API Deployments & ETL Pipeline Workflows ☆6,557 Updated this week
- Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work t… ☆50,629 Updated this week
- 🌐 Make websites accessible for AI agents. Automate tasks online with ease. ☆92,144 Updated this week
- Simple, unified interface to multiple Generative AI providers ☆13,749 Dec 15, 2025 Updated 4 months ago
- 🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org ☆16,869 Updated this week
- Fully local web research and report writing assistant ☆9,083 Apr 21, 2026 Updated 2 weeks ago
- Spec-driven development for large codebases ☆5,358 Apr 29, 2026 Updated last week
- A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone ☆24,494 Apr 27, 2026 Updated last week
- 🙌 OpenHands: AI-Driven Development ☆72,542 Updated this week
- Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG. ☆23,499 Oct 28, 2025 Updated 6 months ago
- A framework for building realtime voice AI agents 🤖🎙️📹 ☆10,353 Updated this week
- Web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally. ☆63,536 Updated this week
- A programming framework for agentic AI ☆57,588 Apr 15, 2026 Updated 3 weeks ago
- Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team. ☆21,427 Apr 15, 2026 Updated 3 weeks ago
- Agent S: an open agentic framework that uses computers like a human ☆11,011 Feb 21, 2026 Updated 2 months ago
- A fast multimodal LLM for real-time voice ☆4,412 Dec 12, 2025 Updated 4 months ago
- Build Real-Time Knowledge Graphs for AI Agents ☆25,612 Updated this week
- 🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN ☆64,964 Updated this week
- 🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming ☆67,611 Jan 21, 2026 Updated 3 months ago
- Flexible and powerful framework for managing multiple AI agents and handling complex conversations ☆7,601 Apr 24, 2026 Updated last week
- Open-source framework for conversational voice AI agents ☆10,462 Updated this week
- The Autonomous Company Operating System ☆19,705 Updated this week
- Python SDK for AI agent monitoring, LLM cost tracking, benchmarking, and more. Integrates with most LLMs and agent frameworks including C… ☆5,512 Mar 19, 2026 Updated last month
- Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time. ☆22,391 Apr 12, 2026 Updated 3 weeks ago
- An open-source RAG-based tool for chatting with your documents. ☆25,350 Apr 3, 2026 Updated last month
- PraisonAI 🦞 — Hire a 24/7 AI Workforce. Stop writing boilerplate and start shipping autonomous self-improving agents that research, plan… ☆7,013 Updated this week
- We write your reusable computer vision tools. 💜 ☆38,290 Updated this week
- DSPy: The framework for programming—not prompting—language models ☆34,180 Updated this week
- Open-source text-to-SQL and text-to-chart GenBI agent with a semantic layer. Ask your database questions in natural language — get accura… ☆15,061 Updated this week
- Automate browser based workflows with AI ☆21,491 Updated this week