theredsix / cerebellum
Browser automation system that uses AI-driven planning to navigate web pages and perform goals.
☆854 · Nov 10, 2025 · Updated 3 months ago
Alternatives and similar repositories for cerebellum
Users who are interested in cerebellum are comparing it to the libraries listed below.
- Open Source Browser API for AI Agents & Apps. Steel Browser is a batteries-included browser sandbox that lets you automate the web wit… ☆6,395 · Jan 28, 2026 · Updated 2 weeks ago
- Automate browser-based workflows with AI ☆20,399 · Updated this week
- ☆3,497 · Nov 15, 2024 · Updated last year
- The SOTA Open-Source Browser Agent for autonomously performing complex tasks on the web ☆2,334 · Jun 9, 2025 · Updated 8 months ago
- Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue) ☆1,159 · Oct 30, 2024 · Updated last year
- Driving all platforms UI automation with vision-based model ☆11,647 · Feb 9, 2026 · Updated last week
- The AI Browser Automation Framework ☆21,077 · Updated this week
- screenpipe turns your computer into a personal AI that knows everything you've done. record. search. automate. all local, all private, al… ☆16,810 · Updated this week
- An experiment in meeting transcription and diarization with just an LLM. Maybe I went a little overboard though ☆567 · Nov 20, 2025 · Updated 2 months ago
- MemFree - Hybrid AI Search Engine & AI Page Generator ☆1,484 · Aug 8, 2025 · Updated 6 months ago
- The open-source no-code platform for web scraping, crawling, search and AI data extraction • Turn websites into structured APIs in minu… ☆14,897 · Updated this week
- A powerful Python tool that leverages Claude 3.5 Sonnet Vision API to detect and visualize objects in images. The script automatically dr… ☆220 · Nov 3, 2024 · Updated last year
- Desktop app to control your computer with AI using your terminal, browser, mouse & keyboard ☆567 · Jan 9, 2026 · Updated last month
- Make websites accessible for AI agents. Automate tasks online with ease. ☆78,295 · Updated this week
- Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web, vision. ☆4,188 · Updated this week
- pingcap/autoflow is a Graph RAG based and conversational knowledge base tool built with TiDB Serverless Vector Storage. Demo: https://tid… ☆2,735 · Jan 9, 2026 · Updated last month
- The first open-source agent skills builder. Define skills by vibe workflow, run on Claude Code, Cursor, Codex & more. Build Clawdbot · … ☆6,617 · Updated this week
- Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents… ☆2,976 · Dec 8, 2025 · Updated 2 months ago
- [CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use. ☆1,711 · Jan 20, 2026 · Updated 3 weeks ago
- Open-Source Chrome extension for AI-powered web automation. Run multi-agent workflows using your own LLM API key. Alternative to OpenAI O… ☆12,220 · Nov 24, 2025 · Updated 2 months ago
- A simple screen parsing tool towards pure vision based GUI agent ☆24,364 · Sep 12, 2025 · Updated 5 months ago
- Vision infrastructure to turn complex documents into RAG/LLM-ready data ☆2,938 · Sep 24, 2025 · Updated 4 months ago
- OCR & Document Extraction using vision models ☆12,136 · May 20, 2025 · Updated 8 months ago
- Fuji is an AI agent that lives in your browser's sidepanel. You can now get tasks done online with a single command! ☆585 · Jan 6, 2026 · Updated last month
- Create and run workflows (RPA 2.0) ☆3,880 · Feb 6, 2026 · Updated last week
- The first AI agent that builds permissionless integrations through reverse engineering platforms' internal APIs. ☆4,539 · Updated this week
- An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT) ☆6,762 · Jul 4, 2025 · Updated 7 months ago
- A visual playground for agentic workflows: Iterate over your agents 10x faster ☆5,674 · Jul 20, 2025 · Updated 6 months ago
- Open Source framework for voice and multimodal conversational AI ☆10,263 · Updated this week
- A better UX for chat, writing content, and coding with LLMs. ☆5,359 · Dec 31, 2025 · Updated last month
- Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks ☆6,796 · Dec 12, 2025 · Updated 2 months ago
- Large Action Model framework to develop AI Web Agents ☆6,295 · Jan 21, 2025 · Updated last year
- Web scraper made for AI and simplicity in mind. It runs as a CLI that can be parallelized and outputs high-quality markdown content. ☆541 · Nov 3, 2025 · Updated 3 months ago
- An AI personal tutor built with Llama 3.1 ☆1,955 · Dec 15, 2025 · Updated 2 months ago
- AI reads books: Page-by-Page PDF Knowledge Extractor & Summarizer. script performs an intelligent page-by-page analysis of PDF books, met… ☆1,572 · Jan 20, 2025 · Updated last year
- Riona Ai Agent is built using Node.js and TypeScript, designed for seamless job execution. It's lightweight, efficient, and sti… ☆4,177 · Dec 22, 2025 · Updated last month
- first base model for full-duplex conversational audio ☆1,773 · Jan 5, 2025 · Updated last year
- Generate descriptions from product images in multiple languages with AI ☆325 · Jan 20, 2025 · Updated last year
- Gemini Multimodal Live + WebRTC in a single `app.ts` ☆212 · Oct 14, 2025 · Updated 4 months ago