nottelabs / open-operator-evals
Open-source benchmark evaluating the performance of web operators/agents
☆45 Updated 9 months ago
Alternatives and similar repositories for open-operator-evals
Users who are interested in open-operator-evals are comparing it to the libraries listed below
- Context Engineering Course with DSPy ☆211 Updated 6 months ago
- Run Surfer-H agents powered by Holo1 using the Surfer-H-CLI. Includes example tasks, scripts, and configurations. ☆146 Updated last month
- Deep research agents using MiniMax M2.1 interleaved thinking ☆194 Updated last month
- AI-driven web automation agent that uses Playwright for browser interactions and LLM integration for intelligent decision-making. It's de… ☆162 Updated 7 months ago
- Vibe-coding tools for the LlamaIndex ecosystem ☆176 Updated 2 months ago
- An OpenSource Deep Research library with reasoning ☆170 Updated last month
- An assistant for Slack built with Arcade and Langgraph. Interact with Google Calendar, Mail, Github, Search Engines, Firecrawl and more a… ☆117 Updated this week
- Turn topics, links, and files into AI-generated research notebooks: summarize, explore, and ask anything. ☆146 Updated 7 months ago
- ☆62 Updated this week
- Simulate conversation between your AI bot and AI user ☆46 Updated 10 months ago
- ☆89 Updated 8 months ago
- DSPy module for OpenAI Codex SDK - signature-driven agentic workflows ☆152 Updated last month
- ☆149 Updated 4 months ago
- Grapheteria: A structured framework bringing uniformity to agent orchestration! ☆60 Updated 7 months ago
- ☆269 Updated last week
- CLI agent to explore file system, powered by Gemini 3 Flash ☆126 Updated 2 weeks ago
- ☆212 Updated last month
- ☆57 Updated 5 months ago
- Research repository on interfacing LLMs with Weaviate APIs. Inspired by the Berkeley Gorilla LLM. ☆140 Updated 5 months ago
- Together Open Deep Research ☆356 Updated 9 months ago
- Workflows are an event-driven, async-first, step-based way to control the execution flow of AI applications like agents. ☆308 Updated this week
- Graphite Agentic Framework by Binome Technologies ☆172 Updated last month
- Demo for using copilotkit with the ada-middleware from ag-ui ☆83 Updated 3 weeks ago
- This open-source project & guide shows you exactly how to implement Canvas UX pattern + LangGraph human-in-the-loop workflows in your AI … ☆88 Updated 10 months ago
- A collection of Compound Retrieval Systems implemented with DSPy and Weaviate. ☆94 Updated 3 weeks ago
- ☆85 Updated 4 months ago
- ☆94 Updated last year
- MCP (Model Context Protocol) server for Weaviate ☆161 Updated 8 months ago
- Example code and guides for building with Scrapybara ☆139 Updated 10 months ago
- adapt data to and from every format ☆28 Updated 3 months ago