nottelabs / open-operator-evalsLinks
Opensource benchmark evaluating web operators/agents performance
☆27Updated last month
Alternatives and similar repositories for open-operator-evals
Users that are interested in open-operator-evals are comparing it to the libraries listed below
Sorting:
- An assistant for Slack built with Arcade and Langgraph. Interact with Google Calendar, Mail, Github, Search Engines, Firecrawl and more a…☆87Updated 2 months ago
- A list of AI memory projects☆108Updated 4 months ago
- An application built on the Model Context Protocol (MCP) that transforms any website into highly relevant content based on your queries. …☆60Updated last month
- ☆40Updated last month
- ☆16Updated 7 months ago
- A MCP server connecting to managed indexes on LlamaCloud☆76Updated last month
- Collection of impressive LLM apps with a focus on the financial sector☆57Updated 2 months ago
- Research repository on interfacing LLMs with Weaviate APIs. Inspired by the Berkeley Gorilla LLM.☆132Updated last month
- Turn topics, links, and files into AI-generated research notebooks — summarize, explore, and ask anything.☆92Updated this week
- ☆91Updated this week
- ☆86Updated 3 weeks ago
- ☆86Updated 4 months ago
- Simulate conversation between your AI bot and AI user☆39Updated 2 months ago
- Build a Recommendation System Agent using LATS Agent Approach☆30Updated 3 months ago
- Backend-as-a-Service for AI Agents. Equip any AI Agent with tools, memory, multi-agent collaboration, state, triggering, storage, and mor…☆224Updated last week
- A repository Payman + Langgraph integration examples that allow AI Agent to simply create tasks for Humans on Payman that pay them money …☆83Updated 7 months ago
- ☆41Updated 2 months ago
- Model Context Protocol (MCP) Server for Langfuse Prompt Management. This server allows you to access and manage your Langfuse prompts thr…☆92Updated 3 months ago
- A MCP server that provides web search capabilities using the Claude API.☆38Updated 3 weeks ago
- An interactive integration of yFiles for HTML with LlamaIndex to visualize the knowledge graph used for query resolution.☆48Updated 2 months ago
- PandaAGI provides a simple, intuitive API for building general AI agents in just a few lines of code☆121Updated this week
- ☆43Updated 6 months ago
- Demo of knowledge graph creation and Graph RAG with BAML and Kuzu☆44Updated 2 weeks ago
- A pure MLX-based training pipeline for fine-tuning LLMs using GRPO on Apple Silicon.☆39Updated 4 months ago
- ☆99Updated 4 months ago
- ☆96Updated 3 months ago
- LangEvals aggregates various language model evaluators into a single platform, providing a standard interface for a multitude of scores a…☆57Updated last week
- An AI Coding Agent Powered by LangGraph☆99Updated this week
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…☆61Updated 10 months ago
- ☆68Updated 3 months ago