dave1010 / ceo-benchLinks
CEO Bench is a comprehensive evaluation framework measuring how well Large Language Models perform on executive-level decision making, strategic planning, and leadership tasks.
☆18Updated 7 months ago
Alternatives and similar repositories for ceo-bench
Users that are interested in ceo-bench are comparing it to the libraries listed below
Sorting:
- A SQL-like language for efficient code analysis and transformations☆35Updated last year
- A simple, observable code-writing agent builder in TypeScript.☆30Updated 10 months ago
- powerful and fast tool calling agents☆80Updated 10 months ago
- Apex: An advanced autonomous coding agent for VS Code featuring total autonomy modes, recursive chain-of-thought reasoning, council-of-cr…☆31Updated last week
- Torque is a Declarative, typesafe DSL for building synthetic LLM datasets — compose conversations like React components☆87Updated 2 months ago
- All-in-one MCP server that can connect your AI agents to any native endpoint, powered by UTCP☆190Updated 2 months ago
- ☆24Updated 8 months ago
- This project provides tools that expose Language Server Protocol (LSP) functionality as MCP (Model Context Protocol) tools☆28Updated 8 months ago
- George is an API leveraging AI to make it easy to control a computer with natural language.☆50Updated last year
- A web application that converts speech to speech 100% private☆81Updated 8 months ago
- FamilyBench evaluation tool for testing the relational reasoning capabilities of Large Language Models (LLMs).☆40Updated 4 months ago
- ☆49Updated 11 months ago
- GoalChain for goal-orientated LLM conversation flows☆71Updated last year
- Compose, manage, and run MCP servers as Docker containers. With a Unified API gateway built in.☆53Updated 4 months ago
- Unofficial documentation for GitHub Spark, generated with GitHub Spark☆96Updated 3 months ago
- A lightweight Agentic AI framework which works for Mac/Linux/WSL☆45Updated 7 months ago
- Building on Anthropic's Circuit Tracer, Neuronpedia, Ameisen et al. (2025) and Lindsey et al. (2025), we attempt to extend the paradigm w…☆61Updated 6 months ago
- A proxy for minimax-m2, enabling interleaved thinking, and tool calls.☆38Updated 2 months ago
- mcp-use is the framework for MCP with the best DX - Build AI agents, create MCP servers with UI widgets, and debug with built-in inspec…☆167Updated 2 months ago
- OmniMCP uses Microsoft OmniParser and Model Context Protocol (MCP) to provide AI models with rich UI context and powerful interaction cap…☆69Updated 10 months ago
- story based implementation for sequential thinking☆15Updated last month
- Memory that learns what works.☆109Updated 2 weeks ago
- This is a TypeScript package to add tool calling capabilities to newly released LLMs on LangChain.js's ChatOpenAI and BaseChatModel class…☆19Updated 8 months ago
- Enhancing LLMs with LoRA☆206Updated 3 months ago
- ☆49Updated 10 months ago
- Shared Memory Storage for Multi-Agent Systems☆139Updated 7 months ago
- A simple CPU only OCR for pdf/images/word/excel to markdown. With streamlit.☆45Updated 2 weeks ago
- Vector functions and indexing for SQLite☆10Updated 2 years ago
- A modular framework for building massively parallel agentic systems☆29Updated 5 months ago
- ☆20Updated 9 months ago