lmarena / copilot-arena
☆258 · Updated this week
Alternatives and similar repositories for copilot-arena:
Users who are interested in copilot-arena are comparing it to the libraries listed below.
- ☆183 · Updated 2 months ago
- agent q - oss advanced reasoning and learning for autonomous ai agents ☆395 · Updated 4 months ago
- ☆296 · Updated 2 months ago
- A comprehensive set of LLM benchmark scores and provider prices. ☆108 · Updated this week
- the simplest self-building general autonomous agent ☆291 · Updated 4 months ago
- Sidecar is the AI brains for the Aide editor and works alongside it, locally on your machine ☆476 · Updated this week
- Finetune Llama-3-8b on the MathInstruct dataset ☆106 · Updated 4 months ago
- ☆438 · Updated 4 months ago
- Desktop app powered by Claude’s computer use capability to control your computer ☆358 · Updated 3 weeks ago
- Building open version of OpenAI o1 via reasoning traces (Groq, ollama, Anthropic, Gemini, OpenAI, Azure supported) Demo: https://hugging… ☆174 · Updated 4 months ago
- A powerful Python tool for performing technical searches using the Perplexity API, optimized for retrieving precise facts, code examples,… ☆202 · Updated last month
- Flexible and powerful multi-agent AI framework ☆340 · Updated 3 weeks ago
- Examples of using E2B ☆853 · Updated this week
- Automate the analysis of GitHub repositories for LLMs with RepoToTextForLLMs. Fetch READMEs, structure, and non-binary files efficiently.… ☆723 · Updated 8 months ago
- An agent benchmark with tasks in a simulated software company. ☆243 · Updated this week
- Sandboxed code execution for AI agents, locally or on the cloud. ☆89 · Updated this week
- ☆48 · Updated last year
- ☆168 · Updated 6 months ago
- the simplest self-building coding agent ☆954 · Updated 4 months ago
- multi1: create o1-like reasoning chains with multiple AI providers (and locally). Supports LiteLLM as backend too for 100+ providers at o… ☆345 · Updated 3 weeks ago
- FireCrawl MCP Server is a powerful web scraping integration for Claude and other LLMs. It provides JavaScript rendering, batch processing… ☆111 · Updated last week
- E2B Desktop Sandbox for LLMs. E2B Sandbox with desktop graphical environment that you can connect to any LLM for secure computer use. ☆377 · Updated last week
- ☆61 · Updated 3 months ago
- ☆281 · Updated 8 months ago
- The Multi-Agent Reasoning framework creates an interactive chatbot where AI agents collaborate via structured reasoning and Swarm Integra… ☆157 · Updated 3 weeks ago
- A comprehensive repository of reasoning tasks for LLMs (and beyond) ☆411 · Updated 4 months ago
- A preprint version of our recent research on the capability of frontier AI systems to do self-replication ☆58 · Updated 2 months ago
- ☆158 · Updated 9 months ago