strangeloopcanon / llm-poker
Evaluating LLMs by having them play games against each other
☆20Updated last month
Alternatives and similar repositories for llm-poker
Users that are interested in llm-poker are comparing it to the libraries listed below
- Creating diff that supports wildcard produced by LLMs☆15Updated last year
- Tiny Agent: Production-Ready LLM Agent SDK for Every Developer☆23Updated last week
- Guaranteed Structured Output from any Language Model via Hierarchical State Machines☆146Updated 2 weeks ago
- 📚 Benchmark your browser agent on ~2.5k READ and ACTION based tasks☆64Updated 2 months ago
- Parallel Reasoning: llm-consortium orchestrates multiple LLMs, iteratively refines & achieves consensus.☆367Updated this week
- Test your local LLMs on the AIME problems☆31Updated 4 months ago
- George is an API leveraging AI to make it easy to control a computer with natural language.☆50Updated 9 months ago
- 🤖 Headless IDE for AI agents☆202Updated 5 months ago
- Letting Claude Code develop his own MCP tools :)☆122Updated 7 months ago
- CaSIL is an advanced natural language processing system that implements a sophisticated four-layer semantic analysis architecture. It pro…☆66Updated 11 months ago
- A multi-player tournament benchmark that tests LLMs in social reasoning, strategy, and deception. Players engage in public and private co…☆290Updated last month
- Reddit-Nemesis is a Reddit bot that disagrees with any post, sparking debate and keeping conversations lively.☆49Updated last year
- Open-source software engineer☆181Updated last year
- ☆104Updated 3 months ago
- Official repository for "DynaSaur: Large Language Agents Beyond Predefined Actions"☆350Updated 9 months ago
- An extension that lets the AI take the wheel, allowing it to use the mouse and keyboard, recognize UI elements, and prompt itself :3...no…☆127Updated 11 months ago
- An md file as a chat interface and editable history in one.☆63Updated last month
- A very fast, very minimal prompt optimizer☆288Updated 9 months ago
- Conduct in-depth research with AI-driven insights : DeepDive is a command-line tool that leverages web searches and AI models to generate…☆42Updated last year
- klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs☆79Updated last year
- Applying the ideas of Deepseek R1 to computer use☆216Updated 8 months ago
- A fork of OpenAI Swarm that supports Groq and Anthropic☆123Updated 7 months ago
- Overide (pronounced over·ide) is a lightweight, yet powerful CLI tool that seamlessly integrates AI-powered code generation into your dev…☆185Updated 2 months ago
- FamilyBench evaluation tool for testing the relational reasoning capabilities of Large Language Models (LLMs).☆36Updated 2 weeks ago
- Moondream MCP Server in Python☆42Updated 3 months ago
- Claude Deep Research config for Claude Code.☆220Updated 6 months ago
- ☆233Updated 7 months ago
- Metadspy: The framework for specifying—not programming—language models☆88Updated 3 months ago
- ☆113Updated 3 months ago
- The easiest, and fastest way to run AI-generated Python code safely☆334Updated 10 months ago