jmurth1234 / ClaudePlayerLinks
An AI-powered game playing agent using Claude and PyBoy
☆34Updated 9 months ago
Alternatives and similar repositories for ClaudePlayer
Users that are interested in ClaudePlayer are comparing it to the libraries listed below
Sorting:
- Multi-Agent Step Race Benchmark: Assessing LLM Collaboration and Deception Under Pressure. A multi-player “step-race” that challenges LLM…☆80Updated this week
- Glyphs, acting as collaboratively defined symbols linking related concepts, add a layer of multidimensional semantic richness to user-AI …☆54Updated 10 months ago
- LLM Divergent Thinking Creativity Benchmark. LLMs generate 25 unique words that start with a given letter with no connections to each oth…☆33Updated 8 months ago
- Thematic Generalization Benchmark: measures how effectively various LLMs can infer a narrow or specific "theme" (category/rule) from a sm…☆63Updated 2 months ago
- Efficient computer use agent powered by Meta Llama 4 Maverick☆45Updated 7 months ago
- An extension that lets the AI take the wheel, allowing it to use the mouse and keyboard, recognize UI elements, and prompt itself :3...no…☆127Updated last year
- A simple experiment on letting two local LLM have a conversation about anything!☆112Updated last year
- Test your local LLMs on the AIME problems☆31Updated 6 months ago
- Benchmark evaluating LLMs on their ability to create and resist disinformation. Includes comprehensive testing across major models (Claud…☆31Updated 8 months ago
- A Python library to orchestrate LLMs in a neural network-inspired structure☆51Updated last year
- ☆24Updated 10 months ago
- Running Microsoft's BitNet via Electron, React & Astro☆48Updated 2 months ago
- Easily view and modify JSON datasets for large language models☆84Updated 6 months ago
- The DPAB-α Benchmark☆32Updated 10 months ago
- Benchmark that evaluates LLMs using 759 NYT Connections puzzles extended with extra trick words☆164Updated last week
- Curated resources about automated GUI computer-use via LLMs. Highly opinionated, focus is on quality vs quantity.☆23Updated last year
- ☆50Updated last year
- ☆134Updated 7 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆31Updated 7 months ago
- Terminal Voice Assistant is a powerful and flexible tool designed to help users interact with their terminal using natural language comma…☆19Updated last year
- Locally hosted AI Agent Python Tool To Generate Novel Research Hypothesis + Titles + Abstracts☆28Updated 7 months ago
- LLM backed Fantasy Tribe Game☆19Updated last year
- ☆24Updated last year
- Building synthetic data for preference tuning☆27Updated 11 months ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆148Updated 9 months ago
- Attend - to what matters.☆17Updated 9 months ago
- Public Goods Game (PGG) Benchmark: Contribute & Punish is a multi-agent benchmark that tests cooperative and self-interested strategies a…☆39Updated 8 months ago
- Chat WebUI is an easy-to-use user interface for interacting with AI, and it comes with multiple useful built-in tools such as web search …☆46Updated 3 months ago
- ☆17Updated 11 months ago
- Forces DeepSeek R1 models to engage in extended reasoning by intercepting early termination tokens.☆19Updated 10 months ago