cassidylaidlaw / minecraft-building-assistance-gameLinks
☆140Updated last month
Alternatives and similar repositories for minecraft-building-assistance-game
Users that are interested in minecraft-building-assistance-game are comparing it to the libraries listed below
Sorting:
- Benchmark environment for evaluating vision-language models (VLMs) on popular video games!☆262Updated last week
- Worker to orchestrate and manage running an arbitrary number of LLM-generated builds concurrently using containerized Minecraft Servers.☆169Updated 6 months ago
- ☆68Updated last month
- Plotting (entropy, varentropy) for small LMs☆97Updated 2 weeks ago
- General multi-task deep RL Agent☆183Updated last year
- ☆86Updated 4 months ago
- Claude Deep Research config for Claude Code.☆181Updated 2 months ago
- An MCP Server that's also an MCP Client. Useful for letting Claude develop and test MCPs without needing to reset the application.☆119Updated 2 months ago
- ☆128Updated 2 months ago
- prime-rl is a codebase for decentralized async RL training at scale☆318Updated this week
- Letting Claude Code develop his own MCP tools :)☆107Updated 2 months ago
- ⚖️ Awesome LLM Judges ⚖️☆104Updated last month
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆172Updated 4 months ago
- i will automate factorio☆105Updated 10 months ago
- A graph visualization of attention☆55Updated 2 weeks ago
- ☆120Updated 5 months ago
- A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning☆173Updated this week
- ☆264Updated this week
- explore token trajectory trees on instruct and base models☆125Updated last week
- ☆151Updated 3 months ago
- Implementation of the board game Codenames, re-imagined as a collaborative game between LLM agents☆108Updated 3 months ago
- ☆303Updated last month
- Challenges for general-purpose web-browsing AI agents☆58Updated this week
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆67Updated 2 months ago
- Train your own SOTA deductive reasoning model☆93Updated 3 months ago
- Coding problems used in aider's polyglot benchmark☆131Updated 5 months ago
- II-Researcher: a new open-source framework designed to aid building search / research agents☆354Updated 3 weeks ago
- Scale your LLM-as-a-judge.☆234Updated last week
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]☆477Updated 3 weeks ago
- Open Agent Computer Interface☆71Updated 6 months ago