cassidylaidlaw / minecraft-building-assistance-gameLinks
☆151Updated 2 months ago
Alternatives and similar repositories for minecraft-building-assistance-game
Users that are interested in minecraft-building-assistance-game are comparing it to the libraries listed below
Sorting:
- Benchmark environment for evaluating vision-language models (VLMs) on popular video games!☆304Updated 3 months ago
- Worker to orchestrate and manage running an arbitrary number of LLM-generated builds concurrently using containerized Minecraft Servers.☆167Updated 9 months ago
- ☆85Updated 2 months ago
- The Prime Intellect CLI provides a powerful command-line interface for managing GPU resources across various providers☆84Updated this week
- Plotting (entropy, varentropy) for small LMs☆99Updated 4 months ago
- Implementation of the board game Codenames, re-imagined as a collaborative game between LLM agents☆109Updated 6 months ago
- Automated Capability Discovery via Foundation Model Self-Exploration☆63Updated 7 months ago
- LLM/VLM gaming agents and model evaluation through games.☆768Updated last week
- The State Of The Art, intelligence☆152Updated last month
- OSS RL environment + evals toolkit☆176Updated this week
- An MCP Server that's also an MCP Client. Useful for letting Claude develop and test MCPs without needing to reset the application.☆124Updated 6 months ago
- Automating the Search for Artificial Life with Foundation Models!☆427Updated 8 months ago
- SoTA Approach for ARC-AGI 2☆64Updated this week
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆340Updated 3 months ago
- General multi-task deep RL Agent☆184Updated last year
- A preprint version of our recent research on the capability of frontier AI systems to do self-replication☆59Updated 9 months ago
- Code for Discovering Preference Optimization Algorithms with and for Large Language Models☆190Updated last year
- Train your own SOTA deductive reasoning model☆106Updated 6 months ago
- Testing baseline LLMs performance across various models☆309Updated last month
- GRPO training code which scales to 32xH100s for long horizon terminal/coding tasks. Base agent is now the top Qwen3 agent on Stanford's T…☆254Updated 3 weeks ago
- ☆143Updated 6 months ago
- ☆91Updated 3 months ago
- Official repository for "DynaSaur: Large Language Agents Beyond Predefined Actions"☆350Updated 9 months ago
- explore token trajectory trees on instruct and base models☆132Updated 3 months ago
- ☆17Updated 7 months ago
- i will automate factorio☆111Updated last year
- ☆155Updated 5 months ago
- Build your own visual reasoning model☆408Updated 3 weeks ago
- rl from zero pretrain, can it be done? yes.☆268Updated last month
- Claude Deep Research config for Claude Code.☆214Updated 6 months ago