cassidylaidlaw / minecraft-building-assistance-gameLinks
☆150Updated 3 weeks ago
Alternatives and similar repositories for minecraft-building-assistance-game
Users that are interested in minecraft-building-assistance-game are comparing it to the libraries listed below
Sorting:
- Benchmark environment for evaluating vision-language models (VLMs) on popular video games!☆291Updated 2 months ago
- Worker to orchestrate and manage running an arbitrary number of LLM-generated builds concurrently using containerized Minecraft Servers.☆169Updated 8 months ago
- Plotting (entropy, varentropy) for small LMs☆98Updated 2 months ago
- A preprint version of our recent research on the capability of frontier AI systems to do self-replication☆59Updated 7 months ago
- The State Of The Art, intelligence☆149Updated last week
- ☆82Updated last month
- GRPO training code which scales to 32xH100s for long horizon terminal/coding tasks. Base agent is now the top Qwen3 agent on Stanford's T…☆218Updated last week
- Implementation of the board game Codenames, re-imagined as a collaborative game between LLM agents☆109Updated 5 months ago
- ☆166Updated 5 months ago
- LLM/VLM gaming agents and model evaluation through games.☆736Updated this week
- ☆89Updated 6 months ago
- General multi-task deep RL Agent☆184Updated last year
- Train your own SOTA deductive reasoning model☆104Updated 5 months ago
- Automating the Search for Artificial Life with Foundation Models!☆427Updated 7 months ago
- rl from zero pretrain, can it be done? yes.☆193Updated this week
- ☆315Updated 3 months ago
- Interactive timeline of AI history☆58Updated 2 months ago
- GRadient-INformed MoE☆264Updated 10 months ago
- i will automate factorio☆109Updated last year
- ☆91Updated last month
- ☆377Updated last month
- ☆88Updated last month
- ☆142Updated 4 months ago
- Testing baseline LLMs performance across various models☆293Updated 2 weeks ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆72Updated 4 months ago
- ☆17Updated 5 months ago
- II-Researcher: a new open-source framework designed to aid building search / research agents☆457Updated last week
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆173Updated 6 months ago
- ☆143Updated 5 months ago
- Clue inspired puzzles for testing LLM deduction abilities☆40Updated 4 months ago