mind-games-challenge / mindgames-starter-kitLinks
The official starter-kit for NeurIPS 2025 mind games competition
☆21Updated 3 months ago
Alternatives and similar repositories for mindgames-starter-kit
Users that are interested in mindgames-starter-kit are comparing it to the libraries listed below
Sorting:
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers☆23Updated 8 months ago
- Dateset Reset Policy Optimization☆31Updated last year
- Official Repo of LangSuitE☆84Updated last year
- Code for "Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining"☆23Updated 2 weeks ago
- LLM Dynamic Planner - Combining LLM with PDDL Planners to solve an embodied task☆47Updated 9 months ago
- ☆73Updated last year
- implementation of dualformer☆24Updated 8 months ago
- The code for the paper "A Bayesian Approach to Online Planning" published in ICML 2024.☆11Updated last year
- ☆29Updated 3 months ago
- Dataset and benchmark for assessing LLMs in translating natural language descriptions of planning problems into PDDL☆59Updated last year
- This code accompanies the paper "Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration."☆34Updated 3 months ago
- Bayes-Adaptive RL for LLM Reasoning☆40Updated 5 months ago
- Codebase for "On Targeted Manipulation and Deception when Optimizing LLMs for User Feedback". This repo implements a generative multi-tur…☆21Updated 10 months ago
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"☆31Updated 2 years ago
- ☆63Updated 7 months ago
- Official Code Repository for EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents (COLM 2024)☆38Updated last year
- A testbed for agents and environments that can automatically improve models through data generation.☆27Updated 7 months ago
- Official repository of the NeurIPS 2025 Competition: The PokeAgent Challenge: Competitive and Long-Context Learning at Scale. (Track 2, S…☆54Updated this week
- The official implementation of the paper "Read to Play (R2-Play): Decision Transformer with Multimodal Game Instruction".☆34Updated last year
- ☆25Updated 7 months ago
- Natural Language Reinforcement Learning☆99Updated 3 months ago
- Verlog: A Multi-turn RL framework for LLM agents☆63Updated last week
- Code for "Interactive Task Planning with Language Models"☆32Updated 6 months ago
- A vast array of Multi-Modal Embodied Robotic Foundation Models!☆26Updated last year
- Learning from preferences is a common paradigm for fine-tuning language models. Yet, many algorithmic design decisions come into play. Ou…☆32Updated last year
- Official implementation of paper "ROCKET-1: Mastering Open-World Interaction with Visual-Temporal Context Prompting" (CVPR 2025)☆45Updated 6 months ago
- Resa: Transparent Reasoning Models via SAEs☆44Updated last month
- Plancraft is a minecraft environment and agent suite to test planning capabilities in LLMs☆20Updated 3 months ago
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients☆26Updated last year
- Measuring General Intelligence With Generated Games (Preprint)☆26Updated 3 months ago