opendilab / MastermindLinks
[ICLR 2025 SynthData Workshop Spotlight] Empowering LLMs in Decision Games through Algorithmic Data Synthesis
☆25Updated 9 months ago
Alternatives and similar repositories for Mastermind
Users that are interested in Mastermind are comparing it to the libraries listed below
Sorting:
- Building open-ended embodied agent in battle royale FPS game☆38Updated 2 years ago
- Repository of "Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning" (NeurIPS 2023 Spotlight)☆40Updated 2 years ago
- Python library for solving reinforcement learning (RL) problems using generative models (e.g. Diffusion Models).☆187Updated 11 months ago
- This repo support auto line plot for multi-seed event file from TensorBoard☆12Updated 3 years ago
- Code release for "Pre-training Contextualized World Models with In-the-wild Videos for Reinforcement Learning" (NeurIPS 2023), https://ar…☆70Updated last year
- Official code of paper Understanding, Predicting and Better Resolving Q-Value Divergence in Offline-RL☆24Updated 2 years ago
- An RL-Friendly Vision-Language Model for Minecraft☆38Updated last year
- Python library for solving reinforcement learning (RL) problems using generative models.☆11Updated 11 months ago
- Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024…☆42Updated last year
- ☆34Updated 2 years ago
- Implementation of ICLR 2025 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"☆18Updated last year
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆67Updated 2 years ago
- Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human …☆41Updated last year
- Bayes-Adaptive RL for LLM Reasoning☆45Updated 8 months ago
- [ICLR 2024] Adaptive Replay Ratio implementation from 'Revisiting Plasticity in Visual RL: Data, Modules and Training Stages'.☆13Updated last year
- ☆92Updated 3 years ago
- Code for the paper: Dense Reward for Free in Reinforcement Learning from Human Feedback (ICML 2024) by Alex J. Chan, Hao Sun, Samuel Holt…☆38Updated last year
- Official implementation of A Mixture of Surprises for Unsupervised Reinforcement Learning☆23Updated 3 years ago
- Official repository for "RLVR-World: Training World Models with Reinforcement Learning" (NeurIPS 2025), https://arxiv.org/abs/2505.13934☆208Updated 3 months ago
- [NeurIPS 2024] PyTorch code for the paper "Making Offline RL Online: Collaborative World Models for Offline Visual Reinforcement Learning…☆23Updated 3 months ago
- [ICML 2025] This is the official PyTorch implementation of "OmniBal: Towards Fast Instruction-Tuning for Vision-Language Models via Omniv…☆27Updated 7 months ago
- Official code for Cross-Domain Policy Adaptation by Capturing Representation Mismatch (ICML 2024)☆14Updated 5 months ago
- ☆31Updated last month
- [NeurIPS-2024] The offical Implementation of "Instruction-Guided Visual Masking"☆41Updated last year
- Official codebase for Exact Energy-Guided Diffusion Sampling via Contrastive Energy Prediction (ICML 2023)☆50Updated 2 years ago
- Enabling Mixed Opponent Strategy Script and Self-play on SMAC☆41Updated 6 months ago
- Official implementation of "Direct Preference-based Policy Optimization without Reward Modeling" (NeurIPS 2023)☆42Updated last year
- Official Code Repo for the paper "Learning to Play Atari in a World of Tokens" accepted at ICML, 2024☆11Updated last year
- Retrieval-Augmented Decision Transformer: External Memory for In-context RL☆24Updated last year
- [ICML 2024] The offical Implementation of "DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning"☆82Updated 8 months ago