DHDev0 / Muzero-unpluggedView external linksLinks
Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observation spaces, including both discrete and continuous variations.
☆35Jun 25, 2025Updated 7 months ago
Alternatives and similar repositories for Muzero-unplugged
Users that are interested in Muzero-unplugged are comparing it to the libraries listed below
Sorting:
- Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…☆75Dec 31, 2025Updated last month
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆43Sep 19, 2022Updated 3 years ago
- A clean and easy implementation of MuZero, AlphaZero and Self-Play reinforcement learning algorithms for any game.☆17Oct 15, 2024Updated last year
- ☆13Apr 25, 2024Updated last year
- On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning☆16Apr 30, 2023Updated 2 years ago
- [IEEE ToG] MiniZero: An AlphaZero and MuZero Training Framework☆121Updated this week
- A C++ pytorch implementation of MuZero☆40May 1, 2024Updated last year
- ☆22Aug 10, 2022Updated 3 years ago
- Risk Management via Anomaly Circumvent: Mnemonic Deep Learning for Midterm Stock Prediction. KDD 2019.☆23Aug 26, 2020Updated 5 years ago
- Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult"☆30Dec 15, 2025Updated 2 months ago
- D ratio is a performance metric to analyse the efficiency of algorithms that predict asset return or asset prices☆25Feb 22, 2024Updated last year
- [ICML 2024, Spotlight] EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data☆101Aug 9, 2024Updated last year
- Action Value Gradient Algorithm☆28May 18, 2025Updated 8 months ago
- The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World".☆27Aug 20, 2025Updated 5 months ago
- Codes for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023☆30Jul 18, 2023Updated 2 years ago
- Financial Analysis and Algorithmic Trading Strategies in Python☆11Feb 16, 2023Updated 3 years ago
- PyTorch implementation of the implicit Q-learning algorithm (IQL)☆44Dec 17, 2021Updated 4 years ago
- Implementation of MuZero with PyTorch, based on the pseudocode from DeepMind (https://arxiv.org/src/1911.08265v2/anc/pseudocode.py).☆33Aug 14, 2022Updated 3 years ago
- reinforcement learning, deep Q-network, double DQN, dueling DQN, prioritized experience replay☆31May 22, 2018Updated 7 years ago
- An assemble of various world model including dreamer v2 and v3☆10Sep 9, 2023Updated 2 years ago
- RL algorithm for stock trading with multiple reward functions☆11Apr 21, 2024Updated last year
- The official starter-kit for NeurIPS 2025 mind games competition☆21Jul 27, 2025Updated 6 months ago
- Implementation of the model from "Faster sorting algorithms discovered using deep reinforcement learning" that discovered an all-new ult…☆11Aug 29, 2023Updated 2 years ago
- Various transformers for FSDP research☆38Nov 11, 2022Updated 3 years ago
- Research project implementation for the ICAIF'21 publication and Master's Thesis. ITS-SentARL => Intelligent Trading Systems: A Sentiment…☆41Sep 8, 2024Updated last year
- Evaluating long-term memory of reinforcement learning algorithms☆163Jun 23, 2023Updated 2 years ago
- Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.☆923Dec 20, 2023Updated 2 years ago
- Advantage Alignment Algorithms (ICLR 2025 oral)☆16Apr 7, 2025Updated 10 months ago
- ☆10Jul 21, 2019Updated 6 years ago
- Open Source Tsetlin Machine framework☆17Oct 15, 2018Updated 7 years ago
- Introductory jupyter notebook for various RL concept☆11Oct 13, 2024Updated last year
- WIP: Python client for Liftbridge.☆10Jul 5, 2020Updated 5 years ago
- ☆10Dec 17, 2019Updated 6 years ago
- ☆15Jul 27, 2023Updated 2 years ago
- Implementations of the renormalization group-based diffusion model (RGDM).☆16Mar 10, 2025Updated 11 months ago
- A smart web crawler built in Rust that uses Claude AI to select the most relevant URLs from website sitemaps based on crawling objectives…☆19Jul 9, 2025Updated 7 months ago
- Partially Observable Multi-Agent RL with Transformers☆17Updated this week
- (NeurIPS 2025) LaRes: Evolutionary Reinforcement Learning with LLM-based Adaptive Reward Search☆20Feb 3, 2026Updated last week
- prinzbench is a private benchmark that ranks LLMs based on their ability to conduct legal research and analysis and locate obscure public…☆32Feb 8, 2026Updated last week