Pokémon Showdown RL Agents and Datasets
☆115Jun 19, 2026Updated last week
Alternatives and similar repositories for metamon
Users that are interested in metamon are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official repository of the spotlight ICML 2025 paper, PokeChamp: an Expert-level Minimax Language Agent.☆174Mar 11, 2026Updated 3 months ago
- off-policy RL on long sequences☆167May 29, 2026Updated last month
- An AI benchmark for Pokémon VGC with agent implementations using multi-agent reinforcement learning, behavior cloning, LLMs, and heuristi…☆45Jun 23, 2026Updated last week
- Reinforcement Learning inside a 3D soccer simulation☆38Sep 15, 2024Updated last year
- ☆27Mar 6, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- MIMEx: Intrinsic Rewards from Masked Input Modeling [NeurIPS 2023]☆16May 17, 2023Updated 3 years ago
- Clean, extensible implementation of MACAW [ICML 2021]☆12Dec 7, 2021Updated 4 years ago
- ☆10Jun 27, 2024Updated 2 years ago
- XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning - - — ICLR 2025☆84Feb 13, 2025Updated last year
- Official pytorch implementation for our ICLR 2023 paper "Latent State Marginalization as a Low-cost Approach for Improving Exploration".☆24Feb 9, 2023Updated 3 years ago
- ☆13Apr 25, 2024Updated 2 years ago
- E-MAML, and RL-MAML baseline implemented in Tensorflow v1☆17Dec 7, 2019Updated 6 years ago
- ☆24May 20, 2025Updated last year
- ☆23Apr 2, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Challenging Memory-based Deep Reinforcement Learning Agents☆113Oct 27, 2024Updated last year
- Minimal Decision Transformer Implementation written in Jax (Flax).☆18Aug 8, 2022Updated 3 years ago
- UNSW's RoboCup Standard Platform League Team☆12Jun 18, 2022Updated 4 years ago
- A minimal and stable PPO.☆148Feb 9, 2024Updated 2 years ago
- A PyTorch Implementation of PlaNet: A Deep Planning Network for Reinforcement Learning☆13Aug 31, 2020Updated 5 years ago
- ☆47Jan 11, 2024Updated 2 years ago
- [ICLR25] BID-Robot☆68Oct 19, 2025Updated 8 months ago
- Gazebo support for the RoboCup 3D simulation league.☆12May 3, 2020Updated 6 years ago
- ☆29Jul 9, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Code release for Efficient Planning in a Compact Latent Action Space (ICLR2023) https://arxiv.org/abs/2208.10291.☆113May 12, 2023Updated 3 years ago
- Tracking literature and additional online resources on transformers for sequential decision making including RL and beyond.☆51Dec 21, 2022Updated 3 years ago
- An optimization algorithm for the design of pneumatic soft robots.☆15Jul 30, 2025Updated 11 months ago
- Bipedal Skills Benchmark for Reinforcement Learning☆26Oct 27, 2022Updated 3 years ago
- Schoenfeld’s Anatomy of Mathematical Reasoning by Language Models☆27Dec 21, 2025Updated 6 months ago
- Official codebase for Sirius: Robot Learning on the Job☆66Oct 26, 2023Updated 2 years ago
- On the model-based stochastic value gradient for continuous reinforcement learning☆58Mar 6, 2026Updated 3 months ago
- URDFs for the Stretch mobile manipulators from Hello Robot Inc.☆16Jun 5, 2026Updated 3 weeks ago
- Code for "SimbaV2: Hyperspherical Normalization for Scalable Deep Reinforcement Learning"☆106Nov 4, 2025Updated 7 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- OSWorld-Human: Benchmarking the Efficiency of Computer-Use Agents☆27May 17, 2026Updated last month
- Unity project and ROS 2 interface definitions for using the HTC Vive☆12Mar 15, 2024Updated 2 years ago
- ☆11Nov 18, 2023Updated 2 years ago
- Evaluation of TD-MPC2.☆21Jan 21, 2024Updated 2 years ago
- [ICLR 2024 oral] Pre-Training Goal-based Models for Sample-Efficient Reinforcement Learning☆30Mar 1, 2024Updated 2 years ago
- AdaRefiner: Refining Decisions of Language Models with Adaptive Feedback (NAACL 2024)☆19Aug 9, 2024Updated last year
- Official release of the DMControl Generalization Benchmark 2 (DMC-GB2)☆22Jul 21, 2025Updated 11 months ago