Pokémon Showdown RL Agents and Datasets
☆110Jun 1, 2026Updated last week
Alternatives and similar repositories for metamon
Users that are interested in metamon are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- off-policy RL on long sequences☆163May 29, 2026Updated last week
- Reinforcement Learning inside a 3D soccer simulation☆38Sep 15, 2024Updated last year
- ☆27Mar 6, 2025Updated last year
- MIMEx: Intrinsic Rewards from Masked Input Modeling [NeurIPS 2023]☆16May 17, 2023Updated 3 years ago
- Clean, extensible implementation of MACAW [ICML 2021]☆12Dec 7, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A Language-consistent Open Relation Extraction Model.☆16Mar 24, 2023Updated 3 years ago
- ☆10Jun 27, 2024Updated last year
- XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning - - — ICLR 2025☆84Feb 13, 2025Updated last year
- ☆13Apr 25, 2024Updated 2 years ago
- E-MAML, and RL-MAML baseline implemented in Tensorflow v1☆17Dec 7, 2019Updated 6 years ago
- ☆24May 20, 2025Updated last year
- ☆23Apr 2, 2024Updated 2 years ago
- Challenging Memory-based Deep Reinforcement Learning Agents☆113Oct 27, 2024Updated last year
- Minimal Decision Transformer Implementation written in Jax (Flax).☆18Aug 8, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- CEVAE with VampPrior☆11Jul 18, 2018Updated 7 years ago
- UNSW's RoboCup Standard Platform League Team☆12Jun 18, 2022Updated 3 years ago
- A minimal and stable PPO.☆147Feb 9, 2024Updated 2 years ago
- A PyTorch Implementation of PlaNet: A Deep Planning Network for Reinforcement Learning☆13Aug 31, 2020Updated 5 years ago
- ☆47Jan 11, 2024Updated 2 years ago
- Code release for "Generating Code World Models with Large Language Models Guided by Monte Carlo Tree Search" published at NeurIPS '24.☆18Feb 21, 2025Updated last year
- [ICLR25] BID-Robot☆66Oct 19, 2025Updated 7 months ago
- Gazebo support for the RoboCup 3D simulation league.☆12May 3, 2020Updated 6 years ago
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Simple rules based grapheme to phoneme in Python☆11Sep 2, 2017Updated 8 years ago
- Code release for Efficient Planning in a Compact Latent Action Space (ICLR2023) https://arxiv.org/abs/2208.10291.☆113May 12, 2023Updated 3 years ago
- Tracking literature and additional online resources on transformers for sequential decision making including RL and beyond.☆51Dec 21, 2022Updated 3 years ago
- This is a unified platform for implementing and evaluating test-time reasoning mechanisms in Large Language Models (LLMs).☆18Jan 16, 2025Updated last year
- COMMS Software for UPSat☆12Dec 17, 2018Updated 7 years ago
- An optimization algorithm for the design of pneumatic soft robots.☆14Jul 30, 2025Updated 10 months ago
- ☆21Jul 25, 2025Updated 10 months ago
- Bipedal Skills Benchmark for Reinforcement Learning☆25Oct 27, 2022Updated 3 years ago
- Schoenfeld’s Anatomy of Mathematical Reasoning by Language Models☆26Dec 21, 2025Updated 5 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Official codebase for Sirius: Robot Learning on the Job☆67Oct 26, 2023Updated 2 years ago
- PyTorch reimplementation for "LO-Net: Deep Real-time Lidar Odometry" https://arxiv.org/abs/1904.08242☆16Jan 8, 2022Updated 4 years ago
- On the model-based stochastic value gradient for continuous reinforcement learning☆57Mar 6, 2026Updated 3 months ago
- URDFs for the Stretch mobile manipulators from Hello Robot Inc.☆16Updated this week
- Data generation code for Ditto☆11Apr 28, 2022Updated 4 years ago
- Code for "SimbaV2: Hyperspherical Normalization for Scalable Deep Reinforcement Learning"☆102Nov 4, 2025Updated 7 months ago
- OSWorld-Human: Benchmarking the Efficiency of Computer-Use Agents☆25May 17, 2026Updated 3 weeks ago