Farama-Foundation / gym-examples

Example code for the Gym documentation

☆72

Alternatives and similar repositories for gym-examples

Users who are interested in gym-examples compare it to the libraries listed below.

kevslinger / DTQN
Deep Transformer Q-Networks for Partially Observable Reinforcement Learning
☆163 Updated last year
schroederdewitt / multiagent_mujoco
Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's Mujoco Gym environments.
☆358 Updated 2 years ago
Stable-Baselines-Team / rl-colab-notebooks
Colab notebooks part of the documentation of Stable Baselines reinforcement learning library
☆229 Updated 5 months ago
DLR-RM / rl-trained-agents
A collection of pre-trained RL agents using Stable Baselines3
☆130 Updated 8 months ago
Stable-Baselines-Team / stable-baselines
Mirror of Stable-Baselines: a fork of OpenAI Baselines, implementations of reinforcement learning algorithms
☆302 Updated 2 years ago
semitable / lb-foraging
Level-based Foraging (LBF): A multi-agent environment for RL
☆183 Updated 10 months ago
alirezakazemipour / DDPG-HER
Implementation of the Deep Deterministic Policy Gradient and Hindsight Experience Replay.
☆100 Updated 2 months ago
Farama-Foundation / MAgent2
An engine for high performance multi-agent environments with very large numbers of agents, along with a set of reference environments
☆298 Updated 4 months ago
nikhilbarhate99 / Hierarchical-Actor-Critic-HAC-PyTorch
PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments
☆317 Updated 3 years ago
MarcoMeter / recurrent-ppo-truncated-bptt
Baseline implementation of recurrent PPO using truncated BPTT
☆149 Updated last year
XinJingHao / PPO-Discrete-Pytorch
A clean and robust Pytorch implementation of PPO on Discrete action space
☆70 Updated last year
toshikwa / sac-discrete.pytorch
PyTorch implementation of SAC-Discrete.
☆307 Updated 11 months ago
sfujim / TD3_BC
Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL
☆362 Updated 3 years ago
nikhilbarhate99 / min-decision-transformer
Minimal implementation of Decision Transformer: Reinforcement Learning via Sequence Modeling in PyTorch for mujoco control tasks in Open…
☆276 Updated 3 years ago
Farama-Foundation / MO-Gymnasium
Multi-objective Gymnasium environments for reinforcement learning
☆333 Updated 2 weeks ago
dhruvramani / Transformers-RL
An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning"
☆180 Updated 2 years ago
MarcoMeter / episodic-transformer-memory-ppo
Clean baseline implementation of PPO using an episodic TransformerXL memory
☆183 Updated last year
AboudyKreidieh / h-baselines
A repository of high-performing hierarchical reinforcement learning models and algorithms.
☆316 Updated 2 years ago
oxwhirl / smacv2
☆254 Updated last year
chauncygu / Multi-Agent-Constrained-Policy-Optimisation
Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).
☆180 Updated last year
toshikwa / fqf-iqn-qrdqn.pytorch
PyTorch implementation of FQF, IQN and QR-DQN.
☆178 Updated 11 months ago
openai / safety-starter-agents
Basic constrained RL agents used in experiments for the "Benchmarking Safe Exploration in Deep Reinforcement Learning" paper.
☆435 Updated 2 years ago
williamyuanv0 / Transformer-in-Reinforcement-Learning-for-Decision-Making-A-Survey
Transformer in RL for decision-making
☆96 Updated 2 years ago
CherryPieSexy / imitation_learning
PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
☆145 Updated 3 years ago
lweitkamp / option-critic-pytorch
PyTorch implementation of the Option-Critic framework, Harb et al. 2016
☆130 Updated 11 months ago
gxywy / rl-plotter
A plotter for reinforcement learning (RL)
☆226 Updated 3 years ago
Jingliang-Duan / DSAC-v1
DSAC; Distributional Soft Actor-Critic
☆129 Updated 5 months ago
BY571 / Soft-Actor-Critic-and-Extensions
PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL…
☆288 Updated 4 years ago
cyanrain7 / TRPO-in-MARL
☆211 Updated 2 years ago
AntoineTheb / RNN-RL
Experiments with reinforcement learning and recurrent neural networks
☆114 Updated last year