fakemonk1 / Reinforcement-Learning-Lunar_LanderLinks

☆48

Alternatives and similar repositories for Reinforcement-Learning-Lunar_Lander

Users that are interested in Reinforcement-Learning-Lunar_Lander are comparing it to the libraries listed below

Sorting:

philtabor / Actor-Critic-Methods-Paper-To-Code
☆184Updated 3 years ago
BY571 / Deep-Reinforcement-Learning-Algorithm-Collection
Collection of Deep Reinforcement Learning Algorithms implemented in PyTorch.
☆77Updated 4 years ago
Stable-Baselines-Team / rl-colab-notebooks
Colab notebooks part of the documentation of Stable Baselines reinforcement learning library
☆229Updated 5 months ago
deepanshut041 / Reinforcement-Learning
Implementations of Deep Reinforcement Learning Algorithms and Bench-marking with PyTorch
☆135Updated 5 years ago
abhisheksuran / Reinforcement_Learning
Deep Reinforcement Learning Algorithms implemented with Tensorflow 2.3
☆99Updated 2 years ago
semitable / robotic-warehouse
Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment
☆377Updated 10 months ago
DLR-RM / rl-trained-agents
A collection of pre-trained RL agents using Stable Baselines3
☆130Updated 8 months ago
DeUmbraTX / practical_rllib_tutorial
Practical tutorial on RLlib for deep hierarchical multi-agent reinforcement learning
☆65Updated 2 years ago
BY571 / DQN-Atari-Agents
DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,…
☆123Updated 4 years ago
PacktPublishing / Tensorflow-2-Reinforcement-Learning-Cookbook
Tensorflow 2 Reinforcement Learning Cookbook, published by Packt
☆195Updated 2 years ago
msinto93 / D4PG
Tensorflow implementation of a Deep Distributed Distributional Deterministic Policy Gradients (D4PG) network, trained on OpenAI Gym envir…
☆126Updated 5 years ago
philtabor / Deep-Q-Learning-Paper-To-Code
☆412Updated 2 years ago
schneimo / ddpg-pytorch
PyTorch implementation of DDPG for continuous control tasks.
☆46Updated 5 years ago
ChuaCheowHuan / reinforcement_learning
My reproduction of various reinforcement learning algorithms (DQN variants, A3C, DPPO, RND with PPO) in Tensorflow.
☆37Updated 2 years ago
BY571 / Soft-Actor-Critic-and-Extensions
PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL…
☆288Updated 4 years ago
adik993 / ppo-pytorch
Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)
☆142Updated 6 years ago
shivaverma / Orbit
Open source collection of Reinforcement Learning Environments.
☆76Updated 2 years ago
philtabor / Advanced-Replay-Strategies
☆13Updated 2 years ago
colinskow / move37
Coding Demos from the School of AI's Move37 Course
☆184Updated 6 years ago
philtabor / intrinsic-curiosity-paper-to-code
PyTorch implementation of the intrinsic curiosity module (ICM) and A3C a;lgorithm
☆23Updated 3 years ago
marek-robak / Drone-2d-custom-gym-env-for-reinforcement-learning
OpenAI Gym environment designed for training RL agents to control the flight of a two-dimensional drone.
☆52Updated 3 years ago
addy1997 / Gridworld
OpenAI gym-based algorithm for the grid world problem
☆28Updated 4 years ago
ArnaudFickinger / gym-multigrid
Lightweight multi-agent gridworld Gym environment
☆208Updated last year
Guillaume-Cr / lunar_lander_per
☆17Updated 5 years ago
jsztompka / MultiAgent-PPO
Proximal Policy Optimization with Beta distribution - uses multi agent Unity ML Tennis
☆29Updated 6 years ago
shivaverma / OpenAIGym
Solving OpenAI Gym problems.
☆187Updated 4 years ago
marctuscher / DRQN-tensorflow
Deep recurrent Q Learning using Tensorflow, openai/gym and openai/retro
☆175Updated 2 years ago
nikhilbarhate99 / Hierarchical-Actor-Critic-HAC-PyTorch
PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments
☆317Updated 3 years ago
cyoon1729 / RLcycle
A library for ready-made reinforcement learning agents and reusable components for neat prototyping
☆300Updated last year
Ullar-Kask / TD3-PER
An implementation of deep reinforcement learning TD3 algorithm with prioritized experience replay (PER) buffer
☆24Updated 5 years ago