LyWangPX / Reinforcement_Learning_Coding_Examples
My personal practice to implement algorithms of RL from scratch.
☆37Updated 5 years ago
Alternatives and similar repositories for Reinforcement_Learning_Coding_Examples
Users who are interested in Reinforcement_Learning_Coding_Examples are comparing it to the repositories listed below
- Stable-Baselines3 (SB3) reinforcement learning tutorial for the Reinforcement Learning Virtual School 2021.☆54Updated 2 years ago
- Assignments for Berkeley CS 285: Deep Reinforcement Learning (Fall 2021)☆145Updated 3 years ago
- A curated list of awesome Model-based reinforcement learning resources☆93Updated 5 years ago
- Deep Q-Network (DQN) and Fitted Q-Iteration (FQI) tutorial for RL Summer School 2023☆77Updated 10 months ago
- Implementation of Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor.☆28Updated 4 months ago
- Collection of Deep Reinforcement Learning Algorithms implemented in PyTorch.☆78Updated 4 years ago
- PyTorch Implementation of Hamilton-Jacobi DQN☆16Updated 4 years ago
- Experiments with reinforcement learning and recurrent neural networks☆114Updated last year
- DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,…☆121Updated 4 years ago
- Deep Reinforcement Learning for Continuous Control in PyTorch☆104Updated 3 years ago
- Solutions of assignments of Deep Reinforcement Learning course presented by the University of California, Berkeley (CS285) in Pytorch fra…☆141Updated 2 years ago
- An implementation of Deep Q-Learning from Demonstrations (DQfD) for playing Atari 2600 video games☆32Updated 2 years ago
- Collection of OpenAI parametrized action-space environments.☆66Updated 5 months ago
- PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.☆146Updated 3 years ago
- Reinforcement Learning | Multi-Agent RL | Self-Play | Proximal Policy Optimization Algorithm (PPO) agent | Unity Tennis environment☆20Updated 2 years ago
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)☆88Updated last year
- ☆78Updated 6 years ago
- The continuous mountain car problem solved with DDPG☆13Updated 5 years ago
- My solutions to UC Berkeley CS285 (originally CS294-112, deeprlcourse) Fall 2019 assignments☆117Updated 2 years ago
- Solving POMDP using Recurrent networks☆91Updated 5 years ago
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆187Updated last year
- A repository implementing DQN and its variants in PyTorch, based on the original paper.☆13Updated 5 years ago
- Soft Actor-Critic with advanced features☆51Updated this week
- A collection of recent MARL papers☆96Updated 9 months ago
- Summary of key papers in deep reinforcement learning. Heavily based on OpenAI SpinningUp.☆82Updated 5 years ago
- Prioritized Experience Replay implementation with proportional prioritization☆84Updated 2 years ago
- Baseline implementation of recurrent PPO using truncated BPTT☆151Updated last year
- Experiments to train transformer network to master reinforcement learning environments.☆32Updated 4 years ago
- DSAC; Distributional Soft Actor-Critic☆131Updated 7 months ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros.☆52Updated 4 months ago