chenzomi12 / Deep-Reinforcement-Learning
Code for the book "Deep Reinforcement Learning: Principles and Practices" (《深度强化学习:原理与实践》)
☆166 Updated 5 years ago
Alternatives and similar repositories for Deep-Reinforcement-Learning:
Users who are interested in Deep-Reinforcement-Learning compare it to the libraries listed below:
- PyTorch for Deep Reinforcement Learning ☆243 Updated 4 years ago
- Learning Resources And Links Of Reinforcement Learning (updating) ☆244 Updated 3 years ago
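- A repository for learning the basic principles of reinforcement learning, mainly containing code for some of the examples and exercises from the book 《深入浅出强化学习原理入门》 ☆256 Updated 6 years ago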
- Tutorial for Reinforcement Learning ☆178 Updated 3 years ago
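- Study notes on the deep reinforcement learning lectures given by Professor Hung-yi Lee (李宏毅) of National Taiwan University ☆137 Updated 5 years ago
- Reinforcement learning: Chinese notes and resources, mainly Python examples, from basic to advanced ☆92 Updated 4 years ago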
- basic algorithms of reinforcement learning ☆205 Updated last year
- ☆385 Updated 4 years ago
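- Chinese translation of "Reinforcement Learning: An Introduction" (2nd edition) ☆474 Updated 2 years ago
- My reinforcement learning notes and study materials (still updating ...) ☆338 Updated 5 years ago
- [Hands-on Reinforcement Learning] series, based on PyTorch. ☆54 Updated 3 years ago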
- ☆914 Updated 2 years ago
- PyTorch implementation of multiple deep reinforcement learning algorithms (DQN, DDPG, TD3, PPO, A3C...) with OpenAI Gym ☆56 Updated 3 years ago
- Tutorial4RL: Tutorial for Reinforcement Learning. An introductory reinforcement learning tutorial. ☆132 Updated 10 months ago
- GitHub's code repository is all you need ☆340 Updated last year
- ☆159 Updated last year
- Chinese documentation for Tianshou (天授) ☆55 Updated 2 months ago
- ☆20 Updated 6 years ago
- Practice of Deep Reinforcement Learning with Keras and gym. ☆158 Updated 5 years ago
- Reinforcement Learning Algorithm Package & PuckWorld, GridWorld Gym environments ☆848 Updated 5 years ago
- RL algorithms ☆140 Updated 3 years ago
- RL code for beginners. Enjoy! ☆111 Updated 4 years ago
- Study notes for 《白话强化学习与PyTorch》 (Reinforcement Learning and PyTorch in Plain Language) ☆33 Updated 4 years ago
- ☆45 Updated 5 years ago
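- Alignment has become a required step in fine-tuning GPT-style large models, and deep reinforcement learning is at the core of alignment. This project is a deep reinforcement learning application programming framework that supports training in non-gym environments and visual configuration; get started with reinforcement learning programming in 30 minutes. ☆73 Updated last year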
- An easier PyTorch deep reinforcement learning library. ☆183 Updated last month
- Multi-agent reinforcement learning ☆87 Updated 6 years ago
- Reinforcement Learning algorithms and use-cases, including DQN, PG, A3C, PPO etc. and RLHF, AlphaZero implementations. Designed for clari… ☆26 Updated 8 months ago
- Solve BipedalWalkerHardcore-v2 with TD3 ☆83 Updated last year
- Reinforcement-Learning-Notes, start with MDP. ☆221 Updated 2 years ago