onebula / Reinforcement_Learning_in_Action
☆20Updated 6 years ago
Related projects: ⓘ
- ☆45Updated 5 years ago
- [动手学强化学习]系列,基于pytorch。☆51Updated 3 years ago
- Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow☆27Updated 4 years ago
- A collection of multi agent environments based on OpenAI gym.☆20Updated 11 months ago
- 强化学习中纳什Qlearning 实现矩阵博弈☆27Updated 5 years ago
- simple code to reinforcement learning☆20Updated 4 years ago
- 一些利用pytorch编程实现的强化学习例子☆35Updated 5 years ago
- 在PyTorch上重构multi-agent deep deterministic policy gradient(MADDPG),将https://github.com/xuemei-ye/maddpg-mpe 修改到自己电脑上可运行。因为本人笔记本没有CUDA,实验速度…☆13Updated 5 years ago
- Re-produce DQN, REINFORCE, REINFORCE with baseline, one-step AC, QAC, QAC with shared network, PPO2, DDPG, TD3, SAC, SAC discrete,A2C,A3C☆18Updated 4 years ago
- 多智能体强化学习☆80Updated 5 years ago
- ☆24Updated 3 years ago
- 《深度强化学习:原理与实践》,Code of the book <Deep Reinforcement Learning: Principles and Practices>☆148Updated 5 years ago
- My DRL library with tensorflow1.14 based on openai spinning-up☆57Updated 3 years ago
- ☆24Updated 3 years ago
- scalable multi agents reinforcement learning☆53Updated 6 years ago
- ☆60Updated this week
- reinforcement learning ddpg code. follow deepmind papers.☆60Updated 6 years ago
- ppo+action mask for atari tennis agent☆9Updated last year
- Reinforcement Learning Algorithms Based on PyTorch☆17Updated 2 years ago
- code implementation for 'Bi-level Actor-Critic for Multi-agent Coordination'(AAAI2020)☆53Updated 4 years ago
- 多智能体即时策略对抗方法与实践 苏炯铭 刘鸿福 陈少飞 项凤涛 编著 科学出版社 2019.11 随书代码☆31Updated 3 years ago
- A novel DDPG method with prioritized experience replay (IEEE SMC 2017)☆45Updated 5 years ago
- Pytorch realization of multiple Deep Reinforcement Learning alogrithms(DQN,DDPG,TD3,PPO,A3C...) with openai gym☆53Updated 3 years ago
- 天授中文文档☆55Updated 2 years ago
- A code reimplementation of DeepMind's "Multiagent Cooperation and Competition with Deep Reinforcement Learning" with Tensorflow☆16Updated 6 years ago
- ☆29Updated 5 years ago
- A explaintable and modified version of udacity DRL homework☆26Updated 4 years ago
- Alignment成为GPT类大模型微调的必须环节,深度强化学习是Alignment的核心。本项目是一个支持非gym环境训练、支持可视化配置的深度强化学习应用编程框架,30分钟上手强化学习编程。☆70Updated last year
- Heuristic Reinforcement Learning☆11Updated 6 years ago
- 多智能体强化学习(MARL)算法复现,包括QMIX,VDN,QTRAN、MAVEN等等☆173Updated 2 years ago