taoyafan / Master_Graduation_Project
Master's thesis project: motion control of legged robots based on reinforcement learning
☆11 Updated 6 months ago
Alternatives and similar repositories for Master_Graduation_Project
Users that are interested in Master_Graduation_Project are comparing it to the libraries listed below
- The [Hands-on Reinforcement Learning] series, based on PyTorch. ☆55 Updated 4 years ago
- Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow ☆28 Updated 5 years ago
- Multi-agent reinforcement learning ☆98 Updated 6 years ago
- Code for the paper “Control Strategy of Speed Servo Systems Based on Deep Reinforcement Learning” ☆24 Updated 2 years ago
- Chinese translation of the official Stable Baselines documentation ☆97 Updated 4 years ago
- Some reinforcement learning examples implemented in PyTorch ☆36 Updated 6 years ago
- Solve BipedalWalkerHardcore-v2 with TD3 ☆89 Updated 2 years ago
- Tutorial for Reinforcement Learning ☆184 Updated 3 years ago
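- shouyuantianxia / Algorithmic-Game-Theory-Application-on-Multi-agent-Combat-and-Verification-Platform-Design: Source code appendix for the undergraduate thesis "Multi-Agent Game-Theoretic Wargaming: Theory and Verification Platform Design". The reinforcement learning implementation draws on Zhou Mofan's open-source code at https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow ☆55 Updated 4 years ago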
- My DRL library with tensorflow1.14 based on openai spinning-up ☆61 Updated 4 years ago
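- Alignment has become a necessary step in fine-tuning GPT-style large models, and deep reinforcement learning is at the core of alignment. This project is a deep reinforcement learning application programming framework that supports training in non-gym environments and visual configuration; get started with reinforcement learning programming in 30 minutes. ☆73 Updated 2 years ago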
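- A repository for learning the basic principles of reinforcement learning, mainly containing code for some of the examples and end-of-chapter exercises from the book 《深入浅出强化学习原理入门》 (roughly, "Reinforcement Learning Made Accessible: An Introduction to the Principles") ☆262 Updated 6 years ago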
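- Study of reinforcement learning topics: Q-learning and SARSA, followed by DQN, with some applications to path planning and a practical small-maze example ☆34 Updated 6 years ago
- Building a DQN deep reinforcement learning model with PyTorch ☆25 Updated 7 years ago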
- This is a reinforcement learning algorithm library. The code takes into account both performance and simplicity, with little dependence. ☆100 Updated 2 years ago
- ☆20 Updated 7 years ago
- RL-code for beginners. Enjoying! ☆115 Updated 5 years ago
- ☆390 Updated 4 years ago
- reinforcement learning ddpg code. follow deepmind papers. ☆60 Updated 7 years ago
- PyTorch implementation of the book 《强化学习-原理与Python实现》 ("Reinforcement Learning: Principles and Python Implementation"). ☆60 Updated 4 years ago
- Practice of Deep Reinforcement Learning with Keras and gym. ☆158 Updated 6 years ago
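- Python version of the Gale-Shapley algorithm: a market mechanism designed to find a stable matching. One side of the market, A, has m participants and the other side, B, has n participants; each member of A ranks the members of B by preference, and B has a corresponding ranking. The participants on one side, Ai, i=1,2,...,m, toward the participants on the other side, Bj, j=1,2… ☆9 Updated 5 years ago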
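- Reinforcement learning for path planning ☆50 Updated 4 years ago
- Multi-agent learning library ☆19 Updated 3 years ago
- Multiple control methods for cartpole (reinforcement learning, adaptive PID, particle swarm) ☆18 Updated 6 years ago
- Study notes on 白话强化学习与PyTorch ("Reinforcement Learning in Plain Language with PyTorch") ☆35 Updated 5 years ago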
- Simple Reinforcement learning tutorials ☆15 Updated 5 years ago
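- Uses deep reinforcement learning to achieve discrete, communication-free obstacle avoidance among multiple agents. The dataset needed to train the reinforcement learning model is generated by the Optimal Reciprocal Collision Avoidance (ORCA) algorithm. ☆85 Updated 6 years ago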
- Demonstrate the Q-Learning approach for AGV path planning ☆42 Updated 6 years ago
- Reinforcement Learning algorithms and use-cases, including DQN, PG, A3C, PPO etc. and RLHF, AlphaZero implementations. Designed for clari… ☆30 Updated last year