A beginner's tutorial of reinforcement learning in both Chinese and English. 一份面向初学者的强化学习教程(中英双语)
☆11Aug 17, 2023Updated 2 years ago
Alternatives and similar repositories for RL101
Users that are interested in RL101 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Webots scene gym environment for drone navigation tasks methods☆13Sep 2, 2025Updated 7 months ago
- ☆22Feb 1, 2024Updated 2 years ago
- 基于CNN的糖尿病视网膜病变识别系统 | Diabetic retinopathy recognition system based on CNN☆17Aug 8, 2020Updated 5 years ago
- deprecated, moved to https://github.com/cggos/ccv☆12Jun 13, 2023Updated 2 years ago
- Vision-RADAR fusion for Robotics BEV Perception: A Survey☆12Jan 28, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆11Oct 19, 2020Updated 5 years ago
- ☆15May 20, 2025Updated 10 months ago
- Code associated with our paper "Estimating Risk and Uncertainty in Reinforcement Learning"☆11Oct 3, 2023Updated 2 years ago
- 尝试用基于值函数逼近的强化学习方法玩经典的马里奥游戏,取得了一定成果☆11Jul 21, 2021Updated 4 years ago
- Implementing DQNClipped and DQNReg Algorithms☆10Mar 2, 2021Updated 5 years ago
- Source code for ICML 2023 paper "Competing for Shareable Arms in Multi-Player Multi-Armed Bandits"☆10May 14, 2024Updated last year
- Framework for measuring sim-to-real gaps in robot joint motions. Supports different humanoids with physics simulation, real hardware data…☆80Feb 5, 2026Updated 2 months ago
- ☆10Sep 21, 2020Updated 5 years ago
- ☆12Jan 6, 2022Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Code of Paper "Cooperative Sensing and Uploading for Quality-Cost Tradeoff of Digital Twins in VEC", IEEE TCE, 2024.☆12Jul 10, 2023Updated 2 years ago
- ☆18Sep 7, 2023Updated 2 years ago
- ☆15Sep 21, 2020Updated 5 years ago
- Multi-Agent Deep Recurrent Q-Learning with Bayesian epsilon-greedy on AirSim simulator☆13Apr 1, 2022Updated 4 years ago
- Microsoft Word Plug-in to support Desktop Publishing: easy updating and positioning of figures and tables.☆13Sep 29, 2025Updated 6 months ago
- Code for my Master's thesis, game theory for adversarial autonomous vehicle platooning scenarios☆14Apr 28, 2023Updated 2 years ago
- deepkoopman的实现☆14Dec 6, 2022Updated 3 years ago
- Deep Recurrent Q-Network with different exploration strategies for self-driving cars (using AirSim)☆10Sep 5, 2024Updated last year
- In this repository, we try to solve musculoskeletal tasks with `Double DQN reinforcement learning` by using a `transformer` model has bee…☆17Nov 7, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆14Oct 2, 2022Updated 3 years ago
- Application of REINFORCE algorithm to downlink NOMA system☆13Jan 28, 2026Updated 2 months ago
- ☆15Oct 5, 2022Updated 3 years ago
- For my MSc final dissertation "Beamforming Optimization for Reconfigurable Intelligent Surfaces-Assisted Integrated Sensing and Communic…☆17Sep 1, 2024Updated last year
- DQN related algorithms☆10Mar 5, 2023Updated 3 years ago
- pytorch implementation of SAC, TD3 and TD7 with Mujoco Benchmark results from 4 seeds.☆15Jul 4, 2024Updated last year
- Different path tracking algoritms implemented in ROS.☆11Sep 14, 2021Updated 4 years ago
- ☆15Dec 9, 2021Updated 4 years ago
- ☆14Oct 3, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- 新型冠状病毒肺炎(COVID-19)疫情统计数据☆10Apr 5, 2020Updated 6 years ago
- 北大编译课程实践,独立完成的C语言子集SysY编译器,实现了从C语言编译到Koopa IR,再从Koopa IR编译到RISC-V汇编的实现☆34Jul 16, 2024Updated last year
- ☆16Oct 3, 2022Updated 3 years ago
- Code for IEEE GLOBECOM 2023 paper "Caching for Edge Inference at Scale: A Mean Field Multi-Agent Reinforcement Learning Approach".☆14May 13, 2024Updated last year
- Reinforced Learning for NS3 in Cognitive Radio spectrum selection☆11Aug 12, 2021Updated 4 years ago
- ROS进阶攻略系列视频课程☆13May 13, 2020Updated 5 years ago
- ☆16Oct 5, 2022Updated 3 years ago