OpenAI团队的深度强化学习教程中文版
☆92May 21, 2023Updated 2 years ago
Alternatives and similar repositories for spinningup
Users that are interested in spinningup are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- OpenAI团队的深度强化学习教程中文版☆34May 16, 2020Updated 5 years ago
- 用于教学的RL算法仓库,里面放置各种算法的最简单实现,目的是快速理解某个算法☆58May 26, 2025Updated 9 months ago
- ☆16Feb 25, 2026Updated 3 weeks ago
- ☆16Mar 24, 2023Updated 2 years ago
- ☆11Aug 13, 2020Updated 5 years ago
- Collision-Free Mixed-Integer Planning for Quadrotors Using Convex Safe Regions☆16May 16, 2020Updated 5 years ago
- learn from xprog and bertsimas's paper(price of robustness)☆20Jan 17, 2019Updated 7 years ago
- [ECCV'24] Unrolled Decomposed Unpaired Learning for Controllable Low-Light Video Enhancement☆13May 6, 2025Updated 10 months ago
- A PyTorch implementation of SSINet.☆16Nov 10, 2020Updated 5 years ago
- ☆13May 30, 2019Updated 6 years ago
- Build a bridge that connects beginners to deep reinforcement learning.☆11Sep 23, 2024Updated last year
- ☆12Apr 1, 2025Updated 11 months ago
- Predict direction of forex movement using word2vec and machine learning algorithms.☆10May 10, 2020Updated 5 years ago
- Generates smooth piecewise polynomial trajectories through waypoints☆18Jul 29, 2019Updated 6 years ago
- Code for generating options for planning and reinforcement learning☆12Feb 18, 2021Updated 5 years ago
- Open source code for paper "Learning World Models with Identifiable Factorization"☆13Mar 4, 2024Updated 2 years ago
- A standard bare-bone ROS Gazebo simulator for the Franka Emika Panda robot built using inbuilt Gazebo ROS controllers and RobotHW interfa…☆11May 3, 2021Updated 4 years ago
- Count based exploration with the successor representation for Unity ML's Pyramid☆12Jun 19, 2019Updated 6 years ago
- ☆19Oct 12, 2022Updated 3 years ago
- Code for the paper "TD or not TD: Analyzing the Role of Temporal Differencing in Deep Reinforcement Learning", Artemij Amiranashvili, Ale…☆12Aug 24, 2018Updated 7 years ago
- Code for Optimistic Exploration even with a Pessimistic Initialisation☆14Aug 4, 2020Updated 5 years ago
- Non-local Modeling for Image Quality Assessment☆13Dec 20, 2023Updated 2 years ago
- ☆14Oct 4, 2018Updated 7 years ago
- Solving VRPC with column generation and branch and price for fun and profit☆13Mar 27, 2023Updated 2 years ago
- A zsh plugin to use Esc+P to prepend proxychains (-q) to a command☆12Apr 24, 2016Updated 9 years ago
- Pytorch version of Hinton's Capsule Theory paper: Dynamic Routing Between Capsules☆45Jan 19, 2018Updated 8 years ago
- mobile DFF dataset☆12Nov 26, 2018Updated 7 years ago
- Learned User Representations in Online Social Networks (Twitter) using Temporal Dynamics of Information Diffusion.☆10Oct 15, 2018Updated 7 years ago
- 招聘网站信息监控工具,监控招聘网站工作岗位更新情况并发送通知☆14Feb 9, 2023Updated 3 years ago
- ☆19Jun 30, 2024Updated last year
- DOLPHYN: Decision Optimization for Low Carbon Power and Hydrogen Nexus☆42Updated this week
- mutil_column CNN for crowd counting☆12Dec 13, 2018Updated 7 years ago
- Lark 套件(飞书)Linux 客户端 release。非官方。☆10Jul 3, 2021Updated 4 years ago
- Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys…☆21Feb 24, 2023Updated 3 years ago
- ☆21Jul 2, 2024Updated last year
- ☆11Dec 1, 2024Updated last year
- 中文整理的强化学习资料(Reinforcement Learning)☆2,152Apr 30, 2020Updated 5 years ago
- ☆13Jul 13, 2016Updated 9 years ago
- Machine learning to predict future number Covid19 Daily Cases (7-day moving average). Long Short Term Memory (LSTM) Predictor and Reinfor…☆14Feb 21, 2021Updated 5 years ago