Chinese edition of the OpenAI team's deep reinforcement learning tutorial
☆35May 16, 2020Updated 5 years ago
Alternatives and similar repositories for spinningup
Users that are interested in spinningup are comparing it to the libraries listed below.
- [ICLR 2023] The official code for paper "Guarded Policy Optimization with Imperfect Online Demonstrations"☆14Apr 30, 2023Updated 2 years ago
- PyTorch implementation of the paper Overcoming Exploration in Reinforcement Learning with Demonstrations in surgical robot manipulation t…☆12Aug 21, 2022Updated 3 years ago
- A PyTorch implementation of SSINet.☆16Nov 10, 2020Updated 5 years ago
- ☆16Mar 17, 2024Updated 2 years ago
- 臸娥粂陆亩竟☆10May 11, 2024Updated last year
- cc98 crawler☆15Sep 1, 2013Updated 12 years ago
- Modified PyTorch Lightning implementation of the paper: https://jcheminf.biomedcentral.com/track/pdf/10.1186/s13321-019-0407-y☆10Dec 22, 2020Updated 5 years ago
- Open source code for paper "Learning World Models with Identifiable Factorization"☆13Mar 4, 2024Updated 2 years ago
- Awesome works based on SSM and Mamba☆16Apr 10, 2024Updated 2 years ago
- OpenCV calling jetson/rk3588 MPP hardware decoding, with rewritten open and read functions; supports h264/h265☆14Nov 27, 2025Updated 4 months ago
- ffmpeg+cuvid+tensorrt+multicamera☆12Dec 31, 2024Updated last year
- android compose catalog☆17Jul 4, 2025Updated 9 months ago
- Experimenting with meta-learning approaches to opponent modelling in MARL. Building upon previous public implementations of MADDPG and M3…☆14Apr 26, 2022Updated 3 years ago
- Ossian generic framework☆12Aug 25, 2021Updated 4 years ago
- This is a Matlab and CasADi based Model Predictive Control (MPC) implementation for a kinematic vehicle model. The primary goal of this c…☆16Jun 16, 2023Updated 2 years ago
- GAMMA: A General Agent Motion Prediction Model for Autonomous Driving☆14Nov 17, 2021Updated 4 years ago
- ☆34Mar 24, 2023Updated 3 years ago
- ☆23Feb 24, 2023Updated 3 years ago
- Allegro hand controller package which works with ROS Noetic.☆24May 9, 2024Updated last year
- learning robust rewards with adversarial inverse reinforcement learning☆14Sep 13, 2020Updated 5 years ago
- Qt-like event loops, signals and slots for communication across threads and processes in Python☆14Mar 26, 2024Updated 2 years ago
- SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores☆15Apr 24, 2024Updated last year
- This is the official repository for the paper "Guided Exploration with Proximal Policy Optimization using a Single Demonstration", https:…☆19Oct 5, 2021Updated 4 years ago
- PSO for Nash Equilibrium. This is the code for my undergraduate thesis. Solving Nash equilibria with particle swarm optimization☆11Jan 5, 2023Updated 3 years ago
- A fully AI-driven DeepWiki, developed with the Go + Eino tech stack☆70Mar 26, 2026Updated 3 weeks ago
- fpv vehicle powered by esp32 cam☆10Aug 9, 2022Updated 3 years ago
- A reconstruction of RL Introduction and its course materials for a more efficient entry☆23Mar 4, 2026Updated last month
- Collision Avoidance using Buffered Voronoi Cell☆14Feb 10, 2017Updated 9 years ago
- ☆29Oct 10, 2018Updated 7 years ago
- Macro-Action Generator-Critic (MAGIC) - Learning Macro-actions for online POMDP planning☆17Feb 23, 2023Updated 3 years ago
- ☆19Jun 30, 2024Updated last year
- Learning Kinematic Feasibility through Reinforcement Learning: http://rl.uni-freiburg.de/research/kinematic-feasibility-rl☆24Jan 27, 2021Updated 5 years ago
- Maddpg_flight code☆11Jul 4, 2018Updated 7 years ago
- machine learning algorithms source code☆25Jun 8, 2021Updated 4 years ago
- IJCAI 2019 - Regularized Opponent Model with Maximum Entropy Objective (ROMMEO)☆23Dec 8, 2022Updated 3 years ago
- Code of the paper "Universal Morphology Control via Contextual Modulation" at ICML 2023☆13Aug 3, 2023Updated 2 years ago
- Code used for the paper 'Lithium-Ion Battery Management System with Reinforcement Learning for Balancing State of Charge and Cell Tempera…☆21May 9, 2023Updated 2 years ago
- object tracking learning☆13Aug 29, 2020Updated 5 years ago
- Chinese translation of "Reinforcement Learning: An Introduction" (2nd edition)☆661Apr 9, 2022Updated 4 years ago