强化学习大作业1 倒立摆
☆20Dec 8, 2022Updated 3 years ago
Alternatives and similar repositories for Inverted-Pendulum
Users that are interested in Inverted-Pendulum are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 利用强化学习的Q价值迭代,Q学习以及SARSA方法解决小车爬山以及倒立摆的控制问题☆14Jul 25, 2019Updated 6 years ago
- ☆12Oct 13, 2018Updated 7 years ago
- ☆19Sep 6, 2017Updated 8 years ago
- 基于深度强化学习不同算法的移动机器人导航避障☆19Jul 6, 2021Updated 4 years ago
- 旋转倒立摆matlab物理模型仿真☆11Jun 5, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 数据科学与人工智能中文讲义☆14May 13, 2026Updated last week
- the implementation of Q_Learning☆18Jun 12, 2019Updated 6 years ago
- CFR-based Texas Hold'em AI☆11Jan 30, 2021Updated 5 years ago
- Simple CFD solver for teaching☆18Feb 15, 2011Updated 15 years ago
- An official implementation of Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards☆36Oct 3, 2025Updated 7 months ago
- Sim2Real Transfer for Deep Reinforcement Learning with Stochastic State Transition Delays, CORL-2020.☆26Jun 3, 2021Updated 4 years ago
- ☆35Sep 5, 2020Updated 5 years ago
- Collision Avoidance simulator for USV using Deep RL. A result of TTK4550 Fordypningsoppgave at NTNU☆21Mar 21, 2024Updated 2 years ago
- Numerical Simulation of 1-D Sod Shock Tube (MATLAB Codes)☆22Aug 11, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- obstacle avoidance code and algorithms.☆26Sep 2, 2020Updated 5 years ago
- 本书作者是来自日本的Yutaro Ogawa(小川熊太郎),作者的github上源码是日文注释的,这个repository把它翻译成中文☆22Dec 2, 2020Updated 5 years ago
- Code for Towards Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games☆24Feb 27, 2022Updated 4 years ago
- 这个仓库用于存储一些强化学习练手小项目与算法实验。具体来讲,就是不至于单独成一个 repo 的项目,但是又值得拿出来讨论的代码。☆27May 27, 2021Updated 4 years ago
- A simple pythonic implementation of a Riemann solver for the analytic solution of the sod shock tube.☆31Nov 18, 2021Updated 4 years ago
- ☆22May 20, 2021Updated 5 years ago
- ☆28Jul 11, 2024Updated last year
- Domain-specific preference (DSP) data and customized RM fine-tuning.☆25Mar 7, 2024Updated 2 years ago
- Revisiting Discrete Soft Actor-Critic Accepted by Transactions on Machine Learning Research (TMLR)☆29Nov 23, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Simple implementation of regret matching algorithm for RPS nash equilibrium computation via self-play☆26Sep 25, 2018Updated 7 years ago
- Reinforcement learning☆34Oct 20, 2025Updated 7 months ago
- 非结构化商业文本信息中隐私信息识别比赛代码仓库☆22Jan 11, 2024Updated 2 years ago
- (NeurIPS 2021) Neural Auto-Curricula in Two-Player Zero-Sum Games.☆28Nov 19, 2021Updated 4 years ago
- [BIBM 2024] XNet v2: Fewer Limitations, Better Results and Greater Universality☆39Oct 2, 2024Updated last year
- OpenAI gym environment for collision avoidance and path following with an AUV☆35Aug 12, 2019Updated 6 years ago
- Reinforcement Learning | Multi-Agent RL | Self-Play | Proximal Policy Optimization Algorithm (PPO) agent | Unity Tennis environment☆20Dec 2, 2025Updated 5 months ago
- Some example programs of DG which will be teached on Bilibili☆37Jul 6, 2023Updated 2 years ago
- Segment Policy Optimization: Effective Segment-Level Credit Assignment in RL for Large Language Models☆50Sep 19, 2025Updated 8 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- notes☆34Jun 28, 2022Updated 3 years ago
- Master thesis repo for implementation of Model Predictive Controller and reinforcement learning (RL) controller☆50Jan 16, 2024Updated 2 years ago
- 大学期间设计的一款双足仿生人形机器人,满足静态零力距点的步态行走,下位机采用搭载ucos-iii嵌入式试试操作系统的STM32F7,利用三次多项式轨迹规划及逆运动学解算控制机器人运动,上位机为Matlab GUI,通过simulink搭建机器人模型,完成simulink同实…☆64May 19, 2026Updated last week
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games☆40Aug 27, 2021Updated 4 years ago
- A simple demo for SAM+MMDetection☆51Apr 12, 2023Updated 3 years ago
- OpenAI gym environment of an Unmanned Surface Vehicle.☆49Apr 6, 2021Updated 5 years ago
- Actor Critic model to play Cartpole game☆53Aug 4, 2018Updated 7 years ago