强化学习大作业1 倒立摆
☆20Dec 8, 2022Updated 3 years ago
Alternatives and similar repositories for Inverted-Pendulum
Users that are interested in Inverted-Pendulum are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 利用强化学习的Q价值迭代,Q学习以及SARSA方法解决小车爬山以及倒立摆的控制问题☆14Jul 25, 2019Updated 6 years ago
- ☆12Oct 13, 2018Updated 7 years ago
- DDPG on OpenAI Gym Pendulum☆17Jul 1, 2016Updated 9 years ago
- Hybrid Computational Offloading☆16Jul 6, 2022Updated 3 years ago
- 数据科学与人工智能中文讲义☆14Updated this week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- the implementation of Q_Learning☆18Jun 12, 2019Updated 6 years ago
- CFR-based Texas Hold'em AI☆11Jan 30, 2021Updated 5 years ago
- An official implementation of Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards☆37Oct 3, 2025Updated 5 months ago
- Courses in UCAS☆14Jun 12, 2023Updated 2 years ago
- Sim2Real Transfer for Deep Reinforcement Learning with Stochastic State Transition Delays, CORL-2020.☆26Jun 3, 2021Updated 4 years ago
- ☆35Sep 5, 2020Updated 5 years ago
- obstacle avoidance code and algorithms.☆25Sep 2, 2020Updated 5 years ago
- Collision Avoidance simulator for USV using Deep RL. A result of TTK4550 Fordypningsoppgave at NTNU☆21Mar 21, 2024Updated 2 years ago
- ☆16Oct 6, 2019Updated 6 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- 本书作者是来自日本的Yutaro Ogawa(小川熊太郎),作者的github上源码是日文注释的,这个repository把它翻译成中文☆22Dec 2, 2020Updated 5 years ago
- This branch contain the java classes for orekit-python-wrapper☆19Aug 3, 2025Updated 7 months ago
- Code for Towards Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games☆24Feb 27, 2022Updated 4 years ago
- 这个仓库用于存储一些强化学习练手小项目与算法实验。具体来讲,就是不至于单独成一个 repo 的项目,但是又值得拿出来讨论的代码。☆26May 27, 2021Updated 4 years ago
- Pytorch implementation of "Learning Domain-Aware Detection Head with Prompt Tuning" (NeurIPS 2023)☆23Mar 6, 2024Updated 2 years ago
- ☆27Jul 11, 2024Updated last year
- Code for "AutoCFR: Learning to Design Counterfatual Regret Minimization Algorithms", AAAI 2022 (Oral)☆22Apr 22, 2024Updated last year
- Revisiting Discrete Soft Actor-Critic Accepted by Transactions on Machine Learning Research (TMLR)☆29Nov 23, 2024Updated last year
- Simple implementation of regret matching algorithm for RPS nash equilibrium computation via self-play☆26Sep 25, 2018Updated 7 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Reinforcement learning☆35Oct 20, 2025Updated 5 months ago
- (NeurIPS 2021) Neural Auto-Curricula in Two-Player Zero-Sum Games.☆28Nov 19, 2021Updated 4 years ago
- OpenAI gym environment for collision avoidance and path following with an AUV☆35Aug 12, 2019Updated 6 years ago
- Reinforcement Learning | Multi-Agent RL | Self-Play | Proximal Policy Optimization Algorithm (PPO) agent | Unity Tennis environment☆20Dec 2, 2025Updated 3 months ago
- Team SINGABOAT-VRX's GitHub Repository for Virtual RobotX (VRX) Competition.☆52Nov 6, 2022Updated 3 years ago
- Segment Policy Optimization: Effective Segment-Level Credit Assignment in RL for Large Language Models☆45Sep 19, 2025Updated 6 months ago
- notes☆33Jun 28, 2022Updated 3 years ago
- Master thesis repo for implementation of Model Predictive Controller and reinforcement learning (RL) controller☆48Jan 16, 2024Updated 2 years ago
- 大学期间设计的一款双足仿生人形机器人,满足静态零力距点的步态行走,下位机采用搭载ucos-iii嵌入式试试操作系统的STM32F7,利用三次多项式轨迹规划及逆运动学解算控制机器人运动,上位机为Matlab GUI,通过simulink搭建机器人模型,完成simulink同实…☆58Aug 30, 2022Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Instruction Following Agents with Multimodal Transforemrs☆53Nov 3, 2022Updated 3 years ago
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games☆40Aug 27, 2021Updated 4 years ago
- OpenAI gym environment of an Unmanned Surface Vehicle.☆48Apr 6, 2021Updated 4 years ago
- The Continual Learning in Multimodality Benchmark☆68Jun 24, 2023Updated 2 years ago
- 动手学强化学习代码☆65Jan 17, 2024Updated 2 years ago
- 2021-2022国科大强化学习格斗游戏大作业☆37Jun 11, 2022Updated 3 years ago
- 智能自平衡小车,实现平衡功能的基础上,加入了超声波避障、超声波跟随、蓝牙遥控等功能☆43Jan 1, 2025Updated last year