☆13Jun 1, 2020Updated 5 years ago
Alternatives and similar repositories for cartpole_ppo_lstm
Users that are interested in cartpole_ppo_lstm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Pytorch code for "Learning Guidance Rewards with Trajectory-space Smoothing" (NeurIPS 2020)☆12Jul 7, 2021Updated 4 years ago
- Personalized Client-Edge-Cloud Hierarchical Federated Learning on Non-IID Data☆11Sep 7, 2023Updated 2 years ago
- A C++ Package for Solving Multiple-Phase Optimal Control Problem Using Adaptive Radau Pseudospectral Methods☆10Aug 31, 2020Updated 5 years ago
- MATLAB code of examples using Gauss pseudospectral method, MS thesis included☆10Sep 18, 2020Updated 5 years ago
- Official Repository for Can Language Models be Instructed to Protect Personal Information?☆13Oct 8, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ROS packages to control the KUKA LBR iiwa R820 manipulator via KUKA's research interface or in a Gazebo simulation.☆22Nov 5, 2015Updated 10 years ago
- 预测-校正学习计算制导律☆13Jun 22, 2021Updated 4 years ago
- Baseline implementation of recurrent PPO using truncated BPTT☆160Apr 28, 2024Updated last year
- Direct Method based on GPOPS☆20Apr 16, 2021Updated 5 years ago
- An AI agent that uses Deep Q-Networks and the DDPG algorithm to learn trajectory optimization in a customized gym environment.☆13Oct 30, 2021Updated 4 years ago
- This is official code for ASFL.☆22Mar 3, 2025Updated last year
- The implement of GAIL with pytorch☆14Mar 11, 2020Updated 6 years ago
- ☆17Oct 25, 2023Updated 2 years ago
- ☆19Nov 21, 2023Updated 2 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Using DDPG agent to control UAV system with energy efficiency☆16Jan 7, 2023Updated 3 years ago
- 2018 RoboCup@Rescue China / 2018 China Robot Competition@Rescue☆13Oct 8, 2019Updated 6 years ago
- Code for paper "Learning to Guide: Guidance Law Based on Deep Meta-learning and Model Predictive Path Integral Control"☆18May 26, 2019Updated 6 years ago
- Y. Ling, Y. Zhou and Q. Luo, "Lévy Flight Trajectory-Based Whale Optimization Algorithm for Global Optimization", IEEE Access, vol. 5, pp…☆18Sep 23, 2021Updated 4 years ago
- A python implementation for PILCO algorithm for a robotic arm - tested on mujoco robotics environment☆12Jan 8, 2020Updated 6 years ago
- A basic Python implementation of a Legendre-Gauss-Radau pseudospectral method for computational optimal control.☆16May 9, 2024Updated last year
- Adaptation of DQN, DDQN and COMA for multi-agent Gym environments☆10Oct 3, 2023Updated 2 years ago
- Academic Study of A Multi-Agent Quadrotors (Drones) Simulator with Obstacles and Goals Using the Artificial Potential Field Approach(APF)…☆19Feb 13, 2022Updated 4 years ago
- Simple demo for Databricks!☆14Sep 11, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Environments with IC3Net paper☆15Jan 8, 2019Updated 7 years ago
- ☆12Mar 4, 2024Updated 2 years ago
- Multi-Agent Deep Deterministic Policy Gradient implementation with pytorch☆10Aug 2, 2020Updated 5 years ago
- including PPO, SAC, TD3, DDPG, DQN, DDQN, DuelingDQN algorithms☆30Jun 6, 2025Updated 10 months ago
- Determination of optimal spacecraft landing trajectories via convex optimization☆19Jun 6, 2019Updated 6 years ago
- A Missile Guidance System to shoot air objects based on their trajectories using RNN.☆25Feb 19, 2020Updated 6 years ago
- ☆17Jun 23, 2022Updated 3 years ago
- 用DDPG/MADDPG/DQN/MADDPG+advantage实验 OpenAI开源的MPE环境☆24Jun 12, 2018Updated 7 years ago
- MATLAB implementation of DQN for a navigation environment☆13Aug 13, 2020Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Very very simple run on sumo☆13May 14, 2018Updated 7 years ago
- Transfer learning in deep reinforcement learning for continuous control. Implemented DDPG and TD3 algorithms and evaluated ability to ada…☆18Feb 25, 2025Updated last year
- This is a project based on OpenAI's multi-agent-emergence-environments (Emergent Tool Use from Multi-Agent Autocurricula, Baker et al.), …☆13Jan 5, 2021Updated 5 years ago
- 使用掘金量化终端对缠中说禅技术分析理论进行策略研究☆13Feb 25, 2021Updated 5 years ago
- Using N-step dueling DDQN with PER for playing Pacman game☆22Oct 27, 2019Updated 6 years ago
- Hierarchical and Stable Multiagent Reinforcement Learning for Cooperative Navigation Control☆14May 5, 2022Updated 3 years ago
- Notice that there are no torch-related code about item2vec, I just want to provide a readable item2vec implementation for researchers☆20May 12, 2022Updated 3 years ago