YangShengqi/cartpole_ppo_lstm

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/YangShengqi/cartpole_ppo_lstm)

YangShengqi / cartpole_ppo_lstm

☆13

Alternatives and similar repositories for cartpole_ppo_lstm

Users that are interested in cartpole_ppo_lstm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

cvigoe / DRL4MAAS
View on GitHub
Code for paper "Multi-Agent Active Search: a Reinforcement Learning Approach", submitted to ICRA 2022.
☆13Sep 19, 2021Updated 4 years ago
SimRey / HPPO
View on GitHub
☆11Sep 13, 2025Updated 10 months ago
jaromiru / sr-drl
View on GitHub
Implementation of Symbolic Relational Deep Reinforcement Learning based on Graph Neural Networks
☆27Aug 24, 2023Updated 2 years ago
xiangqianL / ESPerHFL
View on GitHub
Personalized Client-Edge-Cloud Hierarchical Federated Learning on Non-IID Data
☆11Sep 7, 2023Updated 2 years ago
eddyxzc / lpopc
View on GitHub
A C++ Package for Solving Multiple-Phase Optimal Control Problem Using Adaptive Radau Pseudospectral Methods
☆10Aug 31, 2020Updated 5 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
rtkg / lbr_iiwa
View on GitHub
ROS packages to control the KUKA LBR iiwa R820 manipulator via KUKA's research interface or in a Gazebo simulation.
☆22Nov 5, 2015Updated 10 years ago
LutterWa / Computational-TSG
View on GitHub
预测-校正学习计算制导律
☆13Jun 22, 2021Updated 5 years ago
MarcoMeter / recurrent-ppo-truncated-bptt
View on GitHub
Baseline implementation of recurrent PPO using truncated BPTT
☆161Apr 28, 2024Updated 2 years ago
BUPT-ANTlab / PEPCRL-MVP
View on GitHub
☆17Oct 25, 2023Updated 2 years ago
byronbenharris / reinforcement-learning-trajectory-optimization
View on GitHub
An AI agent that uses Deep Q-Networks and the DDPG algorithm to learn trajectory optimization in a customized gym environment.
☆13Oct 30, 2021Updated 4 years ago
CansenJIANG / Pcl_Openni_Tutorial
View on GitHub
☆13Jun 23, 2015Updated 11 years ago
davide97l / rl-traingenerator
View on GitHub
Automatic code generator for training Reinforcement Learning policies
☆11Jan 3, 2021Updated 5 years ago
South-hw / FedPara_ICLR22
View on GitHub
☆12Dec 26, 2024Updated last year
vagelis-chantzis / Arduino-Robot
View on GitHub
Light Seeking and Obstacle Avoiding Robot
☆10Feb 7, 2017Updated 9 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
brenda-Zheng / Exponential-Predefined-Time-Trajectory-Tracking-Control
View on GitHub
☆20Nov 21, 2023Updated 2 years ago
ducmngx / DDPG-UAV-Efficiency
View on GitHub
Using DDPG agent to control UAV system with energy efficiency
☆16Jan 7, 2023Updated 3 years ago
BenjyWP / deep_meta-learning_guidance_law
View on GitHub
Code for paper "Learning to Guide: Guidance Law Based on Deep Meta-learning and Model Predictive Path Integral Control"
☆19May 26, 2019Updated 7 years ago
Alonso94 / Data-efficient-RL
View on GitHub
A python implementation for PILCO algorithm for a robotic arm - tested on mujoco robotics environment
☆12Jan 8, 2020Updated 6 years ago
christinakouridi / multiagent_gym
View on GitHub
Adaptation of DQN, DDQN and COMA for multi-agent Gym environments
☆10Oct 3, 2023Updated 2 years ago
pranscript / ETH-NFT-Twitter-sales-bot
View on GitHub
Twitter-NFT sales bot that tweets individual and sweep sales with images from Opensea, Looksrare, X2Y2, and Blur using Opensea/Looksrare …
☆13Jul 27, 2023Updated 3 years ago
SinaMirrazavi / SESODS_lib
View on GitHub
Learning second order dynamical system
☆13Feb 12, 2019Updated 7 years ago
keep9oing / DRQN-Pytorch-CartPole-v1
View on GitHub
Deep recurrent Q learning on CartPole-v1 environment
☆96Jan 15, 2024Updated 2 years ago
lidiaxp / plannie
View on GitHub
☆12Mar 4, 2024Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
apsdehal / ic3net-envs
View on GitHub
Environments with IC3Net paper
☆15Jan 8, 2019Updated 7 years ago
PeixianChen / RL-MPE
View on GitHub
用DDPG/MADDPG/DQN/MADDPG+advantage实验 OpenAI开源的MPE环境
☆24Jun 12, 2018Updated 8 years ago
deligentfool / maddpg
View on GitHub
Multi-Agent Deep Deterministic Policy Gradient implementation with pytorch
☆10Aug 2, 2020Updated 5 years ago
abnsl0014 / Missile-Guidance-System
View on GitHub
A Missile Guidance System to shoot air objects based on their trajectories using RNN.
☆26Feb 19, 2020Updated 6 years ago
kevincrispie / Optimizing-Spacecraft-Trajectories
View on GitHub
Determination of optimal spacecraft landing trajectories via convex optimization
☆19Jun 6, 2019Updated 7 years ago
uc3m-aerospace / DMG
View on GitHub
Direct Method based on GPOPS
☆20Apr 16, 2021Updated 5 years ago
CarlossShi / Competition_3v3snakes
View on GitHub
☆17Jun 23, 2022Updated 4 years ago
mattjhawken / deep-rl-trading
View on GitHub
A transformer-based deep RL trading bot built with PyTorch.
☆13Jan 16, 2025Updated last year
cholazzzb / APF_Swarm_Control_Simulator
View on GitHub
Academic Study of A Multi-Agent Quadrotors (Drones) Simulator with Obstacles and Goals Using the Artificial Potential Field Approach(APF)…
☆19Feb 13, 2022Updated 4 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
KaiYan289 / RL_as_Vitamin_for_Online_Decision_Transformers
View on GitHub
☆16Dec 5, 2024Updated last year
cocolico14 / N-step-Dueling-DDQN-PER-Pacman
View on GitHub
Using N-step dueling DDQN with PER for playing Pacman game
☆22Oct 27, 2019Updated 6 years ago
YueJIN-EE / HIST-MADRL
View on GitHub
Hierarchical and Stable Multiagent Reinforcement Learning for Cooperative Navigation Control
☆14May 5, 2022Updated 4 years ago
wojciechmo / rl-grid-world
View on GitHub
Compare Q-Learning and Expected Value SARSA.
☆11Oct 7, 2018Updated 7 years ago
zer0int / CLIP-text-image-interpretability
View on GitHub
Get CLIP ViT text tokens about an image, visualize attention as a heatmap.
☆15Aug 8, 2023Updated 2 years ago
adik993 / ppo-pytorch
View on GitHub
Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)
☆148Jan 12, 2019Updated 7 years ago
srsohn / msgi
View on GitHub
ICLR 2020 Meta Reinforcement Learning with Autonomous Inference of Subtask Dependencies
☆18Jul 16, 2020Updated 6 years ago