Reinforcement learning algorithms with Generalized Advantage Estimation
☆22Jun 6, 2018Updated 8 years ago
Alternatives and similar repositories for GAE
Users that are interested in GAE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Examples of published reinforcement learning algorithms in recent literature implemented in TensorFlow☆103Aug 3, 2020Updated 5 years ago
- Source code for Pathfinding in Stochastic Environments paper.☆15Oct 27, 2022Updated 3 years ago
- ☆15Nov 22, 2019Updated 6 years ago
- Asynchronous Advantage Actor-Critic using Generalized Advantage Estimation (PyTorch)☆10Oct 11, 2019Updated 6 years ago
- Implementation of Symbolic Relational Deep Reinforcement Learning based on Graph Neural Networks☆27Aug 24, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- This the repository of the accompanying MATLAB codes for the Book Navigation and Tracking in Space: Analysis and Algorithms☆12Jan 15, 2024Updated 2 years ago
- Plotly Dash ユーザーガイドチュートリアル日本語化プロジェクト I am working on Translation of English Dash tutorial into Japanese. This repository will be aborted …☆12Mar 25, 2019Updated 7 years ago
- A toolbox for the main functions used in spacecraft attitude determination and control.☆11Jan 10, 2019Updated 7 years ago
- ☆17Nov 16, 2022Updated 3 years ago
- Proximal Policy Optimization implementation with TensorFlow☆108Oct 9, 2018Updated 7 years ago
- Bayesian Estimation of the GARCH(1,1) Model with Student-t Innovations☆16May 16, 2021Updated 5 years ago
- Official Repository for "Scaling Multi-Agent Reinforcement Learning with Selective Parameter Sharing" (ICML2021)☆10Oct 26, 2021Updated 4 years ago
- A benchmark for evaluating reinforcement learning algorithms that train the policies using imaginary rollouts from LLMs.☆15Nov 4, 2025Updated 8 months ago
- ☆13Sep 15, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A DQN implementation using Keras and Tensorflow☆11Oct 11, 2018Updated 7 years ago
- timeseries prediction using dynamic linear models and LSTM☆13Nov 3, 2017Updated 8 years ago
- Self-Questioning Language Models☆57Mar 30, 2026Updated 3 months ago
- [NeurIPS'20] Code for the paper "Offline Imitation Learning with a Misspecified Simulator"☆12Nov 24, 2021Updated 4 years ago
- Calibration tool to estimate the pitch and roll angle of a Kinect-type depth sensor.☆12Mar 17, 2018Updated 8 years ago
- Code for the paper "Continual Model-Based Reinforcement Learning with Hypernetworks"☆15Jul 28, 2021Updated 4 years ago
- 作業系統實作☆13Apr 26, 2018Updated 8 years ago
- A dedicated solver for the capture problem initially presented in S. Caron, B. Mallein "Balance control using both ZMP and COM height var…☆12Oct 16, 2019Updated 6 years ago
- ☆11Jan 20, 2016Updated 10 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Udacity Deep Reinforecment Learning - Implementation of Proximal Policy Optimization (PPO)☆14Nov 1, 2018Updated 7 years ago
- Computational time vs quality comparison between some Edge preserving smoothing filters☆10May 5, 2017Updated 9 years ago
- send and receive message and file by python3 socket☆12May 24, 2018Updated 8 years ago
- Implementation of proximal policy optimization(PPO) with tensorflow☆35Feb 10, 2018Updated 8 years ago
- A tool for experimenting with evolutionary optimization methods for machine learning algorithms, by distributing the workload over a larg…☆14Dec 19, 2018Updated 7 years ago
- Git - basic commands☆16Jun 8, 2021Updated 5 years ago
- ☆11Nov 29, 2021Updated 4 years ago
- 3D linearized quadcopter controller and trajectory generator for solving the Robotics Flight Coursera course assignment☆13Feb 14, 2016Updated 10 years ago
- Resilient Multi-Agent Reinforcement Learning☆10Nov 4, 2022Updated 3 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- The labs of ARC university courses☆12Aug 29, 2023Updated 2 years ago
- Deep Reinforcement Learning with continuous control in CARLA☆11Dec 8, 2022Updated 3 years ago
- Implement Categorical Variational autoencoder using Pytorch☆15Apr 25, 2018Updated 8 years ago
- Actor Prioritized Experience Replay☆19Nov 20, 2023Updated 2 years ago
- An RL agent for the Google Football environment☆95Jun 19, 2021Updated 5 years ago
- Distributed Priortized Experience Replay☆10Aug 8, 2018Updated 7 years ago
- ☆14Jul 27, 2022Updated 3 years ago