PPO with Hindsight Experience Replay (HER)
☆12 · May 8, 2018 · Updated 7 years ago
Alternatives and similar repositories for Hindsight-Experience-Replay
Users that are interested in Hindsight-Experience-Replay are comparing it to the libraries listed below.
- Reinforcement Learning for robotics continuous control, mainly based on Proximal Policy Optimization, extending to Interpolated Policy Gr… ☆38 · Feb 5, 2019 · Updated 7 years ago
- A paper list of sample-efficient reinforcement learning ☆18 · Jan 12, 2022 · Updated 4 years ago
- UAV formation reconfiguration ☆12 · Jul 28, 2018 · Updated 7 years ago
- ☆14 · Jun 15, 2021 · Updated 4 years ago
- ☆18 · Mar 19, 2019 · Updated 7 years ago
- The Code for Paper “Relay Hindsight Experience Replay: Self-Guided Continual Reinforcement Learning for Sequential Object Manipulation Ta… ☆159 · Jul 10, 2024 · Updated last year
- A representation learning method that predicts the Fourier transform of state sequences to improve sample efficiency of RL algorithms. ☆20 · Oct 26, 2023 · Updated 2 years ago
- Using curiosity-driven approaches to enhance navigation through labyrinthian environments ☆14 · Oct 26, 2022 · Updated 3 years ago
- Modeling and Simulation Project ☆13 · Apr 6, 2021 · Updated 5 years ago
- Code for paper "Learning to Guide: Guidance Law Based on Deep Meta-learning and Model Predictive Path Integral Control" ☆10 · May 26, 2019 · Updated 6 years ago
- gym_fetch_env with insert drawer open door ☆13 · Mar 22, 2022 · Updated 4 years ago
- ☆15 · Sep 28, 2022 · Updated 3 years ago
- ☆14 · Nov 4, 2022 · Updated 3 years ago
- PNDbotics model files (urdf/mjcf + meshes, etc) ☆23 · Mar 27, 2026 · Updated 3 weeks ago
- The name "KnowledgeMap" tries to use the metaphor of a cartographic map. If we represent all the different areas of knowledge as a bidime… ☆12 · Mar 29, 2016 · Updated 10 years ago
- Rank TD: End-to-End Robotic Reinforcement Learning without Reward Engineering and Demonstrations ☆14 · Oct 8, 2022 · Updated 3 years ago
- ☆11 · Oct 26, 2022 · Updated 3 years ago
- Code for paper "Learning to Guide: Guidance Law Based on Deep Meta-learning and Model Predictive Path Integral Control" ☆18 · May 26, 2019 · Updated 6 years ago
- A Recipe for Building LLM Reasoners to Solve Complex Instructions ☆31 · Oct 9, 2025 · Updated 6 months ago
- ☆11 · Jan 19, 2021 · Updated 5 years ago
- NordeaGo is a wrapper for the Nordea Open Banking API written in Go ☆13 · Mar 7, 2019 · Updated 7 years ago
- ☆13 · Dec 8, 2022 · Updated 3 years ago
- ☆20 · Dec 14, 2022 · Updated 3 years ago
- ☆12 · Jul 13, 2023 · Updated 2 years ago
- ☆13 · Feb 5, 2025 · Updated last year
- Robotic Welding Path Planning demo on various workpiece. Support multilayer planning for V shape groove. ☆45 · Nov 15, 2024 · Updated last year
- ☆13 · Jan 16, 2018 · Updated 8 years ago
- AI path planning and controller for formations of drones. ☆16 · Apr 8, 2021 · Updated 5 years ago
- minecraft .mca region data file pure js parser ☆20 · Oct 11, 2014 · Updated 11 years ago
- STM32F103xB Firmware. CMSIS-DAP (USB HID) + 2 x High-speed UART (USB CDC) + UART/SLCAN + USB-I2C ☆12 · Feb 8, 2020 · Updated 6 years ago
- ☆26 · Aug 16, 2023 · Updated 2 years ago
- Code for generating options for planning and reinforcement learning ☆12 · Feb 18, 2021 · Updated 5 years ago
- Frinkiac and Morbotron apps for Slack ☆15 · Mar 13, 2019 · Updated 7 years ago
- Towards Target-Driven Visual Navigation in Indoor Scenes via Generative Imitation Learning ☆12 · Dec 20, 2020 · Updated 5 years ago
- PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT… ☆18 · Oct 18, 2022 · Updated 3 years ago
- Minimal TensorFlow implementation of the Advantage Actor-Critic model for Atari games ☆12 · Feb 15, 2018 · Updated 8 years ago
- Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM) ☆147 · Jan 12, 2019 · Updated 7 years ago
- Personal finance manager that aims at being powerful and intuitive ☆11 · May 28, 2020 · Updated 5 years ago
- ☆14 · Apr 4, 2023 · Updated 3 years ago