Deep Reinforcement Learning by using an on-policy adaptation of Maximum a Posteriori Policy Optimization (MPO)
☆16Oct 23, 2021Updated 4 years ago
Alternatives and similar repositories for reinforcement_learning_v_mpo
Users that are interested in reinforcement_learning_v_mpo are comparing it to the libraries listed below
Sorting:
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238☆48Nov 10, 2020Updated 5 years ago
- Propose & vote on reading group papers in the "Discussions" tab.☆12Feb 20, 2024Updated 2 years ago
- Gandalf - Generic ANd DistAnce-invariant Laser Features☆14Aug 23, 2014Updated 11 years ago
- V-MPO torch version with DMLab30 and GTrXL☆13Mar 1, 2021Updated 5 years ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Aug 8, 2022Updated 3 years ago
- Official codebase for Generating Diverse Cooperative Agents by Learning Incompatible Policies (notable-top-25% @ ICLR 2023)☆19May 10, 2024Updated last year
- ☆16May 5, 2022Updated 3 years ago
- Bayesian active RL (BARL) and trajectory information planning (TIP)☆26Oct 11, 2022Updated 3 years ago
- Thermodynamics tool for H2O, H2, and CO2☆10May 12, 2023Updated 2 years ago
- Implementation of CASCADE in Learning General World Models in a Handful of Reward-Free Deployments (NeurIPS 22).☆29Oct 25, 2022Updated 3 years ago
- Pytorch implementation of "Maximum a Posteriori Policy Optimization" with Retrace for Discrete gym environments☆29Sep 10, 2020Updated 5 years ago
- Toolkit of Causal Model-based Reinforcement Learning.☆33Jun 5, 2023Updated 2 years ago
- Docker containers of baseline agents for the Crafter environment☆30Dec 14, 2021Updated 4 years ago
- A step-by-step guide for surrogate optimization using Gaussian Process surrogate model☆32Dec 17, 2020Updated 5 years ago
- Source files to replicate experiments in my ICLR 2022 paper.☆71Jul 17, 2025Updated 7 months ago
- PyTorch Implementation of the Maximum a Posteriori Policy Optimisation☆84Nov 19, 2022Updated 3 years ago
- Information and Materials for the Deep Learning Course☆31Jun 16, 2022Updated 3 years ago
- Official codebase for Redeeming Intrinsic Rewards via Constrained Policy Optimization☆83Apr 13, 2023Updated 2 years ago
- Code for our TVCG paper "DiffCap: Diffusion-based Real-time Human Motion Capture using Sparse IMUs and a Monocular Camera".☆19Aug 22, 2025Updated 6 months ago
- DreamSmooth: Improving Model-Based RL with Reward Smoothing (ICLR 2024)☆12May 6, 2024Updated last year
- A bot for automatically completing the KAIST safety course☆10Aug 29, 2023Updated 2 years ago
- 海思设备上部署阉割版yolov5☆13Nov 22, 2021Updated 4 years ago
- 小智ai机器人☆10Mar 8, 2025Updated 11 months ago
- Deploy Yolo series algorithms on Hisilicon platform hi3516, including yolov3, yolov5, yolox, etc☆11Mar 25, 2022Updated 3 years ago
- Official Repository for "Scaling Multi-Agent Reinforcement Learning with Selective Parameter Sharing" (ICML2021)☆10Oct 26, 2021Updated 4 years ago
- Safe Model-Based RL HVAC Control Using Epistemic Uncertainty Estimation.☆11Feb 25, 2025Updated last year
- The FaceFX Unreal Engine 5 plugin.☆10Updated this week
- ☆14Mar 27, 2025Updated 11 months ago
- Implementation of Probabilistic Roadmap Path Planning Algorithm.☆42Sep 19, 2023Updated 2 years ago
- ROS simulation of a UR5 robot to pick objects and inverse kinematics implementation in Python. Includes notebooks on Inverse Kinematics t…☆12Jul 2, 2021Updated 4 years ago
- Snapcraft instructions and config file for ros realsense + librealsense snap installs.☆10Jul 15, 2020Updated 5 years ago
- Code for ChunkFusion: A Learning-based RGB-D 3D Reconstruction Framework via Chunk-wise Integration☆12Apr 7, 2022Updated 3 years ago
- MSCKF with loop closure from ORB_SLAM2☆10Dec 27, 2019Updated 6 years ago
- ardrone simulation in gazebo(for kinetic and gazebo 7). Now it can run.☆10Oct 27, 2017Updated 8 years ago
- ☆11May 2, 2022Updated 3 years ago
- ☆32Feb 27, 2026Updated last week
- ☆11Jan 16, 2025Updated last year
- 3rd placed submission to the NeurIPS MineRL competition 2019☆10Mar 24, 2023Updated 2 years ago
- ☆11Oct 27, 2020Updated 5 years ago