Posted at AAAI 2023
☆11Sep 4, 2025Updated 5 months ago
Alternatives and similar repositories for P3O
Users that are interested in P3O are comparing it to the libraries listed below
Sorting:
- Code release for "Supported Policy Optimization for Offline Reinforcement Learning" (NeurIPS 2022), https://arxiv.org/abs/2202.06239☆23Jun 24, 2023Updated 2 years ago
- ☆18Jul 10, 2022Updated 3 years ago
- Implementation of BIMRL: Brain Inspired Meta Reinforcement Learning - Roozbeh Razavi et al. (IROS 2022)☆10Dec 1, 2022Updated 3 years ago
- GNN模型在引文网络数据集上的代码,包括Cora、Citeseer、Pubmed、ogbn-arxiv☆10Mar 2, 2021Updated 5 years ago
- Code for the paper "Minimum-Delay Adaptation in Non-Stationary Reinforcement Learning via Online High-Confidence Change-Point Detection"☆11Aug 7, 2023Updated 2 years ago
- ☆11Oct 3, 2022Updated 3 years ago
- [AAMAS 2023] Code for the paper "Automatic Noise Filtering with Dynamic Sparse Training in Deep Reinforcement Learning"☆12Feb 22, 2024Updated 2 years ago
- ☆12Jan 30, 2021Updated 5 years ago
- ☆11Oct 19, 2020Updated 5 years ago
- Multi-task Multi-agent Soft Actor Critic for SMAC☆15Jan 18, 2022Updated 4 years ago
- ☆15Jul 1, 2021Updated 4 years ago
- Codes accompanying the paper "Offline Reinforcement Learning with Value-Based Episodic Memory" (ICLR 2022 https://arxiv.org/abs/2110.0979…☆15Mar 9, 2022Updated 3 years ago
- Mirror Descent Policy Optimization☆42Oct 31, 2020Updated 5 years ago
- Code for NeurIPS paper "Self-Organized Group for Cooperative Multi-agentReinforcement Learning".☆21Feb 20, 2023Updated 3 years ago
- OpenAI Gym environment for Robot Soccer Goal☆18May 17, 2019Updated 6 years ago
- ☆20May 22, 2023Updated 2 years ago
- Modular-HER is revised from OpenAI baselines and supports many improvements for Hindsight Experience Replay as modules.☆17Jun 23, 2021Updated 4 years ago
- Revisiting Discrete Gradient Estimation in MADDPG☆27Feb 24, 2023Updated 3 years ago
- Implementation of ICML 2023 paper: Future-conditioned Unsupervised Pretraining for Decision Transformer☆29Jul 25, 2023Updated 2 years ago
- [NeurIPS 2020 Spotlight Oral] "Training Stronger Baselines for Learning to Optimize", Tianlong Chen*, Weiyi Zhang*, Jingyang Zhou, Shiyu …☆29Dec 30, 2021Updated 4 years ago
- Authors' PyTorch implementation of 'Recomposing the Reinforcement Learning Building-Blocks with Hypernetworks' (HypeRL)☆26Jun 9, 2021Updated 4 years ago
- The official repository of Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)☆27Feb 3, 2022Updated 4 years ago
- We reproduced DeepMind's results and implement a meta-learning (MLSH) agent which can generalize across minigames.☆29Mar 30, 2021Updated 4 years ago
- Codes for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023☆30Jul 18, 2023Updated 2 years ago
- Decision Transformer for offline single-agent autonomous highway driving☆28Jun 19, 2023Updated 2 years ago
- Transformers are Meta-Reinforcement Learners - International Conference on Machine Learning (ICML) 2022☆67May 8, 2023Updated 2 years ago
- ☆30Aug 20, 2021Updated 4 years ago
- ☆33Mar 24, 2023Updated 2 years ago
- ☆33Dec 8, 2022Updated 3 years ago
- P3O paper code☆30Aug 7, 2019Updated 6 years ago
- ☆32Apr 25, 2021Updated 4 years ago
- ☆17Feb 1, 2026Updated last month
- Codes accompanying the paper "Influence-Based Multi-Agent Exploration" (ICLR 2020 spotlight)☆33Mar 16, 2020Updated 5 years ago
- NuART-Py: Python Library of Adaptive Resonance Theory Neural Network☆10Jan 26, 2020Updated 6 years ago
- A projet for simulating the rescue after a disaster☆10Dec 4, 2020Updated 5 years ago
- ☆11Nov 13, 2025Updated 3 months ago
- Source code for journal paper "Multiagent Reinforcement Learning With Sparse Interactions by Negotiation and Knowledge Transfer"☆13Dec 26, 2017Updated 8 years ago
- Code for optimal execution☆12Oct 29, 2020Updated 5 years ago
- A Caffe/C++ implementation of Deep Deterministic Policy Gradient☆10Feb 1, 2019Updated 7 years ago