BorealisAI / pommerman-baseline
Code for the paper "Skynet: A Top Deep RL Agent in the Inaugural Pommerman Team Competition"
☆37May 9, 2019Updated 6 years ago
Alternatives and similar repositories for pommerman-baseline
Users who are interested in pommerman-baseline are comparing it to the libraries listed below:
- PyTorch RL for Pommerman☆38Sep 24, 2018Updated 7 years ago
- Some baselines for Pommerman competition☆46Jul 18, 2018Updated 7 years ago
- Bombing AI agents☆12Jun 21, 2018Updated 7 years ago
- PlayGround: AI Research into Multi-Agent Learning.☆779Dec 19, 2023Updated 2 years ago
- code of IJCAI submission "Soft Hindsight Experience Replay"☆13Mar 23, 2020Updated 5 years ago
- Reinforcement learning library for PyTorch.☆11Jun 15, 2018Updated 7 years ago
- Diversity-Driven Extensible Hierarchical Reinforcement Learning. AAAI 2019.☆50Feb 23, 2019Updated 6 years ago
- Code for our paper: Hierarchical RL Using an Ensemble of Proprioceptive Periodic Policies☆15Feb 21, 2019Updated 6 years ago
- train, deploy, and make inferences using deep reinforcement learning to solve the Travelling Salesperson Problem☆20Dec 22, 2023Updated 2 years ago
- Model-Free-Episodic-Control implementation.☆17Jun 3, 2019Updated 6 years ago
- Related papers for offline reinforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)☆18Apr 21, 2022Updated 3 years ago
- PRML page-by-page companion materials: reviews of the whole PRML book and of each chapter☆17Apr 16, 2024Updated last year
- path finding algorithms☆17Apr 17, 2024Updated last year
- ☆25Nov 30, 2020Updated 5 years ago
- PPO Dash: Improving Generalization in Deep Reinforcement Learning☆16Jul 17, 2019Updated 6 years ago
- Code for NeurIPS 2022 paper "Robust offline Reinforcement Learning via Conservative Smoothing"☆23Feb 15, 2023Updated 2 years ago
- ICML 2018 Self-Imitation Learning☆278Apr 18, 2020Updated 5 years ago
- Curriculum-guided Hindsight Experience Replay (NeurIPS-2019)☆67Feb 14, 2020Updated 5 years ago
- ☆18Jan 3, 2022Updated 4 years ago
- Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys…☆21Feb 24, 2023Updated 2 years ago
- Author's PyTorch implementation of BCQ for continuous and discrete actions☆656Apr 6, 2021Updated 4 years ago
- A PyTorch implementation for Google Research Football.☆43Jun 14, 2019Updated 6 years ago
- Reinforcement Learning implementations and research prototyping in TensorFlow☆81Apr 28, 2019Updated 6 years ago
- ☆31Jan 7, 2023Updated 3 years ago
- Official implementation of NeurIPS22 paper “Multi-agent Dynamic Algorithm Configuration”☆26Mar 6, 2023Updated 2 years ago
- Find best-response to a fixed policy in multi-agent RL☆288Apr 1, 2022Updated 3 years ago
- Options of Interest: Temporal Abstraction with Interest Functions AAAI 2020☆25Jul 31, 2020Updated 5 years ago
- Self-Consistent Trajectory Autoencoder: Hierarchical Reinforcement Learning with Trajectory Embeddings☆96Jun 8, 2018Updated 7 years ago
- An environment for benchmarking commonsense agents☆29Aug 19, 2020Updated 5 years ago
- lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.☆378Nov 19, 2022Updated 3 years ago
- A PyTorch implementation of the ICML 2018 paper "Self-Imitation Learning".☆67Nov 4, 2018Updated 7 years ago
- MAGNet: Multi-agents control using Graph Neural Networks☆132Mar 28, 2019Updated 6 years ago
- Deep Planning Network: Control from pixels by latent planning with learned dynamics☆374Oct 15, 2021Updated 4 years ago
- Conflict-Based Search and Enhanced CBS in Julia☆35Feb 18, 2021Updated 4 years ago
- starter kit for vizdoom2018-singleplayer track☆28Jul 29, 2018Updated 7 years ago
- Baselines and memory-based scenarios for the ViZDoom simulator☆36Dec 8, 2022Updated 3 years ago
- [SIGKDD '24] PyTorch implementation of Temporal Prototype-Aware Learning for Active Voltage Control on Power Distribution Networks☆13Jul 28, 2024Updated last year
- On the pitfalls of measuring emergent communication☆34Mar 12, 2019Updated 6 years ago
- Negative Update Intervals in Multi-Agent Deep Reinforcement Learning☆35May 14, 2019Updated 6 years ago