Code for the paper "Skynet: A Top Deep RL Agent in the Inaugural Pommerman Team Competition"
☆37May 9, 2019Updated 6 years ago
Alternatives and similar repositories for pommerman-baseline
Users that are interested in pommerman-baseline are comparing it to the libraries listed below.
- PyTorch RL for Pommerman☆39Sep 24, 2018Updated 7 years ago
- Bomberman deep reinforcement learning challenge in PyTorch☆26Jan 3, 2019Updated 7 years ago
- Some baselines for Pommerman competition☆46Jul 18, 2018Updated 7 years ago
- Bombing AI agents☆12Jun 21, 2018Updated 7 years ago
- PlayGround: AI Research into Multi-Agent Learning.☆784Dec 19, 2023Updated 2 years ago
- Code for our paper: Hierarchical RL Using an Ensemble of Proprioceptive Periodic Policies☆15Feb 21, 2019Updated 7 years ago
- code of IJCAI submission "Soft Hindsight Experience Replay"☆13Mar 23, 2020Updated 6 years ago
- Model-Free-Episodic-Control implementation.☆17Jun 3, 2019Updated 6 years ago
- Reinforcement learning library for PyTorch.☆11Jun 15, 2018Updated 7 years ago
- Code for NeurIPS 2022 paper "Robust offline Reinforcement Learning via Conservative Smoothing"☆24Feb 15, 2023Updated 3 years ago
- ☆31Jul 1, 2019Updated 6 years ago
- ☆18Jan 3, 2022Updated 4 years ago
- PPO Dash: Improving Generalization in Deep Reinforcement Learning☆16Jul 17, 2019Updated 6 years ago
- Diversity−Driven Extensible Hierarchical Reinforcement Learning. AAAI 2019.☆49Feb 23, 2019Updated 7 years ago
- Curriculum-guided Hindsight Experience Replay (NeurIPS-2019)☆68Feb 14, 2020Updated 6 years ago
- ICML 2018 Self-Imitation Learning☆277Apr 18, 2020Updated 5 years ago
- OpenAI Gym compatible reinforcement learning environment for Space Fortress https://arxiv.org/abs/1809.02206☆11Aug 30, 2024Updated last year
- train, deploy, and make inferences using deep reinforcement learning to solve the Travelling Salesperson Problem☆19Dec 22, 2023Updated 2 years ago
- An environment for benchmarking commonsense agents☆29Aug 19, 2020Updated 5 years ago
- A standalone library to randomize various OpenAI Gym Environments☆66Sep 29, 2019Updated 6 years ago
- Ancestral Gumbel-Top-k Sampling☆25Apr 11, 2020Updated 5 years ago
- Related papers for offline reinforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)☆18Apr 21, 2022Updated 3 years ago
- Safe Reinforcement Learning with Natural Language Constraints☆15Oct 24, 2021Updated 4 years ago
- Author's PyTorch implementation of BCQ for continuous and discrete actions☆660Apr 6, 2021Updated 4 years ago
- Reinforcement Learning papers on exploration methods.☆19Jun 27, 2021Updated 4 years ago
- A branch-and-bound ILP solver☆27Apr 22, 2019Updated 6 years ago
- Code to reproduce the NeurIPS 2019 paper "Generalization in Reinforcement Learning with Selective Noise Injection and Information Bottlen…☆52Jun 28, 2020Updated 5 years ago
- Multi-perspective council analysis plugin for Claude Code. Spawns parallel cognitive perspectives to analyze questions, plans, and ideas …☆68Updated this week
- ☆11Oct 14, 2019Updated 6 years ago
- Options of Interest: Temporal Abstraction with Interest Functions AAAI 2020☆25Jul 31, 2020Updated 5 years ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆14May 25, 2023Updated 2 years ago
- Residual Quantization Autoencoder, used for interpreting LLMs☆14Jan 1, 2025Updated last year
- Reference implementation of algorithms for reinforcement learning and Markov decision processes.☆12Jan 28, 2021Updated 5 years ago
- Negative Update Intervals in Multi-Agent Deep Reinforcement Learning☆35May 14, 2019Updated 6 years ago
- [ICLR 2024] DMBP: Diffusion Model-Based Predictor for Robust Offline Reinforcement Learning against State Observations Perturbations.☆17May 24, 2024Updated last year
- SVIP: Towards Verifiable Inference of Open-Source Large Language Models☆14Jun 3, 2025Updated 9 months ago
- A binary release of trained deep reinforcement learning models trained in the Atari machine learning benchmark, and a software release th…☆202May 25, 2020Updated 5 years ago
- path finding algorithms☆17Apr 17, 2024Updated last year
- MishformerLens intends to be a drop-in replacement for TransformerLens that AST patches HuggingFace Transformers rather than implementing…☆10Oct 7, 2024Updated last year