Code for the paper "Skynet: A Top Deep RL Agent in the Inaugural Pommerman Team Competition"
☆38 · May 9, 2019 · Updated 7 years ago
Alternatives and similar repositories for pommerman-baseline
Users that are interested in pommerman-baseline are comparing it to the libraries listed below.
- PyTorch RL for Pommerman ☆39 · Sep 24, 2018 · Updated 7 years ago
- Bomberman deep reinforcement learning challenge in PyTorch ☆27 · Jan 3, 2019 · Updated 7 years ago
- Some baselines for Pommerman competition ☆46 · Jul 18, 2018 · Updated 7 years ago
- Code of IJCAI submission "Soft Hindsight Experience Replay" ☆13 · Mar 23, 2020 · Updated 6 years ago
- Model-Free-Episodic-Control implementation. ☆17 · Jun 3, 2019 · Updated 6 years ago
- ☆25 · Nov 30, 2020 · Updated 5 years ago
- Code for NeurIPS 2022 paper "Robust offline Reinforcement Learning via Conservative Smoothing" ☆24 · Feb 15, 2023 · Updated 3 years ago
- ☆31 · Jul 1, 2019 · Updated 6 years ago
- ☆18 · Jan 3, 2022 · Updated 4 years ago
- PRML page-by-page companion materials, with reviews of the whole PRML book and each of its chapters ☆17 · Apr 16, 2024 · Updated 2 years ago
- Diversity-Driven Extensible Hierarchical Reinforcement Learning. AAAI 2019. ☆49 · Feb 23, 2019 · Updated 7 years ago
- Curriculum-guided Hindsight Experience Replay (NeurIPS-2019) ☆67 · Feb 14, 2020 · Updated 6 years ago
- ICML 2018 Self-Imitation Learning ☆276 · Apr 18, 2020 · Updated 6 years ago
- OpenAI Gym compatible reinforcement learning environment for Space Fortress https://arxiv.org/abs/1809.02206 ☆11 · Aug 30, 2024 · Updated last year
- Train, deploy, and make inferences using deep reinforcement learning to solve the Travelling Salesperson Problem ☆19 · Dec 22, 2023 · Updated 2 years ago
- Working 8x8 systolic array hardware implemented in Xilinx Vivado, operated and controlled in software using Xilinx Vitis ☆19 · Feb 16, 2024 · Updated 2 years ago
- Ancestral Gumbel-Top-k Sampling ☆25 · Apr 11, 2020 · Updated 6 years ago
- Related papers for offline reinforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL) ☆18 · Apr 21, 2022 · Updated 4 years ago
- Author's PyTorch implementation of BCQ for continuous and discrete actions ☆663 · Apr 6, 2021 · Updated 5 years ago
- Reinforcement Learning papers on exploration methods. ☆19 · Jun 27, 2021 · Updated 4 years ago
- A branch-and-bound ILP solver ☆27 · Apr 22, 2019 · Updated 7 years ago
- Code to reproduce the NeurIPS 2019 paper "Generalization in Reinforcement Learning with Selective Noise Injection and Information Bottlen… ☆52 · Jun 28, 2020 · Updated 5 years ago
- ☆31 · Mar 26, 2026 · Updated 2 months ago
- Safe Reinforcement Learning with Natural Language Constraints ☆16 · Oct 24, 2021 · Updated 4 years ago
- ☆11 · Oct 14, 2019 · Updated 6 years ago
- Options of Interest: Temporal Abstraction with Interest Functions AAAI 2020 ☆25 · Jul 31, 2020 · Updated 5 years ago
- Hardware and Software Co-design implementations ☆16 · Dec 5, 2019 · Updated 6 years ago
- Residual Quantization Autoencoder, used for interpreting LLMs ☆14 · Jan 1, 2025 · Updated last year
- Reference implementation of algorithms for reinforcement learning and Markov decision processes. ☆12 · Jan 28, 2021 · Updated 5 years ago
- Global Dark Mode for ALL apps on ANY platforms. ☆19 · Oct 3, 2023 · Updated 2 years ago
- Negative Update Intervals in Multi-Agent Deep Reinforcement Learning ☆35 · May 14, 2019 · Updated 7 years ago
- SVIP: Towards Verifiable Inference of Open-Source Large Language Models ☆15 · Jun 3, 2025 · Updated 11 months ago
- A binary release of trained deep reinforcement learning models trained in the Atari machine learning benchmark, and a software release th… ☆204 · May 25, 2020 · Updated 6 years ago
- Official gym API for game FightingICE. ☆15 · Feb 17, 2024 · Updated 2 years ago
- path finding algorithms ☆17 · Apr 17, 2024 · Updated 2 years ago
- MishformerLens intends to be a drop-in replacement for TransformerLens that AST patches HuggingFace Transformers rather than implementing… ☆10 · Oct 7, 2024 · Updated last year
- It's the pytorch implementation of google research football. ☆43 · Jun 14, 2019 · Updated 6 years ago
- lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms. ☆378 · Nov 19, 2022 · Updated 3 years ago
- @ngrok/mantle ui component library ☆15 · May 15, 2026 · Updated last week