rwightman / pytorch-pommerman-rl
PyTorch RL for Pommerman
☆38 · Sep 24, 2018 · Updated 7 years ago
Alternatives and similar repositories for pytorch-pommerman-rl
Users who are interested in pytorch-pommerman-rl are comparing it to the libraries listed below:
- Code for the paper "Skynet: A Top Deep RL Agent in the Inaugural Pommerman Team Competition" ☆37 · May 9, 2019 · Updated 6 years ago
- Some baselines for Pommerman competition ☆46 · Jul 18, 2018 · Updated 7 years ago
- Bomberman deep reinforcement learning challenge in PyTorch ☆26 · Jan 3, 2019 · Updated 7 years ago
- PlayGround: AI Research into Multi-Agent Learning. ☆779 · Dec 19, 2023 · Updated 2 years ago
- code of IJCAI submission "Soft Hindsight Experience Replay" ☆13 · Mar 23, 2020 · Updated 5 years ago
- Bombing AI agents ☆12 · Jun 21, 2018 · Updated 7 years ago
- Code for NeurIPS 2022 paper "Robust offline Reinforcement Learning via Conservative Smoothing" ☆23 · Feb 15, 2023 · Updated 3 years ago
- Code for the paper "Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under Data Augmentation" ☆19 · Jul 11, 2023 · Updated 2 years ago
- ☆114 · Nov 15, 2019 · Updated 6 years ago
- ☆18 · Jan 3, 2022 · Updated 4 years ago
- SOTA on TabFact: Graph Neural Network for Table-based Fact Checking ☆18 · Dec 10, 2020 · Updated 5 years ago
- Code to reproduce the results of "Curiosity Driven Exploration of Learned Disentangled Goal Spaces" ☆19 · Oct 26, 2018 · Updated 7 years ago
- TensorFlow & Keras implementation of DQN with HER (Hindsight Experience Replay) ☆40 · Jul 31, 2020 · Updated 5 years ago
- Hindsight policy gradients ☆46 · Jan 31, 2020 · Updated 6 years ago
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322) ☆51 · Dec 8, 2022 · Updated 3 years ago
- Official implementation of NeurIPS22 paper “Multi-agent Dynamic Algorithm Configuration” ☆26 · Mar 6, 2023 · Updated 2 years ago
- ☆31 · Jan 7, 2023 · Updated 3 years ago
- Code repository for Active Domain Randomization (CoRL 2019, https://arxiv.org/abs/1904.04762) ☆101 · Jan 4, 2021 · Updated 5 years ago
- [IJCAI'20][ICLR'19 Workshop] Flow-based Intrinsic Curiosity Module. Playing SuperMario with RL agent and FICM! ☆104 · Dec 8, 2022 · Updated 3 years ago
- Process Simulations Meet AI. Supercharge Your Process Engineering. Generate Infinite Data, Train Advanced Models, and Revolutionise Indus… ☆11 · Oct 8, 2024 · Updated last year
- Basic versions of agents from Spinning Up in Deep RL written in PyTorch ☆209 · May 20, 2021 · Updated 4 years ago
- The project is an official implementation of our paper "A Lightweight Graph Transformer Network for Human Mesh Reconstruction from 2D Hum… ☆28 · May 25, 2023 · Updated 2 years ago
- This repository contains the code to implement the Hierarchical Actor-Critic (HAC) algorithm. ☆269 · May 20, 2020 · Updated 5 years ago
- [CoRL 2020] COG: Connecting New Skills to Past Experience with Offline Reinforcement Learning ☆34 · Oct 28, 2020 · Updated 5 years ago
- [IROS2023] Learning to Solve Tasks with Exploring Prior Behaviours ☆12 · Mar 3, 2024 · Updated last year
- This is the implementation of the visual model mentioned in our paper 'Automated Radiology Report Generation using Conditioned Transforme… ☆10 · Jul 25, 2024 · Updated last year
- Robotics Learning Note ☆11 · Jun 22, 2018 · Updated 7 years ago
- ☆11 · Dec 23, 2024 · Updated last year
- [SIGKDD' 24] PyTorch implementation of Temporal Prototype-Aware Learning for Active Voltage Control on Power Distribution Networks ☆13 · Jul 28, 2024 · Updated last year
- ENriching Health data by ANnotations of Crowd and Experts ☆10 · Sep 13, 2021 · Updated 4 years ago
- Reinforcement Learning Assembly ☆92 · Sep 2, 2021 · Updated 4 years ago
- ☆35 · Oct 13, 2021 · Updated 4 years ago
- reproduce some RL or Multi-Agent models ☆35 · May 22, 2019 · Updated 6 years ago
- This code accompanies "Differentiable probabilistic models of scientific imaging with the Fourier slice theorem", UAI 2019 ☆37 · Jun 26, 2019 · Updated 6 years ago
- Residual Quantization Autoencoder, used for interpreting LLMs ☆13 · Jan 1, 2025 · Updated last year
- PhD Thesis Template with Roboto Font and Color Sections ☆11 · Jul 24, 2020 · Updated 5 years ago
- Official code for the article "Offline Goal-Conditioned Reinforcement Learning for Safety-Critical Tasks with Recovery Policy" ☆8 · Apr 9, 2024 · Updated last year
- Workshop on Text Classification at 1729 Conference ☆13 · Sep 4, 2022 · Updated 3 years ago
- Emergency Vehicle Smart Grid to provide faster movement to emergency vehicles. ☆11 · Dec 12, 2019 · Updated 6 years ago