eugene/pommerman

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/eugene/pommerman)

eugene / pommerman

Bomberman deep reinforcement learning challenge in PyTorch

☆27

Alternatives and similar repositories for pommerman

Users that are interested in pommerman are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

BorealisAI / pommerman-baseline
View on GitHub
Code for the paper "Skynet: A Top Deep RL Agent in the Inaugural Pommerman Team Competition"
☆38May 9, 2019Updated 7 years ago
rwightman / pytorch-pommerman-rl
View on GitHub
PyTorch RL for Pommerman
☆39Sep 24, 2018Updated 7 years ago
tambetm / pommerman-baselines
View on GitHub
Some baselines for Pommerman competition
☆46Jul 18, 2018Updated 8 years ago
SICC-Group / DDFG
View on GitHub
This is the official code for our paper entitled "Dynamic Deep Factor Graph for Multi-Agent Reinforcement Learning".
☆10Updated this week
jidiai / Competition_3v3snakes
View on GitHub
☆39Jul 21, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
pucrs-ai-cs / reinforcement
View on GitHub
Reinforcement Learning
☆12Jun 22, 2017Updated 9 years ago
MultiAgentLearning / playground
View on GitHub
PlayGround: AI Research into Multi-Agent Learning.
☆796Dec 19, 2023Updated 2 years ago
gmargo11 / hDQN
View on GitHub
Implementation of Hierarchical Deep Q-Learning (Kulkarni et al., 2016)
☆37May 18, 2019Updated 7 years ago
harish-kamath / rqae
View on GitHub
Residual Quantization Autoencoder, used for interpreting LLMs
☆14Jan 1, 2025Updated last year
ZhuohuiZhang / TGCNet
View on GitHub
This is the official implementation of [AAAI'25 Oral] accepted paper: Bridging Training and Execution via Dynamic Directed Graph-Based Co…
☆17Feb 11, 2025Updated last year
tjuHaoXiaotian / GASIL
View on GitHub
Independent Generative Adversarial Self-Imitation Learning In Cooperative Multiagent Systems
☆32Oct 9, 2018Updated 7 years ago
seolhokim / BipedalWalker-BranchingDQN
View on GitHub
The Easiest Pytorch Implementation of Branching-DQN
☆12Feb 10, 2021Updated 5 years ago
LiZhYun / RL-Plotter-with-Wandb
View on GitHub
A plotter for reinforcement learning (RL) using Weights & Biases
☆14Dec 20, 2023Updated 2 years ago
tqch / poisson-jump
View on GitHub
Official Implementation of Paper "Learning to Jump: Thinning and Thickening Latent Counts for Generative Modeling" (ICML 2023)
☆10Jun 6, 2023Updated 3 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
ArthurConmy / MishformerLens
View on GitHub
MishformerLens intends to be a drop-in replacement for TransformerLens that AST patches HuggingFace Transformers rather than implementing…
☆10Oct 7, 2024Updated last year
0b01 / CommNet
View on GitHub
PyTorch implementation of CommNet
☆37Dec 2, 2017Updated 8 years ago
deep-diver / LLM-Serve
View on GitHub
This repository provides a framework to serve LLM(Large Language Model) based applications such as Chatbot.
☆18Apr 20, 2023Updated 3 years ago
sdpa-python / sdpa-python
View on GitHub
SemiDefinite Programming Algorithm (SDPA) for Python
☆12Jul 1, 2026Updated 3 weeks ago
skku-taehwan / KoreanRecipeGPT
View on GitHub
ratsnlp, KOGPT2와 recipegpt github를 참고하여 음식명과 식재료명을 입력하면 레시피를 생성해주는 모델을 제작하였습니다!!
☆11Dec 28, 2021Updated 4 years ago
c-cube / ocaml-avro
View on GitHub
[DEPRECATED (use avro-simple)] Runtime library and schema compiler for the Avro serialization format.
☆21Jul 7, 2026Updated 3 weeks ago
QDPP-GitHub / QDPP
View on GitHub
Multi-Agent Determinantal Q-Learning
☆43Nov 22, 2022Updated 3 years ago
seanjhardy / HyperLife
View on GitHub
A realtime multicellular organism evolution simulator with Verlet integration
☆12May 30, 2021Updated 5 years ago
jkomiyama / duelingbanditlib
View on GitHub
☆12May 22, 2016Updated 10 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Tim-ats-d / Macron
View on GitHub
A powerful keybind library and daemon for Linux.
☆11Jul 24, 2022Updated 4 years ago
cqian19 / qmix-plus
View on GitHub
Improving upon state of the art cooperative deep reinforcement learning in StarCraft II
☆13May 16, 2019Updated 7 years ago
ZishunYu / Actor-Critic-Alignment
View on GitHub
Implementation of ``Actor-Critic Alignment for Offline-to-Online Reinforcement Learning''
☆13Oct 12, 2023Updated 2 years ago
alec-tschantz / planet
View on GitHub
PlaNet: Learning Latent Dynamics for Planning from Pixels
☆10Feb 13, 2020Updated 6 years ago
lafeychine / scala-native-sfml
View on GitHub
Scala Native 3 bindings for SFML library
☆15Jul 9, 2023Updated 3 years ago
annieyan / Bandits-using-UCB-algorithm
View on GitHub
Thompson Sampling for Bandits using UCB policy
☆10Jul 29, 2017Updated 9 years ago
Nexusphobiker / MHWSaveEditor
View on GitHub
Work in progress save editor for Monster Hunter: World
☆11Aug 15, 2018Updated 7 years ago
jbkjr / train-procgen-pytorch
View on GitHub
Pytorch implementation on OpenAI's Procgen ppo-baseline, built from scratch.
☆14May 17, 2024Updated 2 years ago
mingzhangPHD / transferlearning
View on GitHub
Everything about Transfer Learning and Domain Adaptation--迁移学习
☆10Jun 5, 2019Updated 7 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
xtma / apo
View on GitHub
Average-Reward Reinforcement Learning with Trust Region Methods
☆11Oct 17, 2022Updated 3 years ago
anoulis / Reactive_Traffic_Lights
View on GitHub
Emergency Vehicle Smart Grid to provide faster movement to emergency vehicles.
☆11Dec 12, 2019Updated 6 years ago
nlapier2 / metapheno
View on GitHub
Repository for the T2D/obesity experiments run in the Metapheno paper
☆15Feb 6, 2019Updated 7 years ago
papkov / pommerman-x
View on GitHub
Bombing AI agents
☆12Jun 21, 2018Updated 8 years ago
vwxyzjn / gym-pysc2
View on GitHub
Gym wrapper for pysc2
☆10Sep 16, 2022Updated 3 years ago
cgrivera / ai-arena
View on GitHub
The AI Arena: A framework for distributed multi-agent reinforcement learning
☆14Aug 5, 2022Updated 3 years ago
herbwood / pytorch_faster_r_cnn
View on GitHub
pytorch faster r-cnn
☆11Dec 21, 2020Updated 5 years ago