kachayev / pyage2Links
"Age of Empires II" Learning Environment
β72Updated 3 years ago
Alternatives and similar repositories for pyage2
Users that are interested in pyage2 are comparing it to the libraries listed below
Sorting:
- Standard interface for entity based reinforcement learning environments.β38Updated last year
- Baselines for gymnax π€β67Updated 2 years ago
- Experimentation with Regularized Nash Dynamics on a GPU accelerated gameβ47Updated 2 years ago
- A grid-world game engine for game AI researchβ243Updated last year
- Accelerated minigrid environments with JAXβ139Updated 2 weeks ago
- A collection of matrix games in JAXβ11Updated 6 months ago
- Fast and procedurally generated side-scroller-game-like graphical environments (formerly Procgen)β29Updated last year
- Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorchβ51Updated 2 years ago
- cfrx is a collection of algorithms and tools for hardware-accelerated Counterfactual Regret Minimization (CFR) algorithms in Jax.β32Updated 10 months ago
- Simple JAX Graphics Library.β36Updated 7 months ago
- Challenging Memory-based Deep Reinforcement Learning Agentsβ100Updated 7 months ago
- Efficient baselines for autocurricula in JAX.β190Updated 10 months ago
- β82Updated 3 months ago
- A simple and highly efficient RTS-game-inspired environment for reinforcement learning (formerly Gym-MicroRTS)β257Updated 11 months ago
- MiniZero: An AlphaZero and MuZero Training Frameworkβ94Updated 4 months ago
- Vectorization techniques for fast population-based training.β56Updated 2 years ago
- A categorised list of Multi-Agent Reinforcemnt Learning (MARL) papersβ52Updated 2 years ago
- The source code for the gym-microrts paper.β42Updated 2 years ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environmβ¦β41Updated 2 years ago
- A project that provides help for using DeepMind's mctx on gym-style environments.β60Updated 7 months ago
- A toolkit for practical Human-AI cooperation researchβ14Updated last year
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRLβ113Updated 10 months ago
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancyβ18Updated 7 months ago
- β51Updated 2 years ago
- Official repo for the E3B algorithm described in the paper "Exploration via Elliptical Episodic Bonuses".β85Updated last year
- Accelerated replay buffers in JAXβ41Updated 2 years ago
- Rainbow DQN implementation accompanying the paper "Fast and Data-Efficient Training of Rainbow" which reaches 205.7 median HNS after 10M β¦β45Updated 3 years ago
- Gymnasium extension for DarkSouls III, Elden Ring, and other Souls gamesβ133Updated 8 months ago
- An Open-Ended Agentic Simulatorβ50Updated 10 months ago
- Code for Discovered Policy Optimisation (NeurIPS 2022)β11Updated 2 years ago