benzyx / DomRL
DomRL is a simulation environment for Dominion, the card game created by Donald X. Vaccarino. It is meant to simplify the development and testing of AI strategies, specifically reinforcement learning algorithms.
☆17 Updated 4 years ago
Alternatives and similar repositories for DomRL:
Users who are interested in DomRL are comparing it to the libraries listed below.
- Opinionated library for managing hyperparameters and mutable state of machine learning training systems. ☆19 Updated last year
- A working AlphaZero implementation that's simple enough to be able to understand what's going on at a quick glance, without sacrificing t… ☆13 Updated 2 years ago
- Portfolio REgret for Confidence SEquences ☆14 Updated 3 months ago
- fast + parallel AlphaZero in JAX ☆94 Updated 3 months ago
- Accelerated replay buffers in JAX ☆41 Updated 2 years ago
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT) ☆28 Updated 2 years ago
- Image augmentation library for Jax ☆39 Updated 11 months ago
- AlphaZero in JAX ☆77 Updated last year
- Code to accompany "Human-Level Performance in No-Press Diplomacy via Equilibrium Search", published at ICLR 2021 ☆46 Updated 2 years ago
- Code for "Meta Learning Backpropagation And Improving It" @ NeurIPS 2021 https://arxiv.org/abs/2012.14905 ☆31 Updated 3 years ago
- ☆20 Updated 5 years ago
- A port of muP to JAX/Haiku ☆25 Updated 2 years ago
- Scaling scaling laws with board games. ☆48 Updated last year
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm… ☆40 Updated 2 years ago
- ☆8 Updated 2 years ago
- Gym environment for playing Wordle with RL agents ☆39 Updated 3 years ago
- The code used to power DeepRole ☆35 Updated 2 years ago
- Simple (fast) transformer inference in PyTorch with torch.compile + lit-llama code ☆10 Updated last year
- An implementation of MuZero in JAX. ☆56 Updated 2 years ago
- ☆24 Updated 6 years ago
- PyTorch interface for TrueGrad Optimizers ☆42 Updated last year
- An Open-Ended Agentic Simulator ☆45 Updated 7 months ago
- Python library for argument and configuration management ☆54 Updated 2 years ago
- Fast Discounted Cumulative Sums in PyTorch ☆95 Updated 3 years ago
- This repository contains the Julia code for the paper "Competitive Gradient Descent" ☆23 Updated 5 years ago
- Learning to play Settlers of Catan with Deep RL - custom training environment and implementation of PPO ☆86 Updated 2 years ago
- flexible meta-learning in jax ☆12 Updated last year
- GPT implementation in Flax ☆18 Updated 3 years ago
- Research code implementing the search AI agent for Hanabi, as well as a web server so people can play against it ☆128 Updated last year
- Implementations and checkpoints for ResNet, Wide ResNet, ResNeXt, ResNet-D, and ResNeSt in JAX (Flax). ☆108 Updated 2 years ago