benzyx / DomRLLinks
DomRL is a simulation environment for the card game Dominion, created by Donald X Vaccarino, meant to simplify the development and testing of various AI strategies, specifically Reinforcement Learning algorithms.
☆17Updated 4 years ago
Alternatives and similar repositories for DomRL
Users that are interested in DomRL are comparing it to the libraries listed below
Sorting:
- A working AlphaZero implementation that's simple enough to be able to understand what's going on at a quick glance, without sacrificing t…☆13Updated 2 years ago
- PyTorch interface for TrueGrad Optimizers☆42Updated 2 years ago
- Read Google Cloud Storage, Azure Blobs, and local paths with the same interface☆66Updated this week
- 🃏♠️♥️♦️♣️☆28Updated 5 years ago
- An implementation of AlphaZero and MCTS with neural networks for Tetris☆22Updated 5 months ago
- Python library for argument and configuration management☆55Updated 2 years ago
- PyTorch AlphaZero implementation with multiplayer support [NeurIPS 2019 Deep Reinforcement Learning Workshop]☆34Updated 4 years ago
- Lightweight Cluster/Cloud VM Job Management 🚀☆42Updated last year
- Learning to play Settlers of Catan with Deep RL - custom training environment and implementation of PPO☆86Updated 3 years ago
- Contrastive Language-Image Pretraining☆144Updated 3 years ago
- Automatically take good care of your preemptible TPUs☆36Updated 2 years ago
- Pretrained models for Jax/Haiku; MobileNet, ResNet, VGG, Xception.☆24Updated 3 years ago
- Let's solve the flatland challenge!☆73Updated last year
- Open source code for paper "On the Learning and Learnability of Quasimetrics".☆31Updated 2 years ago
- Code accompanying the paper "A Language Model's Guide Through Latent Space". It contains functionality for training and using concept vec…☆21Updated last year
- Autoregressive transformer in JAX from scratch☆23Updated 3 years ago
- AlphaZero in JAX☆78Updated last year
- NEVIS'22: Benchmarking the next generation of never-ending learners☆102Updated 2 years ago
- Portfolio REgret for Confidence SEquences☆20Updated 8 months ago
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT)☆28Updated 2 years ago
- ☆57Updated 3 years ago
- ☆20Updated 6 years ago
- Experiment. Plot. Tabulate.☆71Updated last year
- fast + parallel AlphaZero in JAX☆98Updated 8 months ago
- Swarm training framework using Haiku + JAX + Ray for layer parallel transformer language models on unreliable, heterogeneous nodes☆241Updated 2 years ago
- Car racing RL agents in actual F1 tracks☆13Updated 10 months ago
- Serialize JAX, Flax, Haiku, or Objax model params with 🤗`safetensors`☆45Updated last year
- Implementation of a Transformer that Ponders, using the scheme from the PonderNet paper☆80Updated 3 years ago
- A small library for creating and manipulating custom JAX Pytree classes☆56Updated 2 years ago
- Experiments to assess SPADE on different LLM pipelines.☆17Updated last year