Egiob / cfrx
cfrx is a collection of algorithms and tools for hardware-accelerated Counterfactual Regret Minimization (CFR) algorithms in Jax.
☆37 · Updated last year
Alternatives and similar repositories for cfrx
Users that are interested in cfrx are comparing it to the libraries listed below
- Vectorization techniques for fast population-based training. ☆57 · Updated 3 years ago
- The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX ☆61 · Updated 2 years ago
- Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch ☆56 · Updated 2 years ago
- A project that provides help for using DeepMind's mctx on gym-style environments. ☆64 · Updated last year
- A collection of matrix games in JAX ☆13 · Updated last year
- ☆90 · Updated last year
- Accelerated minigrid environments with JAX ☆156 · Updated 3 months ago
- Challenging Memory-based Deep Reinforcement Learning Agents ☆109 · Updated last year
- Accelerated replay buffers in JAX ☆46 · Updated 3 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL ☆122 · Updated last year
- Hardware-Accelerated Reinforcement Learning Algorithms in pure Jax! ☆258 · Updated 3 months ago
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy ☆22 · Updated last year
- A collection of RL algorithms written in JAX. ☆104 · Updated 3 years ago
- ⚡ Flashbax: Accelerated Replay Buffers in JAX ☆269 · Updated 4 months ago
- ☆91 · Updated last week
- Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselin… ☆62 · Updated 3 weeks ago
- Baselines for gymnax ☆74 · Updated 2 years ago
- Rainbow DQN implementation accompanying the paper "Fast and Data-Efficient Training of Rainbow" which reaches 205.7 median HNS after 10M … ☆44 · Updated 4 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights… ☆59 · Updated 3 years ago
- Simple single-file baselines for Q-Learning in pure-GPU setting ☆233 · Updated 2 months ago
- JAX implementations of core Deep RL algorithms ☆82 · Updated 3 years ago
- Code for magnetic mirror descent. ☆16 · Updated 2 years ago
- Code for Discovered Policy Optimisation (NeurIPS 2022) ☆12 · Updated 2 years ago
- Scalable Opponent Shaping Experiments in JAX ☆25 · Updated last year
- General Modules for JAX ☆72 · Updated 4 months ago
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)] ☆112 · Updated 2 years ago
- ☆325 · Updated last year
- fast + parallel AlphaZero in JAX ☆109 · Updated last year
- Standard interface for entity based reinforcement learning environments. ☆38 · Updated last year
- Scalable implementation of DREAM - Deep RL for multi-agent imperfect information games ☆119 · Updated last year