Egiob / cfrxLinks
cfrx is a collection of algorithms and tools for hardware-accelerated Counterfactual Regret Minimization (CFR) algorithms in Jax.
β35Updated last year
Alternatives and similar repositories for cfrx
Users that are interested in cfrx are comparing it to the libraries listed below
Sorting:
- Baselines for gymnax π€β72Updated 2 years ago
- Vectorization techniques for fast population-based training.β56Updated 3 years ago
- A project that provides help for using DeepMind's mctx on gym-style environments.β63Updated 11 months ago
- β86Updated last year
- Accelerated replay buffers in JAXβ43Updated 3 years ago
- πͺ The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAXβ60Updated 2 years ago
- β87Updated 2 months ago
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancyβ21Updated last year
- Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselinβ¦β61Updated last month
- Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorchβ53Updated 2 years ago
- A collection of matrix games in JAXβ12Updated 11 months ago
- JAX implementations of core Deep RL algorithmsβ82Updated 3 years ago
- Accelerated minigrid environments with JAXβ151Updated 2 weeks ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weightsβ¦β59Updated 3 years ago
- A collection of RL algorithms written in JAX.β104Updated 3 years ago
- An Open-Ended Agentic Simulatorβ52Updated last year
- General Modules for JAXβ69Updated last month
- Simple single-file baselines for Q-Learning in pure-GPU settingβ188Updated 7 months ago
- β‘ Flashbax: Accelerated Replay Buffers in JAXβ258Updated last month
- Code for Discovered Policy Optimisation (NeurIPS 2022)β12Updated 2 years ago
- Hardware-Accelerated Reinforcement Learning Algorithms in pure Jax!β243Updated last week
- An implementation of MuZero in JAX.β57Updated 3 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRLβ117Updated last year
- Standard interface for entity based reinforcement learning environments.β38Updated last year
- Scalable Opponent Shaping Experiments in JAXβ24Updated last year
- JAX implementations of various deep reinforcement learning algorithms.β25Updated 9 months ago
- Code for magnetic mirror descent.β16Updated 2 years ago
- Rainbow DQN implementation accompanying the paper "Fast and Data-Efficient Training of Rainbow" which reaches 205.7 median HNS after 10M β¦β44Updated 3 years ago
- Contains JAX implementation of algorithms for inverse reinforcement learningβ72Updated last year
- Implementations of robust Dual Curriculum Design (DCD) algorithms for unsupervised environment design.β135Updated last year