Egiob / cfrx
cfrx is a collection of algorithms and tools for hardware-accelerated Counterfactual Regret Minimization (CFR) in JAX.
☆37 Updated last year
Alternatives and similar repositories for cfrx
Users who are interested in cfrx are comparing it to the libraries listed below.
- A project that provides help for using DeepMind's mctx on gym-style environments. ☆64 Updated last year
- Baselines for gymnax 🤖 ☆74 Updated 2 years ago
- Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch ☆56 Updated 2 years ago
- A collection of matrix games in JAX ☆13 Updated last year
- Vectorization techniques for fast population-based training. ☆57 Updated 3 years ago
- ⚡ Flashbax: Accelerated Replay Buffers in JAX ☆270 Updated 4 months ago
- Accelerated minigrid environments with JAX ☆156 Updated 3 months ago
- Accelerated replay buffers in JAX ☆46 Updated 3 years ago
- ☆91 Updated 3 weeks ago
- A collection of RL algorithms written in JAX. ☆104 Updated 3 years ago
- Scalable implementation of DREAM - Deep RL for multi-agent imperfect information games ☆119 Updated last year
- ☆90 Updated last year
- General Modules for JAX ☆72 Updated 4 months ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights… ☆59 Updated 3 years ago
- 🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX ☆61 Updated 2 years ago
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)] ☆112 Updated 2 years ago
- JAX implementations of core Deep RL algorithms ☆83 Updated 3 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL ☆122 Updated last year
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm… ☆43 Updated 3 years ago
- JAX implementation of RL algorithms and vectorized environments ☆51 Updated 2 years ago
- Implementations of robust Dual Curriculum Design (DCD) algorithms for unsupervised environment design. ☆138 Updated last year
- Challenging Memory-based Deep Reinforcement Learning Agents ☆109 Updated last year
- Code for Discovered Policy Optimisation (NeurIPS 2022) ☆12 Updated 2 years ago
- An implementation of MuZero in JAX. ☆57 Updated 3 years ago
- fast + parallel AlphaZero in JAX ☆109 Updated last year
- Simple single-file baselines for Q-Learning in pure-GPU setting ☆233 Updated 2 months ago
- Hardware-Accelerated Reinforcement Learning Algorithms in pure Jax! ☆260 Updated 3 months ago
- A C++ pytorch implementation of MuZero ☆40 Updated last year
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy ☆23 Updated last year
- Partially Observable Process Gym ☆212 Updated 8 months ago