google / putting-duneLinks
β10Updated last year
Alternatives and similar repositories for putting-dune
Users that are interested in putting-dune are comparing it to the libraries listed below
Sorting:
- Comparison between GFlowNets & Maximum Entropy RLβ19Updated last year
- πͺ The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAXβ58Updated last year
- Neuroevolution is a Competitive Alternative to Reinforcement Learning for Skill Discoveryβ16Updated 2 years ago
- Parallel hyperparameter tuning with JAXβ34Updated last week
- β55Updated 2 years ago
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]β107Updated last year
- Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselinβ¦β55Updated 2 months ago
- Official pytorch implementation for our ICLR 2023 paper "Latent State Marginalization as a Low-cost Approach for Improving Exploration".β24Updated 2 years ago
- Simple tools to mix and match PyTorch and Jax - Get the best of both worlds!β33Updated 2 months ago
- Benchmark for evaluating the generalization capabilities of Multi-Objective Reinforcement Learning (MORL) algorithms.β21Updated last month
- β81Updated 8 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.β35Updated last year
- Accelerated replay buffers in JAXβ43Updated 2 years ago
- Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Functionβ13Updated 2 years ago
- β50Updated 3 years ago
- A PyTorch implementation of a Generative Flow Network (GFlowNet) proposed by Bengio et al. (2021)β42Updated last year
- Contains JAX implementation of algorithms for inverse reinforcement learningβ73Updated 11 months ago
- JAX implementations of core Deep RL algorithmsβ82Updated 3 years ago
- Accelerated minigrid environments with JAXβ139Updated last month
- An Open-Ended Agentic Simulatorβ51Updated 11 months ago
- Building blocks for productive researchβ59Updated 5 months ago
- β82Updated 4 months ago
- β‘ Flashbax: Accelerated Replay Buffers in JAXβ239Updated this week
- GflowNets, MCMC, Metropolis-Hasting, Gibbs sampling, Metropolis-adjusted Langevin, Inverse Transform Sampling, Acceptance-Rejection Methoβ¦β85Updated 2 years ago
- General Modules for JAXβ66Updated 3 months ago
- β44Updated 10 months ago
- Use Jax functions in Pytorchβ248Updated 2 years ago
- Baselines for gymnax π€β67Updated 2 years ago
- SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learningβ17Updated 2 years ago
- An implementation of MuZero in JAX.β56Updated 2 years ago