google / putting-duneLinks
β10Updated last year
Alternatives and similar repositories for putting-dune
Users that are interested in putting-dune are comparing it to the libraries listed below
Sorting:
- Comparison between GFlowNets & Maximum Entropy RLβ19Updated last year
- πͺ The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAXβ60Updated 2 years ago
- Accelerated minigrid environments with JAXβ152Updated last month
- The simplest, fastest repository for training/finetuning medium-sized GPTs.β37Updated last year
- Neuroevolution is a Competitive Alternative to Reinforcement Learning for Skill Discoveryβ16Updated 2 years ago
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]β110Updated last year
- Parallel hyperparameter tuning with JAXβ36Updated 4 months ago
- β57Updated 3 years ago
- Baselines for gymnax π€β72Updated 2 years ago
- Use Jax functions in Pytorchβ255Updated 2 years ago
- β87Updated last year
- Contains JAX implementation of algorithms for inverse reinforcement learningβ73Updated last year
- Benchmark for evaluating the generalization capabilities of Multi-Objective Reinforcement Learning (MORL) algorithms.β25Updated 5 months ago
- Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Functionβ13Updated 3 years ago
- Accelerated replay buffers in JAXβ44Updated 3 years ago
- An Open-Ended Agentic Simulatorβ54Updated last year
- A collection of matrix games in JAXβ12Updated 11 months ago
- Standardized Minecraft Diamond Environment for Reinforcement Learningβ31Updated 2 years ago
- Equivariant Steerable CNNs Library for Pytorch https://quva-lab.github.io/escnn/β31Updated 2 years ago
- GflowNets, MCMC, Metropolis-Hasting, Gibbs sampling, Metropolis-adjusted Langevin, Inverse Transform Sampling, Acceptance-Rejection Methoβ¦β86Updated 2 years ago
- β‘ Flashbax: Accelerated Replay Buffers in JAXβ259Updated 2 months ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weightsβ¦β59Updated 3 years ago
- SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learningβ17Updated 3 years ago
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancyβ21Updated last year
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRLβ117Updated last year
- Implementations of robust Dual Curriculum Design (DCD) algorithms for unsupervised environment design.β136Updated last year
- β46Updated last year
- β53Updated 3 years ago
- Official implementation of the NeurIPS 2023 paper "Discovering General Reinforcement Learning Algorithms with Adversarial Environment Desβ¦β33Updated last year
- Official pytorch implementation for our ICLR 2023 paper "Latent State Marginalization as a Low-cost Approach for Improving Exploration".β24Updated 2 years ago