xingchenwan / bgpbt

[AutoML'22] Bayesian Generational Population-based Training (BG-PBT)

☆28

Alternatives and similar repositories for bgpbt

Users who are interested in bgpbt are comparing it to the libraries listed below.

Kaixhin / GUDRL
Generalised UDRL
☆37 Updated 3 years ago
google-deepmind / active_ops
☆32 Updated 11 months ago
KyunghyunLee / aes-rl
☆17 Updated 4 years ago
frt03 / jax_dt
Minimal Decision Transformer Implementation written in Jax (Flax).
☆17 Updated 2 years ago
quasimetric-learning / torch-quasimetric
PyTorch Package For Quasimetric Learning
☆42 Updated 8 months ago
philipjball / ReadyPolicyOne
🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693)
☆18 Updated 2 years ago
ucl-dark / pax
Scalable Opponent Shaping Experiments in JAX
☆24 Updated last year
vwxyzjn / a2c_is_a_special_case_of_ppo
A2C is a special case of PPO!
☆22 Updated 3 years ago
icaros-usc / dqd-rl
Official implementation of "Approximating Gradients for Differentiable Quality Diversity in Reinforcement Learning"
☆20 Updated 2 years ago
facebookresearch / ssorl
Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories
☆42 Updated last year
TrentBrick / RewardConditionedUDRL
Open source code combining implementations of Upside Down Reinforcement Learning and Reward Conditioned Policies
☆18 Updated 4 years ago
holarissun / RewardShifting
Code for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL
☆30 Updated last year
ElisevanderPol / symmetrizer
☆31 Updated 4 years ago
adaptive-intelligent-robotics / QDAC
Repository for "Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics" …
☆16 Updated last year
subho406 / Recurrent-PPO-Jax
Implementation of Proximal Policy Optimization in Jax+Flax
☆19 Updated 2 years ago
uber-research / D3G
Estimating Q(s,s') with Deep Deterministic Dynamics Gradients
☆32 Updated 5 years ago
EmptyJackson / groove
Official implementation of the NeurIPS 2023 paper "Discovering General Reinforcement Learning Algorithms with Adversarial Environment Des…
☆30 Updated last year
ben-eysenbach / info_geometry
Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"
☆20 Updated 3 years ago
facebookresearch / how-to-autorl
Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to e…
☆80 Updated last year
ben-eysenbach / mnm
Code to accompany the paper "Mismatched No More: Joint Model-Policy Optimization for Model-Based RL"
☆20 Updated 3 years ago
ucl-dark / skillhack
SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning
☆17 Updated 2 years ago
montrealrobotics / iv_rl
IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation
☆39 Updated 8 months ago
instadeepai / fastpbrl
Vectorization techniques for fast population-based training.
☆56 Updated 2 years ago
younggyoseo / trajectory_mcl
Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning (NeurIPS 2020)
☆39 Updated 4 years ago
epignatelli / discovering-reinforcement-learning-algorithms
A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…
☆21 Updated 4 years ago
schmidtdominik / Rainbow
Rainbow DQN implementation accompanying the paper "Fast and Data-Efficient Training of Rainbow" which reaches 205.7 median HNS after 10M …
☆45 Updated 3 years ago
microsoft / segar
Sandbox environment for generalizable agent research
☆25 Updated 2 years ago
nmonette / NCC-UED
Official Implementation of `An Optimisation Framework for Unsupervised Environment Design` from RLC 2025
☆12 Updated last month
zdhNarsil / Stochastic-Marginal-Actor-Critic
Official pytorch implementation for our ICLR 2023 paper "Latent State Marginalization as a Low-cost Approach for Improving Exploration".
☆24 Updated 2 years ago
hr0nix / dejax
Accelerated replay buffers in JAX
☆41 Updated 2 years ago