hsvgbkhgbv / Thermostat-assisted-continuously-tempered-Hamiltonian-Monte-Carlo-for-Bayesian-learning
Thermostat-assisted continuously-tempered Hamiltonian Monte Carlo for Bayesian learning
☆10 · Updated 5 years ago
Related projects
Alternatives and complementary repositories for Thermostat-assisted-continuously-tempered-Hamiltonian-Monte-Carlo-for-Bayesian-learning
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization ☆24 · Updated 4 years ago
- ☆14 · Updated 4 years ago
- implementation of Wasserstein Natural Policy Gradients and Wasserstein Natural Evolution Strategies ☆10 · Updated 3 years ago
- ☆44 · Updated 2 years ago
- on-policy optimization baselines for deep reinforcement learning ☆28 · Updated 4 years ago
- ☆13 · Updated 5 years ago
- ☆30 · Updated last year
- Source for the sample efficient tabular RL submission to the 2019 NIPS workshop on Biological and Artificial RL ☆23 · Updated 2 years ago
- Implementation of "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update", NeurIPS 2019. ☆16 · Updated 5 years ago
- Meta-Learning Acquisition Functions for Transfer Learning in Bayesian Optimization ☆35 · Updated 4 years ago
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations ☆48 · Updated 2 years ago
- Reinforcement Learning with Convex Constraints ☆14 · Updated 2 years ago
- ☆29 · Updated 3 years ago
- Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm ☆44 · Updated 6 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization ☆31 · Updated 3 years ago
- ☆28 · Updated last year
- Implementation of the Prioritized Option-Critic on the Four-Rooms Environment ☆15 · Updated 6 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients ☆32 · Updated 4 years ago
- ☆85 · Updated 3 months ago
- Code for Expert Supervised Reinforcement Learning ☆10 · Updated 3 years ago
- Reward shaping approach for instruction following settings, leveraging language at multiple levels of abstraction. ☆19 · Updated 3 years ago
- Disagreement-Regularized Imitation Learning ☆30 · Updated 3 years ago
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems" ☆54 · Updated 4 months ago
- Random parameter environments using gym 0.7.4 and mujoco-py 0.5.7 ☆20 · Updated 5 years ago
- ☆11 · Updated 5 years ago
- Representation Learning in RL ☆16 · Updated 2 years ago
- ☆36 · Updated last year
- OPE Tools based on Empirical Study of Off Policy Policy Estimation paper. ☆61 · Updated 2 years ago
- Official pytorch implementation for our ICLR 2023 paper "Latent State Marginalization as a Low-cost Approach for Improving Exploration". ☆24 · Updated last year