yangkevin2 / neurips2021-lap3
☆17 Updated 2 years ago
Alternatives and similar repositories for neurips2021-lap3:
Users who are interested in neurips2021-lap3 are comparing it to the repositories listed below.
- Variational Reinforcement Learning ☆16 Updated 5 months ago
- Generalised UDRL ☆37 Updated 2 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients ☆32 Updated 4 years ago
- ☆19 Updated 3 years ago
- Implementation of Wasserstein Natural Policy Gradients and Wasserstein Natural Evolution Strategies ☆10 Updated 3 years ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a… ☆20 Updated 4 years ago
- Implicit Normalizing Flows + Reinforcement Learning ☆60 Updated 5 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization ☆24 Updated 4 years ago
- An implementation of MuZero in JAX. ☆54 Updated 2 years ago
- Official implementation of "Approximating Gradients for Differentiable Quality Diversity in Reinforcement Learning" ☆19 Updated 2 years ago
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation" ☆43 Updated 3 years ago
- 🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693) ☆18 Updated last year
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation ☆38 Updated 2 months ago
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction" ☆44 Updated last year
- RL agent to play μRTS with Stable-Baselines3 and PyTorch ☆26 Updated 2 years ago
- Official implementation of the NeurIPS 2023 paper "Discovering General Reinforcement Learning Algorithms with Adversarial Environment Des… ☆24 Updated 6 months ago
- Official pytorch implementation for our ICLR 2023 paper "Latent State Marginalization as a Low-cost Approach for Improving Exploration". ☆24 Updated last year
- Code for Optimistic Exploration even with a Pessimistic Initialisation ☆14 Updated 4 years ago
- 📴 OffCon^3: SOTA PyTorch SAC and TD3 Implementations (arxiv: 2101.11331) ☆24 Updated 3 years ago
- Neural Fixed-Point Acceleration for Convex Optimization ☆29 Updated 2 years ago
- Model-Based Reinforcement Learning via Latent-Space Collocation. ☆32 Updated last year
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT) ☆26 Updated 2 years ago
- Experiment for Understanding the Effects of Dataset Characteristics on Offline Reinforcement Learning ☆24 Updated 2 years ago
- Vectorization techniques for fast population-based training. ☆55 Updated 2 years ago
- Official codebase for Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings. ☆21 Updated 3 years ago
- ☆16 Updated 3 years ago
- Code to reproduce results on toy tasks and companion blog for the paper. ☆20 Updated 2 years ago
- RAD: Reinforcement Learning with Augmented Data (code for procgen experiments) ☆18 Updated 3 years ago
- Revisiting Peng's Q(lambda) for Modern Reinforcement Learning ☆16 Updated 3 years ago
- Clockwork VAEs in JAX/Flax ☆32 Updated 3 years ago