thaihungle / EPGT
Episodic Policy Gradient Training
☆13Updated 2 years ago
Alternatives and similar repositories for EPGT:
Users that are interested in EPGT are comparing it to the libraries listed below
- ☆11Updated 2 months ago
- Code of Neurocoder paper☆12Updated 2 years ago
- Neural Stored-program Memory☆24Updated 2 years ago
- Uniform Writing & Cached Uniform Writing☆25Updated 5 years ago
- Variational Memory Encoder-Decoder☆30Updated 5 years ago
- Dual Memory Neural Computer☆24Updated 3 years ago
- Self-attentive Associative Memory & SAM-based Two-Memory Model☆54Updated 2 years ago
- Contextual Bandits Action Elimination DQN☆20Updated 6 years ago
- Reward Propagation using Graph Convolutional Networks☆13Updated 3 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 5 years ago
- ☆17Updated 4 years ago
- Generalised UDRL☆37Updated 2 years ago
- Model Primitive Hierarchical Reinforcement Learning☆13Updated 2 years ago
- PlaNet: Learning Latent Dynamics for Planning from Pixels☆10Updated 5 years ago
- Code for the paper Novelty Search in Representational Space for Sample Efficient Exploration presented at NeurIPS 2020.☆14Updated 7 months ago
- Author's PyTorch implementation of SR-DICE for marginalized importance sampling☆15Updated 3 years ago
- PyTorch implementation for "On the Critical Role of Conventions in Adaptive Human-AI Collaboration", ICLR 2021☆16Updated 3 years ago
- Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys…☆20Updated last year
- using information theory to encourage agents to cooperate and compete☆19Updated 6 years ago
- Dataset collection and training code for "Ask Your Humans: Using Human Instructions to Improve Generalization in Reinforcement Learning"☆9Updated 3 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization☆24Updated 4 years ago
- A pytorch implementation of Amortized Stein Variational Gradient Descent/ Stein GAN☆18Updated 6 years ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆20Updated 4 years ago
- 🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693)☆18Updated last year
- [ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning☆33Updated 5 years ago
- Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019)☆23Updated 5 years ago
- Variational Reinforcement Learning☆16Updated 6 months ago