rllab-snu / tsallis_actor_critic_mujocoLinks
Implementation of Tsallis Actor Critic method
☆61Updated last month
Alternatives and similar repositories for tsallis_actor_critic_mujoco
Users that are interested in tsallis_actor_critic_mujoco are comparing it to the libraries listed below
Sorting:
- ☆30Updated 6 years ago
- ☆20Updated 5 years ago
- Model predictive control under STL constraints☆33Updated last month
- Deep learning tutorials using tensorflow☆22Updated 6 years ago
- Introduction to Deep Reinforcement Learning☆88Updated last month
- Official GitHub Repository for Efficient Off-Policy Safe Reinforcement Learning Using Trust Region Conditional Value At Risk.☆13Updated last month
- Implementation of Learning Instance-Aware Object Detection Using Determinantal Point Processes [https://arxiv.org/pdf/1805.10765.pdf]☆19Updated 2 years ago
- ☆33Updated last month
- ☆12Updated last month
- ☆30Updated last month
- Official GitHub Repository for TRC:Trust Region Conditional Value at Risk for Safe Reinforcement Learning.☆24Updated last month
- Implementation of Deep Elastic Network☆42Updated last month
- List of our studies related to deep learning☆19Updated 6 years ago
- List of our studies related to autonomous driving☆22Updated 3 years ago
- Customisable Unified Physical Simulations (CUPS) for Reinforcement Learning. Experiments run on the ai2thor environment (http://ai2thor.a…☆51Updated 5 years ago
- ☆15Updated 7 years ago
- List of our studies about robotics and computer vision which do not overlap with previous topics☆21Updated 3 years ago
- ☆11Updated last month
- ☆86Updated 4 years ago
- Code for "Divide-and-Conquer Reinforcement Learning"☆61Updated 7 years ago
- ☆13Updated last month
- Official code for the paper "Learning Transition Policies for Composing Complex Skills" (ICLR 2019)☆74Updated 6 years ago
- ☆44Updated 7 years ago
- Code for Environment Probing Interaction Policies [ICLR 2019]☆29Updated 6 years ago
- accompanying code for neurips submission "Goal-conditioned Imitation Learning"☆73Updated 2 years ago
- Code for 'Mapping State Space using Landmarks for Universal Goal Reaching'.☆16Updated 2 years ago
- ☆114Updated 2 years ago
- Implementation of VALOR (Variational Option Discovery Algorithms)☆10Updated 6 years ago
- ☆66Updated 5 years ago
- Hindsight policy gradients☆46Updated 5 years ago