TensorFlow implementation of "Playing hard exploration games by watching YouTube"
☆39Sep 15, 2019Updated 6 years ago
Alternatives and similar repositories for HardRLWithYoutube
Users that are interested in HardRLWithYoutube are comparing it to the libraries listed below.
- [WIP] Playing Hard Exploration Games by Watching YouTube (Aytar et al., 2018)☆12Jan 31, 2019Updated 7 years ago
- Implementation of Relational Deep Reinforcement Learning☆25Jan 31, 2020Updated 6 years ago
- Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning (NeurIPS 2020)☆39Oct 27, 2020Updated 5 years ago
- ☆13Mar 26, 2019Updated 7 years ago
- Code accompanying "Learning What To Do by Simulating the Past", ICLR 2021.☆27May 4, 2021Updated 4 years ago
- Decoupling Dynamics and Reward for Transfer Learning☆16Sep 7, 2018Updated 7 years ago
- 3rd placed submission to the NeurIPS MineRL competition 2019☆10Mar 24, 2023Updated 3 years ago
- ☆10Jul 20, 2023Updated 2 years ago
- Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted)☆33Sep 25, 2023Updated 2 years ago
- [IJCAI'20][ICLR'19 Workshop] Flow-based Intrinsic Curiosity Module. Playing SuperMario with RL agent and FICM!☆104Dec 8, 2022Updated 3 years ago
- Code for the Reset-free Trial and Error learning paper (RTE) experiments☆10Jan 3, 2018Updated 8 years ago
- The repository is for Reinforcement-Learning Uncertainty research, in which we investigate various uncertain factors in RL.☆23Jun 16, 2023Updated 2 years ago
- Official codebase for Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings.☆21Mar 5, 2021Updated 5 years ago
- Fast asynchronous GPU monitoring tool across multiple machines through SSH☆11Nov 26, 2024Updated last year
- On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning☆16Apr 30, 2023Updated 2 years ago
- ☆40Oct 30, 2021Updated 4 years ago
- Controllability-Aware Unsupervised Skill Discovery (ICML 2023)☆28Jun 3, 2023Updated 2 years ago
- ☆29Apr 16, 2021Updated 5 years ago
- Code companion of Multi-task Learning for Aggregated Data using Gaussian Processes paper☆11Apr 6, 2020Updated 6 years ago
- Companion code to CoRL 2018 paper: E Bıyık, D Sadigh. "Batch Active Preference-Based Learning of Reward Functions". Conference on Robot L…☆30May 29, 2019Updated 6 years ago
- ☆12Aug 28, 2020Updated 5 years ago
- An attempt to reverse engineer custom file formats used by the game Outlaws from LucasArts.☆16Nov 3, 2018Updated 7 years ago
- LLM benchmarks☆13Feb 22, 2024Updated 2 years ago
- RL Recommendation System☆13Aug 30, 2019Updated 6 years ago
- 2019 Tencent Advertising Algorithm Competition, rank 68☆14Jun 14, 2019Updated 6 years ago
- This project contains several deep reinforcement learning methods and some experiments based on OpenAI Gym.☆19Jan 28, 2018Updated 8 years ago
- Exploration by Random Network Distillation☆15Dec 30, 2018Updated 7 years ago
- Explore the potential of recommendation system using reinforcement learning☆15Apr 23, 2020Updated 5 years ago
- PPDai "Magic Mirror Cup" Risk Control Competition☆12Dec 22, 2016Updated 9 years ago
- PyTorch implementation of both discrete and continuous ACER☆25Jan 27, 2019Updated 7 years ago
- Searching for a Strategy: Modelling Player Trajectories in Soccer Games using Social LSTM☆16Dec 20, 2017Updated 8 years ago
- Linear Algebra for Machine Learning Book Exercises☆13May 19, 2019Updated 6 years ago
- ☆39Aug 25, 2025Updated 7 months ago
- Adding Dreamer-v3's implementation tricks to CleanRL's PPO☆14May 19, 2023Updated 2 years ago
- Source code for "Multi-objective Model-based Policy Search for Data-efficient Learning with Sparse Rewards" (CoRL 2018)☆13Oct 8, 2018Updated 7 years ago
- Python Library for Dynamic Movement Primitives with Reinforcement Learning☆14Jun 21, 2022Updated 3 years ago
- Source code for 'Real-Time Web Application Development' by Rami Vemula☆12Dec 18, 2017Updated 8 years ago
- ☆14May 31, 2022Updated 3 years ago
- Spectral Method for Multiple Experts Inverse Reinforcement Learning☆14Sep 6, 2014Updated 11 years ago