TensorFlow implementation of "Playing hard exploration games by watching YouTube"
☆39Sep 15, 2019Updated 6 years ago
Alternatives and similar repositories for HardRLWithYoutube
Users that are interested in HardRLWithYoutube are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [WIP] Playing Hard Exploration Games by Watching YouTube (Aytar et al., 2018)☆12Jan 31, 2019Updated 7 years ago
- Implementation of Relational Deep Reinforcement Learning☆25Jan 31, 2020Updated 6 years ago
- Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning (NeurIPS 2020)☆39Oct 27, 2020Updated 5 years ago
- A game for experimenting with sensorimotor AI.☆16May 9, 2014Updated 11 years ago
- Implement Conditional VAE and train on MNIST by tensorflow 1.3.0.☆10Nov 7, 2017Updated 8 years ago
- ☆13Mar 26, 2019Updated 6 years ago
- JAX Scalify: end-to-end scaled arithmetics☆18Oct 30, 2024Updated last year
- MicroPython STM Read Protection Module☆12Nov 5, 2014Updated 11 years ago
- ☆10Sep 20, 2018Updated 7 years ago
- Deep reinforcement learning baselines base on OpenAI. More algorithms are included, such as Rainbow: Combining Improvements in Deep Rei…☆35Aug 23, 2018Updated 7 years ago
- Code accompanying "Learning What To Do by Simulating the Past", ICLR 2021.☆27May 4, 2021Updated 4 years ago
- Decoupling Dynamics and Reward for Transfer Learning☆16Sep 7, 2018Updated 7 years ago
- 3rd placed submission to the NeurIPS MineRL competition 2019☆10Mar 24, 2023Updated 2 years ago
- ☆10Jul 20, 2023Updated 2 years ago
- Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted)☆33Sep 25, 2023Updated 2 years ago
- Visualizing the learned space-time attention using Attention Rollout☆40Apr 1, 2022Updated 3 years ago
- [IJCAI'20][ICLR'19 Workshop] Flow-based Intrinsic Curiosity Module. Playing SuperMario with RL agent and FICM!☆104Dec 8, 2022Updated 3 years ago
- The repository is for Reinforcement-Learning Uncertainty research, in which we investigate various uncertain factors in RL.☆23Jun 16, 2023Updated 2 years ago
- Experiments to train transformer network to master reinforcement learning environments.☆32Mar 14, 2021Updated 5 years ago
- Fast asynchronous GPU monitoring tool across multiple machines through SSH☆11Nov 26, 2024Updated last year
- On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning☆16Apr 30, 2023Updated 2 years ago
- ☆40Oct 30, 2021Updated 4 years ago
- Controllability-Aware Unsupervised Skill Discovery (ICML 2023)☆28Jun 3, 2023Updated 2 years ago
- ☆29Apr 16, 2021Updated 4 years ago
- Public Repo for the paper "Overcoming The Spectral-Bias of Neural Value Approximation"☆11May 25, 2024Updated last year
- ☆11Sep 29, 2021Updated 4 years ago
- pix2pix and Cycle GAN architectures for image style transfer☆13May 27, 2021Updated 4 years ago
- Companion code to CoRL 2018 paper: E Bıyık, D Sadigh. "Batch Active Preference-Based Learning of Reward Functions". Conference on Robot L…☆30May 29, 2019Updated 6 years ago
- An attempt to reverse engineer custom file formats used by the game Outlaws from LucasArts.☆16Nov 3, 2018Updated 7 years ago
- ☆12Aug 28, 2020Updated 5 years ago
- Indexing framework designed for the automated creation of structured knowledge bases in Azure AI Search☆14Jun 18, 2025Updated 9 months ago
- LLM benchmarks☆13Feb 22, 2024Updated 2 years ago
- Exploration by Random Network Distillation☆15Dec 30, 2018Updated 7 years ago
- MaxSum is an algorithm about Distributed Constraint Optimization Problems (DCOPs)☆11Jan 15, 2018Updated 8 years ago
- [ICLR 2023] Choreographer: a world-model-based agent that discovers and learns unsupervised skills in latent imagination, and it's able t…☆42Jun 18, 2024Updated last year
- Integrating opencv with mujoco.☆11Mar 25, 2025Updated 11 months ago
- Introduction to Gaussian Processes☆11Jan 13, 2024Updated 2 years ago
- PyTorch implementation of both discrete and continuous ACER☆25Jan 27, 2019Updated 7 years ago
- Hands-On Reinforcement Learning with TensorFlow & TRFL☆14Jan 18, 2021Updated 5 years ago