yhyu13 / C51-DDPGView external linksLinks
This is a TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning.(C51-DDPG)
☆11Sep 14, 2017Updated 8 years ago
Alternatives and similar repositories for C51-DDPG
Users that are interested in C51-DDPG are comparing it to the libraries listed below
Sorting:
- Pytorch Implementation of Proximal Policy Optimization Algorithm☆20Mar 7, 2018Updated 7 years ago
- Implementation of A Distributional Perspective on Reinforcement Learning☆35Aug 1, 2017Updated 8 years ago
- A field war simulator in console with C language, inspiring from BattleField 1.☆12May 2, 2022Updated 3 years ago
- DHER: Hindsight Experience Replay for Dynamic Goals (ICLR-2019)☆65Nov 8, 2019Updated 6 years ago
- Public data set from the Tempe campus of Arizona State University☆18Sep 10, 2024Updated last year
- Implementation of 'A Distributional Perspective on Reinforcement Learning' and 'Distributional Reinforcement Learning with Quantile Regre…☆133May 5, 2019Updated 6 years ago
- Energy-Based Hindsight Experience Prioritization (CoRL 2018) Oral presentation (7%)☆35Nov 28, 2018Updated 7 years ago
- Short-Term Probabilistic Load Forecasting at Low Aggregation Levels Using Convolutional Neural Networks☆10Sep 19, 2019Updated 6 years ago
- Implementation of Continuous Control RL Algorithms☆11Dec 8, 2022Updated 3 years ago
- ☆11Jun 1, 2017Updated 8 years ago
- ☆10Oct 11, 2022Updated 3 years ago
- Contains implementation of the DoubIL and ResiduIL algorithms from the ICML '22 paper Causal Imitation Learning under Temporally Correlat…☆11Dec 9, 2022Updated 3 years ago
- LEAP is a novel tool for discovering latent temporal causal relations.☆17Oct 18, 2021Updated 4 years ago
- Reinforcement Learning for robotics continuous control, mainly based on Proximal Policy Optimization, extending to Interpolated Policy Gr…☆38Feb 5, 2019Updated 7 years ago
- implement of prioritized experience replay☆159Aug 20, 2018Updated 7 years ago
- Framework for building VulkanScenGraph related projects together☆15Oct 7, 2024Updated last year
- A proxy for reverse engineering a communication protocol☆10Jan 17, 2021Updated 5 years ago
- FNV hash collision generator☆12Mar 2, 2017Updated 8 years ago
- A simple useful tutorial of boost☆12May 23, 2017Updated 8 years ago
- Task Success is not Enough: Investigating the Use of Video-Language Models as Behavior Critics for Catching Undesirable Agent Behaviors☆12Aug 11, 2024Updated last year
- A demo project of using ChatGPT to create Slate UI with TAPython in Unreal Engine 5. TAPython uses JSON for the user interface, which i…☆17Dec 30, 2023Updated 2 years ago
- 🤖 Implementation of Self Normalizing Networks (SNN) in PyTorch.☆12Jun 19, 2017Updated 8 years ago
- ☆12Jun 22, 2023Updated 2 years ago
- Repository for my studies of Causal Inference☆10Dec 1, 2019Updated 6 years ago
- Neural machine translation with Recurrent Deterministic Policy Gradient☆10Aug 18, 2016Updated 9 years ago
- Software and hardware electronic project around controlling flip-dot matrix displays.☆10May 31, 2019Updated 6 years ago
- Exploring the use of options in creating small worlds for faster learning in RL Domains☆16Jan 23, 2012Updated 14 years ago
- Avoiding catastrophic failures in reinforcement learning by learning to shape rewards.☆10Nov 13, 2017Updated 8 years ago
- Official implementation of GLSO: Robot Design Automation (CoRL 2022)☆11Sep 21, 2022Updated 3 years ago
- Stochastic Machines for Unsupervised Learning implemented in Pytorch.☆10Sep 3, 2017Updated 8 years ago
- DTLC-GAN Tensorflow☆12Aug 29, 2018Updated 7 years ago
- ☆12Mar 21, 2024Updated last year
- Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method.☆11Jun 12, 2019Updated 6 years ago
- Implementation of Receding Horizon Curiosity Algrithm☆13Mar 24, 2023Updated 2 years ago
- ☆10Sep 21, 2021Updated 4 years ago
- ☆16Apr 28, 2023Updated 2 years ago
- Searching for a Strategy: Modelling Player Trajectories in Soccer Games using Social LSTM☆16Dec 20, 2017Updated 8 years ago
- ☆11Dec 16, 2022Updated 3 years ago
- Probability [Instructor: Parsiad Azimzadeh] (University of Michigan, Winter 2018)☆13Aug 7, 2018Updated 7 years ago