A toy example of Policy Gradient implemented in Pytorch
☆95Jan 24, 2018Updated 8 years ago
Alternatives and similar repositories for pytorch-policy-gradient-example
Users that are interested in pytorch-policy-gradient-example are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of vanilla stochaistic (categorical) policy gradient algorithm to play cartpole.☆16Apr 1, 2021Updated 4 years ago
- Actor Critic model to play Cartpole game☆53Aug 4, 2018Updated 7 years ago
- Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning☆11Jun 16, 2022Updated 3 years ago
- ☆12Jun 9, 2022Updated 3 years ago
- Basic reinforcement learning algorithms. Including:DQN,Double DQN, Dueling DQN, SARSA, REINFORCE, baseline-REINFORCE, Actor-Critic,DDPG,D…☆96Mar 1, 2021Updated 5 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆10Nov 27, 2019Updated 6 years ago
- Faithful Python implementation of the paper "Towards Deep Symbolic Reinforcement Learning" by Garnelo et al.☆13Mar 23, 2021Updated 5 years ago
- Policy Gradient Actor-Critic PyTorch | Lunar Lander v2☆75May 7, 2019Updated 6 years ago
- 课程笔记,David Silver,CS294 ...☆15Jan 7, 2019Updated 7 years ago
- Minimal Monte Carlo Policy Gradient (REINFORCE) Algorithm Implementation in Keras☆160Dec 26, 2019Updated 6 years ago
- hierarchical Q-learning implementation☆11Jun 9, 2015Updated 10 years ago
- A repository for code of reinforcement learning algorithms with PyTorch☆30Sep 20, 2021Updated 4 years ago
- Code for "LifeLong Incremental Reinforcement Learning (LLIRL)"☆21Jan 28, 2021Updated 5 years ago
- A PyTorch Implementation of Neural Turing Machine☆14Jul 24, 2020Updated 5 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- simple keras implement for 《Memory Fusion Network for Multi-view Sequential Learning》☆14Apr 9, 2021Updated 4 years ago
- The relevant codes for "GANI: Global Attacks on Graph Neural Networks via Imperceptible Node Injections".☆14Mar 21, 2024Updated 2 years ago
- ☆15Feb 8, 2023Updated 3 years ago
- a q-learning algorithms on packet routing.☆14Dec 1, 2018Updated 7 years ago
- Python and TensorFlow implementation of the paper "Learning Explanatory Rules from Noisy Data." Evans Richard and Edward Grefenstette. Jo…☆53May 16, 2021Updated 4 years ago
- Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes☆10Feb 22, 2024Updated 2 years ago
- Progressive Growing of Points with Tree-structured Generators (BMVC 2021)☆11Nov 1, 2023Updated 2 years ago
- Pytorch version of IEEE Transactions on Multimedia 2019: "Naturalness-Aware Deep No-Reference Image Quality Assessment."☆12Jun 30, 2020Updated 5 years ago
- gILC - An Open Source Tool for Model Based Iterative Learning Control☆15Apr 3, 2019Updated 6 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Implementation of DropBlock in Pytorch☆82Nov 4, 2018Updated 7 years ago
- iccad contest 2022 problem B☆16Sep 4, 2022Updated 3 years ago
- A PyTorch implementation of deep Q-learning for Atari games☆13Dec 4, 2018Updated 7 years ago
- The code for NeurIPS 2023 paper DSR☆14Oct 8, 2023Updated 2 years ago
- An open-source tool for sequence learning in NLP built on TensorFlow.☆11Dec 23, 2021Updated 4 years ago
- PyTorch implementation of the intrinsic curiosity module (ICM) and A3C a;lgorithm☆22Oct 4, 2021Updated 4 years ago
- Personally make object detection dataset based on KonoHana Kitan cartoon character.☆10Jun 23, 2019Updated 6 years ago
- Explore and Control with Adversarial Surprise☆10Jul 20, 2021Updated 4 years ago
- 🦾Distributed Natural Evolution Strategies Build with PyTorch and Ray☆18Jul 20, 2018Updated 7 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆18Jan 11, 2019Updated 7 years ago
- A paper list of sample-efficient reinforcement learning☆18Jan 12, 2022Updated 4 years ago
- A toy compiler for subset of c++ written in python☆16Jan 17, 2025Updated last year
- Tensorflow: Generalizing Across Domains via Cross-Gradient Training☆15May 11, 2018Updated 7 years ago
- A C++/CUDA toolkit for neural machine translation (RNN-Based NMT) across multiple GPUs☆10Oct 17, 2022Updated 3 years ago
- ☆10Jun 14, 2025Updated 9 months ago
- Implementation of paper "Parallelizable Stack Long Short-Term Memory"☆12Apr 8, 2019Updated 6 years ago