A toy example of Policy Gradient implemented in Pytorch
☆95Jan 24, 2018Updated 8 years ago
Alternatives and similar repositories for pytorch-policy-gradient-example
Users that are interested in pytorch-policy-gradient-example are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of vanilla stochaistic (categorical) policy gradient algorithm to play cartpole.☆16Apr 1, 2021Updated 5 years ago
- Lipschitz Lifelong RL☆11Nov 6, 2020Updated 5 years ago
- Modular PyTorch implementation of policy gradient methods☆24Nov 15, 2018Updated 7 years ago
- An event-based on-line adaptable fast nonlinear model predictive control framework☆26Oct 26, 2018Updated 7 years ago
- Model-based Policy Gradients☆32Mar 12, 2020Updated 6 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- MATLAB LMPC implementation for a double integrator system☆61Apr 26, 2021Updated 5 years ago
- Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning☆11Jun 16, 2022Updated 3 years ago
- Basic reinforcement learning algorithms. Including:DQN,Double DQN, Dueling DQN, SARSA, REINFORCE, baseline-REINFORCE, Actor-Critic,DDPG,D…☆97Mar 1, 2021Updated 5 years ago
- ☆20Apr 10, 2018Updated 8 years ago
- Faithful Python implementation of the paper "Towards Deep Symbolic Reinforcement Learning" by Garnelo et al.☆13Mar 23, 2021Updated 5 years ago
- Policy Gradient Actor-Critic PyTorch | Lunar Lander v2☆76May 7, 2019Updated 7 years ago
- Optimization models and their applications in power systems☆14Jan 11, 2018Updated 8 years ago
- structured attention encoder☆13Jun 6, 2018Updated 7 years ago
- Generates a zip archive that is uploadable to arXiv.☆46Feb 19, 2020Updated 6 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A very simple build script for bare metal arm toolchain. NO LINUX!☆22Jan 6, 2013Updated 13 years ago
- ☆12May 21, 2017Updated 9 years ago
- 课程笔记,David Silver,CS294 ...☆15Jan 7, 2019Updated 7 years ago
- Antonino Furnari's fork of Feichtenhofer's gpu_flow, with temporal dilation.☆10Sep 18, 2020Updated 5 years ago
- A MATLAB implementation of the Proximally Stabilized Fischer-Burmeister (FBstab) quadratic programming solver☆12Jan 27, 2022Updated 4 years ago
- Implementation of the unary leapfrog join for efficient intersection of sorted sets.☆10Dec 4, 2019Updated 6 years ago
- ☆10Jun 16, 2025Updated 11 months ago
- Exploring algorithms in the domain of offline reinforcement learning (REM, Ensemble-DQN, DQN, ...)☆17Jul 7, 2020Updated 5 years ago
- The project consists of a image processing application that is using distributed processors (MPI). The development language is C/C++ with…☆13Mar 26, 2012Updated 14 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- hierarchical Q-learning implementation☆11Jun 9, 2015Updated 10 years ago
- IP prototyping in FPGA hardware☆18Aug 28, 2018Updated 7 years ago
- ☆11Jan 27, 2018Updated 8 years ago
- ☆48Apr 4, 2026Updated last month
- BESPOKV: Application-Tailored Flexible Key-Value Store for HPC☆12Aug 28, 2018Updated 7 years ago
- Randomized Smoothing of All Shapes and Sizes (ICML 2020).☆51Jul 23, 2020Updated 5 years ago
- Datasets for compositional learning☆11Nov 28, 2018Updated 7 years ago
- A repository for code of reinforcement learning algorithms with PyTorch☆30Sep 20, 2021Updated 4 years ago
- A Parallel Simulation Framework For Multicore Systems☆11May 20, 2017Updated 9 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- simple keras implement for 《Memory Fusion Network for Multi-view Sequential Learning》☆14Apr 9, 2021Updated 5 years ago
- This is the source code for our (Matthias Jasny, Lasse Thostrup, Tobias Ziegler and Carsten Binnig) published paper at SIGMOD’22: P4DB - …☆13Jan 24, 2023Updated 3 years ago
- ☆15Feb 8, 2023Updated 3 years ago
- visual dialog model in pytorch☆110May 16, 2018Updated 8 years ago
- ☆15Jan 4, 2025Updated last year
- Meme search engine built with Jina neural search framework. Search with captions or image files to find matching memes.☆24Jun 10, 2022Updated 3 years ago
- Codebase used to generate the results for NeurIPS23 "Adversarial Training for Graph Neural Networks: Pitfalls, Solutions, and New Directi…☆13Dec 8, 2023Updated 2 years ago