A toy example of Policy Gradient implemented in Pytorch
☆95Jan 24, 2018Updated 8 years ago
Alternatives and similar repositories for pytorch-policy-gradient-example
Users that are interested in pytorch-policy-gradient-example are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of vanilla stochaistic (categorical) policy gradient algorithm to play cartpole.☆16Apr 1, 2021Updated 5 years ago
- Simple partially ordered sets for Julia☆10Jul 29, 2024Updated last year
- Lipschitz Lifelong RL☆11Nov 6, 2020Updated 5 years ago
- Modular PyTorch implementation of policy gradient methods☆24Nov 15, 2018Updated 7 years ago
- ☆16May 11, 2017Updated 9 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Model-based Policy Gradients☆32Mar 12, 2020Updated 6 years ago
- MATLAB LMPC implementation for a double integrator system☆61Apr 26, 2021Updated 5 years ago
- Framework of DataLog Neural Program Synthesis☆27Apr 2, 2019Updated 7 years ago
- ☆11Jun 9, 2022Updated 4 years ago
- This is an pytorch implementation of Distributed Proximal Policy Optimization(DPPO).☆62Jul 30, 2018Updated 7 years ago
- Basic reinforcement learning algorithms. Including:DQN,Double DQN, Dueling DQN, SARSA, REINFORCE, baseline-REINFORCE, Actor-Critic,DDPG,D…☆97Mar 1, 2021Updated 5 years ago
- ☆20Apr 10, 2018Updated 8 years ago
- Faithful Python implementation of the paper "Towards Deep Symbolic Reinforcement Learning" by Garnelo et al.☆13Mar 23, 2021Updated 5 years ago
- Unifying sparse approximations for Gaussian process regression and classification, using Power EP☆22Oct 17, 2016Updated 9 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Policy Gradient Actor-Critic PyTorch | Lunar Lander v2☆76May 7, 2019Updated 7 years ago
- Big data simulation of Chicago's public transportation to improve transit planning and reduce bus crowding☆22Apr 26, 2017Updated 9 years ago
- structured attention encoder☆13Jun 6, 2018Updated 8 years ago
- 课程笔记,David Silver,CS294 ...☆15Jan 7, 2019Updated 7 years ago
- A MATLAB implementation of the Proximally Stabilized Fischer-Burmeister (FBstab) quadratic programming solver☆12Jan 27, 2022Updated 4 years ago
- Minimal Monte Carlo Policy Gradient (REINFORCE) Algorithm Implementation in Keras☆160Dec 26, 2019Updated 6 years ago
- Exploring algorithms in the domain of offline reinforcement learning (REM, Ensemble-DQN, DQN, ...)☆17Jul 7, 2020Updated 5 years ago
- ☆11Jan 27, 2018Updated 8 years ago
- GiMeFive: Towards Interpretable Facial Emotion Classification 😄😲😭😡🤢😨 (PyTorch Implementation)☆15Jul 6, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆50Jun 4, 2026Updated 2 weeks ago
- Randomized Smoothing of All Shapes and Sizes (ICML 2020).☆51Jul 23, 2020Updated 5 years ago
- Datasets for compositional learning☆11Nov 28, 2018Updated 7 years ago
- A repository for code of reinforcement learning algorithms with PyTorch☆30Sep 20, 2021Updated 4 years ago
- Code for "LifeLong Incremental Reinforcement Learning (LLIRL)"☆21Jan 28, 2021Updated 5 years ago
- Image Captioning in Chinese☆11Jul 2, 2017Updated 8 years ago
- The relevant codes for "GANI: Global Attacks on Graph Neural Networks via Imperceptible Node Injections".☆14Mar 21, 2024Updated 2 years ago
- Reinforcement Learning via Latent State Decoding☆29Jun 12, 2023Updated 3 years ago
- ☆15Feb 8, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆15Jan 4, 2025Updated last year
- Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes☆10Feb 22, 2024Updated 2 years ago
- This repo contains code for *Merging and Evolution: Improving Convolutional Neural Networks for Mobile Applications*.☆11May 3, 2018Updated 8 years ago
- ☆66May 25, 2020Updated 6 years ago
- Pytorch version of IEEE Transactions on Multimedia 2019: "Naturalness-Aware Deep No-Reference Image Quality Assessment."☆12Jun 30, 2020Updated 5 years ago
- gILC - An Open Source Tool for Model Based Iterative Learning Control☆15Apr 3, 2019Updated 7 years ago
- Implementation of DropBlock in Pytorch☆82Nov 4, 2018Updated 7 years ago