Modular PyTorch implementation of policy gradient methods
☆24Nov 15, 2018Updated 7 years ago
Alternatives and similar repositories for policy-gradient-methods
Users who are interested in policy-gradient-methods are comparing it to the libraries listed below.
- Explore the optimization landscape for direct policy learning reinforcement learning.☆51Jan 16, 2019Updated 7 years ago
- ☆13Jan 1, 2018Updated 8 years ago
- Implementation of original Benders procedures in Python☆10Apr 6, 2019Updated 7 years ago
- ☆11Apr 20, 2021Updated 5 years ago
- Reimplementation of simple policy gradient algorithms such as REINFORCE and Actor-Critic methods.☆17Aug 26, 2023Updated 2 years ago
- Thompson Sampling for Bandits using UCB policy☆10Jul 29, 2017Updated 8 years ago
- ☆13Mar 17, 2024Updated 2 years ago
- Heuristic Dynamic Programming with Python☆14Jul 28, 2014Updated 11 years ago
- Code for our work: Adaptive Exploration for Unsupervised Person Re-Identification☆30Mar 6, 2021Updated 5 years ago
- Compression-based decentralized stochastic gradient descent (DSGD) algorithms tailored for digital and analog wireless implementations☆13Jun 26, 2022Updated 3 years ago
- Minimal PyTorch Library for Natural Evolution Strategies☆18Sep 29, 2021Updated 4 years ago
- Implementation of the vanilla stochastic (categorical) policy gradient algorithm to play CartPole.☆16Apr 1, 2021Updated 5 years ago
- Binary floating-point formats in Go (IEEE 754 half and quadruple precision, x86 extended precision and PowerPC quadruple precision with d…☆23Dec 12, 2021Updated 4 years ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees☆93Sep 13, 2019Updated 6 years ago
- Implementation of Alpha Go Zero algorithm for the game of tic-tac-toe☆16Nov 4, 2017Updated 8 years ago
- Online Variance Reduction☆15May 9, 2019Updated 6 years ago
- ☆45Nov 3, 2019Updated 6 years ago
- A neural branch predictor tested using CPU emulator, testing both supervised learning and reinforcement learning (for COS 583: Great Mome…☆15May 17, 2017Updated 8 years ago
- Codes for Stackelberg GAN☆15Apr 23, 2019Updated 7 years ago
- PyTorch implementation of ICML 2017 paper, SplitNet: Learning to Semantically Split Deep Networks for Parameter Reduction and Model Paral…☆17Oct 24, 2017Updated 8 years ago
- Counterfactual Regret Minimization (CFR) sample code in Python☆14Apr 16, 2019Updated 7 years ago
- ☆13Jun 3, 2023Updated 2 years ago
- Policy Optimization with Penalized Point Probability Distance: an Alternative to Proximal Policy Optimization☆44Nov 8, 2018Updated 7 years ago
- A novel variant of sliced Wasserstein based on a new slicing technique that utilizes the convolution operator.☆12Jan 14, 2023Updated 3 years ago
- Implementation of Counterfactual risk minimization☆26Apr 13, 2017Updated 9 years ago
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning☆22Oct 26, 2018Updated 7 years ago
- PyTorch implementation of Trust Region Policy Optimization☆451Sep 13, 2018Updated 7 years ago
- ☆13May 16, 2019Updated 6 years ago
- Experience-embedded Visual Foresight, CoRL 2019☆14Nov 13, 2019Updated 6 years ago
- Harbin Institute of Technology computer science courseware☆22Nov 3, 2022Updated 3 years ago
- WebRTC LXJS 2014 Workshop☆27Jun 27, 2014Updated 11 years ago
- Reinforcement Learning using Policy Gradient to solve OpenAI Gym games☆112Dec 13, 2017Updated 8 years ago
- Official implementation of RoSAS: Deep Semi-supervised Anomaly Detection with Contamination-resilient Continuous Supervision☆12Jul 18, 2023Updated 2 years ago
- Pretrained models for the ranking task described in Cats and Captions vs. Creators and the Clock (WWW 2017)☆11Apr 28, 2019Updated 7 years ago
- ☆13Jul 22, 2021Updated 4 years ago
- ☆10Dec 30, 2022Updated 3 years ago
- A TF2.0 implementation of RL baselines.☆10Sep 24, 2021Updated 4 years ago
- ☆11Oct 13, 2017Updated 8 years ago
- Code Repo for paper Label Leakage and Protection in Two-party Split Learning (ICLR 2022).☆22Mar 12, 2022Updated 4 years ago