Modular PyTorch implementation of policy gradient methods
☆24 · Nov 15, 2018 · Updated 7 years ago
Alternatives and similar repositories for policy-gradient-methods
Users that are interested in policy-gradient-methods are comparing it to the libraries listed below.
- Explore the optimization landscape for direct policy learning reinforcement learning. ☆51 · Jan 16, 2019 · Updated 7 years ago
- Julia implementations of temporal difference Reinforcement Learning algorithms like Q-Learning and SARSA ☆12 · Nov 16, 2025 · Updated 7 months ago
- A python implementation of tile coding using numpy. ☆11 · May 13, 2017 · Updated 9 years ago
- Code for generating options for planning and reinforcement learning ☆12 · Feb 18, 2021 · Updated 5 years ago
- Implementation of original Benders procedures in Python ☆10 · Apr 6, 2019 · Updated 7 years ago
- Stochastic Variance Reduction Policy Gradient Estimation ☆11 · Nov 6, 2018 · Updated 7 years ago
- ☆11 · Apr 20, 2021 · Updated 5 years ago
- Reimplementation of simple policy gradient algorithms such as REINFORCE and Actor-Critic methods. ☆17 · Aug 26, 2023 · Updated 2 years ago
- Thompson Sampling for Bandits using UCB policy ☆10 · Jul 29, 2017 · Updated 8 years ago
- Heuristic Dynamic Programming with Python ☆14 · Jul 28, 2014 · Updated 11 years ago
- Simulation code for Federated Learning with Over-the-Air Computation. ☆11 · Sep 11, 2020 · Updated 5 years ago
- Minimal PyTorch Library for Natural Evolution Strategies ☆18 · Sep 29, 2021 · Updated 4 years ago
- Binary floating-point formats in Go (IEEE 754 half and quadruple precision, x86 extended precision and PowerPC quadruple precision with d… ☆23 · Dec 12, 2021 · Updated 4 years ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees ☆94 · Sep 13, 2019 · Updated 6 years ago
- Implementation of Alpha Go Zero algorithm for the game of tic-tac-toe ☆16 · Nov 4, 2017 · Updated 8 years ago
- Online Variance Reduction ☆15 · May 9, 2019 · Updated 7 years ago
- ☆17 · May 16, 2018 · Updated 8 years ago
- ☆45 · Nov 3, 2019 · Updated 6 years ago
- A neural branch predictor tested using CPU emulator, testing both supervised learning and reinforcement learning (for COS 583: Great Mome… ☆15 · May 17, 2017 · Updated 9 years ago
- Codes for Stackelberg GAN ☆15 · Apr 23, 2019 · Updated 7 years ago
- PyTorch implementation of ICML 2017 paper, SplitNet: Learning to Semantically Split Deep Networks for Parameter Reduction and Model Paral… ☆17 · Oct 24, 2017 · Updated 8 years ago
- Pytorch package for geometric softmax ☆12 · Jun 13, 2019 · Updated 7 years ago
- Policy Optimization with Penalized Point Probability Distance: an Alternative to Proximal Policy Optimization ☆44 · Nov 8, 2018 · Updated 7 years ago
- Code for "Best arm identification in multi-armed bandits with delayed feedback", AISTATS 2018. ☆20 · Apr 3, 2018 · Updated 8 years ago
- To convert a 2D image into 3D image and make it move. ☆10 · Mar 3, 2019 · Updated 7 years ago
- Keras style progressbar for Pytorch (PK Bar) ☆32 · May 15, 2024 · Updated 2 years ago
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning ☆23 · Oct 26, 2018 · Updated 7 years ago
- Boiler plate code for Torch based ML projects ☆10 · Jul 14, 2021 · Updated 4 years ago
- PyTorch implementation of Trust Region Policy Optimization ☆448 · Sep 13, 2018 · Updated 7 years ago
- The software used in NeurIPS 2022 Paper Don't Pour Cereal into Coffee: Differentiable Temporal Logic for Temporal Action Segmentation. ☆20 · Aug 22, 2023 · Updated 2 years ago
- Experience-embedded Visual Foresight, CoRL 2019 ☆14 · Nov 13, 2019 · Updated 6 years ago
- Harbin Institute of Technology (哈工大) computer science courseware ☆21 · Nov 3, 2022 · Updated 3 years ago
- ☆21 · Jul 28, 2022 · Updated 3 years ago
- WebRTC LXJS 2014 Workshop ☆27 · Jun 27, 2014 · Updated 12 years ago
- Reinforcement Learning using Policy Gradient to solve OpenAI Gym games ☆112 · Dec 13, 2017 · Updated 8 years ago
- official implementation of RoSAS: Deep Semi-supervised Anomaly Detection with Contamination-resilient Continuous Supervision ☆12 · Jul 18, 2023 · Updated 2 years ago
- ☆13 · Jul 22, 2021 · Updated 4 years ago
- A TF2.0 implementation of RL baselines. ☆10 · Sep 24, 2021 · Updated 4 years ago
- ☆11 · Oct 13, 2017 · Updated 8 years ago