Modular PyTorch implementation of policy gradient methods
☆24Nov 15, 2018Updated 7 years ago
Alternatives and similar repositories for policy-gradient-methods
Users that are interested in policy-gradient-methods are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Explore the optimization landscape for direct policy learning reinforcement learning.☆51Jan 16, 2019Updated 7 years ago
- ☆13Jan 1, 2018Updated 8 years ago
- Julia implementations of temporal difference Reinforcement Learning algorithms like Q-Learning and SARSA☆13Nov 16, 2025Updated 6 months ago
- Code for generating options for planning and reinforcement learning☆12Feb 18, 2021Updated 5 years ago
- Implementation of original Benders procedures in Python☆10Apr 6, 2019Updated 7 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Stochastic Variance Reduction Policy Gradient Estimation☆11Nov 6, 2018Updated 7 years ago
- ☆11Apr 20, 2021Updated 5 years ago
- Thompson Sampling for Bandits using UCB policy☆10Jul 29, 2017Updated 8 years ago
- Heuristic Dynamic Programming with Python☆14Jul 28, 2014Updated 11 years ago
- A Prior of a Googol Gaussians: a Tensor Ring Induced Prior for Generative Models☆29Jun 17, 2024Updated last year
- Simulation code for Federated Learning with Over-the-Air Computation.☆11Sep 11, 2020Updated 5 years ago
- Minimal PyTorch Library for Natural Evolution Strategies☆18Sep 29, 2021Updated 4 years ago
- Human Activity Recognition with LSTM model and MLFlow Tracking☆11Jun 6, 2022Updated 4 years ago
- NLP course at Chulalongkorn University 2019☆21Mar 28, 2019Updated 7 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Implementation of vanilla stochaistic (categorical) policy gradient algorithm to play cartpole.☆16Apr 1, 2021Updated 5 years ago
- Binary floating-point formats in Go (IEEE 754 half and quadruple precision, x86 extended precision and PowerPC quadruple precision with d…☆23Dec 12, 2021Updated 4 years ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees☆93Sep 13, 2019Updated 6 years ago
- Stochastic Gradient Markov Chain Monte Carlo and Optimisation☆17Mar 21, 2017Updated 9 years ago
- Implementation of Alpha Go Zero algorithm for the game of tic-tac-toe☆16Nov 4, 2017Updated 8 years ago
- Online Variance Reduction☆15May 9, 2019Updated 7 years ago
- ☆17May 16, 2018Updated 8 years ago
- This repository contains the source code, models and data files for the work titled: "Unsupervised Image Style Embeddings for Retrieval a…☆13May 29, 2021Updated 5 years ago
- ☆45Nov 3, 2019Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A neural branch predictor tested using CPU emulator, testing both supervised learning and reinforcement learning (for COS 583: Great Mome…☆15May 17, 2017Updated 9 years ago
- Pytorch package for geometric softmax☆12Jun 13, 2019Updated 7 years ago
- Counterfactual Regret Minimization (CFR) sample code in Python☆14Apr 16, 2019Updated 7 years ago
- Adaptive Heuristic Method Based on SA and LNS for Solving Vehicle Routing Problem☆13Oct 9, 2017Updated 8 years ago
- Policy Optimization with Penalized Point Probability Distance: an Alternative to Proximal Policy Optimization☆44Nov 8, 2018Updated 7 years ago
- A novel variant of sliced Wasserstein based on a new slicing technique that utilizes the convolution operator.☆12Jan 14, 2023Updated 3 years ago
- To convert a 2D image into 3D image and make it move.☆10Mar 3, 2019Updated 7 years ago
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning☆22Oct 26, 2018Updated 7 years ago
- PyTorch implementation of Trust Region Policy Optimization☆448Sep 13, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆13May 16, 2019Updated 7 years ago
- ☆17Sep 17, 2023Updated 2 years ago
- The software used in NeurIPS 2022 Paper Don't Pour Cereal into Coffee: Differentiable Temporal Logic for Temporal Action Segmentation.☆20Aug 22, 2023Updated 2 years ago
- ☆24Nov 27, 2020Updated 5 years ago
- ☆12Jan 10, 2023Updated 3 years ago
- Code for the paper "Unbiased Supervised Contrastive Learning" | ICLR 2023 https://openreview.net/forum?id=Ph5cJSfD2XN☆12Sep 22, 2023Updated 2 years ago
- Experience-embedded Visual Foresight, CoRL 2019☆14Nov 13, 2019Updated 6 years ago