Modular PyTorch implementation of policy gradient methods
☆25 · Nov 15, 2018 · Updated 7 years ago
Alternatives and similar repositories for policy-gradient-methods
Users who are interested in policy-gradient-methods are comparing it to the libraries listed below.
- Explore the optimization landscape for direct policy learning reinforcement learning. ☆51 · Jan 16, 2019 · Updated 7 years ago
- Julia implementations of temporal difference Reinforcement Learning algorithms like Q-Learning and SARSA ☆13 · Nov 16, 2025 · Updated 4 months ago
- A python implementation of tile coding using numpy. ☆11 · May 13, 2017 · Updated 8 years ago
- Code for generating options for planning and reinforcement learning ☆12 · Feb 18, 2021 · Updated 5 years ago
- ☆12 · Dec 23, 2022 · Updated 3 years ago
- Heuristic Dynamic Programming with Python ☆14 · Jul 28, 2014 · Updated 11 years ago
- A Prior of a Googol Gaussians: a Tensor Ring Induced Prior for Generative Models ☆29 · Jun 17, 2024 · Updated last year
- Compression-based decentralized stochastic gradient descent (DSGD) algorithms tailored for digital and analog wireless implementations ☆13 · Jun 26, 2022 · Updated 3 years ago
- Simulation code for Federated Learning with Over-the-Air Computation. ☆11 · Sep 11, 2020 · Updated 5 years ago
- ☆14 · Jan 4, 2025 · Updated last year
- Minimal PyTorch Library for Natural Evolution Strategies ☆18 · Sep 29, 2021 · Updated 4 years ago
- Implementation of vanilla stochastic (categorical) policy gradient algorithm to play cartpole. ☆16 · Apr 1, 2021 · Updated 4 years ago
- Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC ☆100 · Jul 23, 2019 · Updated 6 years ago
- Binary floating-point formats in Go (IEEE 754 half and quadruple precision, x86 extended precision and PowerPC quadruple precision with d… ☆23 · Dec 12, 2021 · Updated 4 years ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees ☆93 · Sep 13, 2019 · Updated 6 years ago
- Online Variance Reduction ☆15 · May 9, 2019 · Updated 6 years ago
- This repository contains the source code, models and data files for the work titled: "Unsupervised Image Style Embeddings for Retrieval a… ☆13 · May 29, 2021 · Updated 4 years ago
- A neural branch predictor tested using CPU emulator, testing both supervised learning and reinforcement learning (for COS 583: Great Mome… ☆15 · May 17, 2017 · Updated 8 years ago
- Counterfactual Regret Minimization (CFR) sample code in Python ☆14 · Apr 16, 2019 · Updated 6 years ago
- Adaptive Heuristic Method Based on SA and LNS for Solving Vehicle Routing Problem ☆13 · Oct 9, 2017 · Updated 8 years ago
- Pytorch package for geometric softmax ☆12 · Jun 13, 2019 · Updated 6 years ago
- Policy Optimization with Penalized Point Probability Distance: an Alternative to Proximal Policy Optimization ☆44 · Nov 8, 2018 · Updated 7 years ago
- Kirsche, connecting your references. ☆14 · Aug 20, 2024 · Updated last year
- Code for "Best arm identification in multi-armed bandits with delayed feedback", AISTATS 2018. ☆20 · Apr 3, 2018 · Updated 7 years ago
- Implementation of Counterfactual risk minimization ☆26 · Apr 13, 2017 · Updated 8 years ago
- Keras style progressbar for Pytorch (PK Bar) ☆32 · May 15, 2024 · Updated last year
- 🍼 Baby's CoThought: Leveraging LLMs for Enhanced Reasoning in Compact Models (BabyLM Challenge) ☆17 · Jan 10, 2025 · Updated last year
- Boiler plate code for Torch based ML projects ☆10 · Jul 14, 2021 · Updated 4 years ago
- ☆13 · May 16, 2019 · Updated 6 years ago
- ☆24 · Nov 27, 2020 · Updated 5 years ago
- Code for the paper "Unbiased Supervised Contrastive Learning" | ICLR 2023 https://openreview.net/forum?id=Ph5cJSfD2XN ☆13 · Sep 22, 2023 · Updated 2 years ago
- Project exploring Multi Task Deep Reinforcement Learning neural network architectures and algorithms with Open AI Gym and TensorFlow ☆17 · Sep 5, 2018 · Updated 7 years ago
- WebRTC LXJS 2014 Workshop ☆27 · Jun 27, 2014 · Updated 11 years ago
- Reinforcement Learning using Policy Gradient to solve OpenAI Gym games ☆112 · Dec 13, 2017 · Updated 8 years ago
- A toy example of Policy Gradient implemented in Pytorch ☆95 · Jan 24, 2018 · Updated 8 years ago
- A TF2.0 implementation of RL baselines. ☆10 · Sep 24, 2021 · Updated 4 years ago
- ☆11 · Oct 13, 2017 · Updated 8 years ago
- Improving Neural Network Training in Low Dimensional Random Bases ☆13 · Nov 17, 2020 · Updated 5 years ago
- Code Repo for paper Label Leakage and Protection in Two-party Split Learning (ICLR 2022). ☆22 · Mar 12, 2022 · Updated 4 years ago