An implementation of TRPO with GAE in PyTorch
☆16 · Jul 22, 2023 · Updated 2 years ago
Alternatives and similar repositories for trpo-pytorch
Users that are interested in trpo-pytorch are comparing it to the libraries listed below.
- An implementation of Constrained Policy Optimization (Achiam 2017) in PyTorch ☆25 · Apr 10, 2020 · Updated 6 years ago
- Deep Reinforcement Learning algorithms implemented in PyTorch ☆48 · Jun 9, 2018 · Updated 7 years ago
- code for ‘Towards Long-term Fairness in Recommendation’ ☆23 · Sep 4, 2023 · Updated 2 years ago
- PyTorch implementation of Trust Region Policy Optimization ☆448 · Sep 13, 2018 · Updated 7 years ago
- The code for the paper *The Sensitivity of Counterfactual Fairness to Unmeasured Confounding* @ UAI 2019 ☆14 · Apr 4, 2020 · Updated 6 years ago
- ☆15 · Oct 9, 2022 · Updated 3 years ago
- Maze generation & solving with Python ☆10 · Oct 2, 2021 · Updated 4 years ago
- Gamepad API Content Kit ☆14 · Jun 1, 2016 · Updated 9 years ago
- A set of algorithms and environments to train SafeRL agents, written in TensorFlow2 and OpenAI Gym. ☆12 · Jul 26, 2022 · Updated 3 years ago
- Proximal policy optimization in PyTorch. Easy to read and understand. ☆51 · Oct 30, 2020 · Updated 5 years ago
- ☆18 · Sep 7, 2023 · Updated 2 years ago
- PyTorch implementation of Deterministic Generative Adversarial Imitation Learning (GAIL) for Off Policy learning ☆67 · Dec 30, 2019 · Updated 6 years ago
- ☆10 · Jul 28, 2023 · Updated 2 years ago
- ☆12 · Mar 28, 2023 · Updated 3 years ago
- This is a repository of DQN and its variants, implemented in PyTorch based on the original paper. ☆13 · Nov 18, 2019 · Updated 6 years ago
- The PackNet Continual Learning Method in Pytorch ☆15 · Aug 19, 2021 · Updated 4 years ago
- Monte Carlo tree search for the travelling salesman problem (MCTS for the TSP) ☆12 · Jun 18, 2022 · Updated 3 years ago
- Code for the paper Normalizing Flows are Capable Models for RL ☆19 · Jun 3, 2025 · Updated 11 months ago
- ☆11 · Oct 19, 2020 · Updated 5 years ago
- ☆11 · Apr 24, 2018 · Updated 8 years ago
- ☆10 · Jun 27, 2017 · Updated 8 years ago
- RecAlpaca: A simple framework combining Alpaca and Recommendations. ☆35 · Mar 30, 2023 · Updated 3 years ago
- PyTorch implementation of different Deep RL algorithms for the LunarLander-v2 environment in OpenAI Gym ☆11 · May 20, 2018 · Updated 8 years ago
- A toy stereo visual inertial odometry (VIO) system ☆15 · Apr 28, 2023 · Updated 3 years ago
- USAD model on UCR Time Series Anomaly Archive ☆15 · Oct 22, 2021 · Updated 4 years ago
- Reinforcement Learning Benchmark ☆13 · Sep 9, 2020 · Updated 5 years ago
- ☆12 · Mar 7, 2024 · Updated 2 years ago
- Code for paper "When Can Models Learn From Explanations? A Formal Framework for Understanding the Roles of Explanation Data" ☆14 · Feb 16, 2021 · Updated 5 years ago
- Ant Gather and Ant Maze envs, separated from RLLab ☆11 · Aug 2, 2018 · Updated 7 years ago
- Code for Policy Learning for Fairness in Ranking paper at NeurIPS 2019 ☆20 · Apr 20, 2022 · Updated 4 years ago
- Code for our paper: Hierarchical RL Using an Ensemble of Proprioceptive Periodic Policies ☆15 · Feb 21, 2019 · Updated 7 years ago
- A Dataset for Conversational Recommendation over KnowledgeGraph in E-commerce ☆51 · Sep 26, 2021 · Updated 4 years ago
- optimize neuro-centric parameters instead of weights to solve RL tasks ☆14 · Oct 2, 2023 · Updated 2 years ago
- Code for Latent Action Space for Offline Reinforcement Learning [CoRL 2020] ☆55 · Oct 18, 2021 · Updated 4 years ago
- RBDL - Rigid Body Dynamics Library ☆12 · Jul 27, 2022 · Updated 3 years ago
- Simple implementation for Constrained Policy Optimization in Pytorch ☆17 · Aug 27, 2022 · Updated 3 years ago
- JAX implementations of various deep reinforcement learning algorithms. ☆25 · Feb 2, 2025 · Updated last year
- Code and dataset for the paper "IsarStep: a Benchmark for High-level Mathematical Reasoning" ☆12 · Mar 15, 2021 · Updated 5 years ago
- Code for Generalization Guarantees for (Multi-Modal) Imitation Learning ☆11 · Jul 14, 2022 · Updated 3 years ago