An implementation of TRPO with GAE in PyTorch
☆16Jul 22, 2023Updated 2 years ago
Alternatives and similar repositories for trpo-pytorch
Users that are interested in trpo-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An implementation of Constrained Policy Optimization (Achiam 2017) in PyTorch☆25Apr 10, 2020Updated 6 years ago
- Deep Reinforcement Learning algorithms implemented in PyTorch☆49Jun 9, 2018Updated 7 years ago
- code for ‘Towards Long-term Fairness in Recommendation’☆23Sep 4, 2023Updated 2 years ago
- The code for the paper *The Sensitivity of Counterfactual Fairness to Unmeasured Confounding* @ UAI 2019☆14Apr 4, 2020Updated 6 years ago
- implement n2nmn with pytorch☆19Apr 10, 2019Updated 7 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Maze generation & solving with Python☆10Oct 2, 2021Updated 4 years ago
- Gamepad API Content Kit☆14Jun 1, 2016Updated 9 years ago
- A set of algorithms and environments to train SafeRL agents, written in TensorFlow2 and OpenAI Gym.☆12Jul 26, 2022Updated 3 years ago
- Proximal policy optimization in PyTorch. Easy to read and understand.☆51Oct 30, 2020Updated 5 years ago
- PyTorch implementation of Deterministic Generative Adversarial Imitation Learning (GAIL) for Off Policy learning☆67Dec 30, 2019Updated 6 years ago
- High-quality implementations of deep reinforcement learning algorithms for experiments☆51Aug 30, 2024Updated last year
- Code for IEEE transactions on neural networks and learning system☆13Jun 18, 2021Updated 4 years ago
- ☆10Jul 28, 2023Updated 2 years ago
- Code for "Generative causal explanations of black-box classifiers"☆36Jan 15, 2021Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆12Mar 28, 2023Updated 3 years ago
- Optimized dqn for caffe☆11Dec 18, 2015Updated 10 years ago
- True Sublime Text style multiple selections for Vim☆64Dec 3, 2014Updated 11 years ago
- The PackNet Continual Learning Method in Pytorch☆15Aug 19, 2021Updated 4 years ago
- Monte Carlo tree search for the travelling salesman problem (MCTS for the TSP)☆12Jun 18, 2022Updated 3 years ago
- Recurrent continuous reinforcement learning algorithms implemented in Pytorch.☆51May 26, 2021Updated 4 years ago
- Atomic crystal structures for Julia☆21Aug 15, 2018Updated 7 years ago
- Code for the paper Normalizing Flows are Capable Models for RL☆19Jun 3, 2025Updated 11 months ago
- A markdown-it plug-in for rendering citations and a bibliography inside markdown☆12Dec 1, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆11Oct 19, 2020Updated 5 years ago
- Java JNI binding for mujoco physics system☆15Mar 18, 2025Updated last year
- ☆11Apr 24, 2018Updated 8 years ago
- ☆10Jun 27, 2017Updated 8 years ago
- RecAlpaca: A simple framework combing Alpaca and Recommendations.☆35Mar 30, 2023Updated 3 years ago
- PyTorch implementation of different Deep RL algorithms for the LunarLander-v2 environment in OpenAI Gym☆11May 20, 2018Updated 7 years ago
- A toy stereo visual inertial odometry (VIO) system☆15Apr 28, 2023Updated 3 years ago
- OpenAI ROS☆12Mar 7, 2019Updated 7 years ago
- Contains the PennyLane ProjectQ plugin. This plugin provides three devices to work with PennyLane - the ProjectQ IBM device, the ProjectQ…☆17Oct 28, 2025Updated 6 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- DEPRECATED - See select2/docs for the new documentation website☆11Sep 10, 2017Updated 8 years ago
- ☆12Mar 7, 2024Updated 2 years ago
- Code for paper "When Can Models Learn From Explanations? A Formal Framework for Understanding the Roles of Explanation Data"☆14Feb 16, 2021Updated 5 years ago
- Ant Gather and Ant Maze envs, separated from RLLab☆11Aug 2, 2018Updated 7 years ago
- Recommendation system with actor and critic