An implementation of TRPO with GAE in PyTorch
☆16 · Jul 22, 2023 · Updated 2 years ago
Alternatives and similar repositories for trpo-pytorch
Users that are interested in trpo-pytorch are comparing it to the libraries listed below.
- An implementation of Constrained Policy Optimization (Achiam 2017) in PyTorch ☆25 · Apr 10, 2020 · Updated 5 years ago
- Deep Reinforcement Learning algorithms implemented in PyTorch ☆49 · Jun 9, 2018 · Updated 7 years ago
- Code for ‘Towards Long-term Fairness in Recommendation’ ☆23 · Sep 4, 2023 · Updated 2 years ago
- PyTorch implementation of Trust Region Policy Optimization ☆450 · Sep 13, 2018 · Updated 7 years ago
- The code for the paper *The Sensitivity of Counterfactual Fairness to Unmeasured Confounding* @ UAI 2019 ☆14 · Apr 4, 2020 · Updated 5 years ago
- ☆15 · Oct 9, 2022 · Updated 3 years ago
- LaTeX template for Rutgers University Computer Science thesis ☆23 · Nov 10, 2019 · Updated 6 years ago
- Maze generation & solving with Python ☆10 · Oct 2, 2021 · Updated 4 years ago
- ☆18 · Sep 7, 2023 · Updated 2 years ago
- E3xSO3 convolution implementation presented at MIDL 2023 https://openreview.net/pdf?id=lri_iAbpn_r ☆14 · Apr 12, 2023 · Updated 2 years ago
- Code for IEEE Transactions on Neural Networks and Learning Systems ☆13 · Jun 18, 2021 · Updated 4 years ago
- ☆10 · Jul 28, 2023 · Updated 2 years ago
- A repository of DQN and its variants implemented in PyTorch, based on the original paper. ☆13 · Nov 18, 2019 · Updated 6 years ago
- Optimized DQN for Caffe ☆11 · Dec 18, 2015 · Updated 10 years ago
- True Sublime Text style multiple selections for Vim ☆64 · Dec 3, 2014 · Updated 11 years ago
- ☆11 · Oct 14, 2019 · Updated 6 years ago
- The PackNet Continual Learning Method in PyTorch ☆15 · Aug 19, 2021 · Updated 4 years ago
- Monte Carlo tree search for the travelling salesman problem (MCTS for the TSP) ☆12 · Jun 18, 2022 · Updated 3 years ago
- Code for the paper Normalizing Flows are Capable Models for RL ☆19 · Jun 3, 2025 · Updated 9 months ago
- A markdown-it plug-in for rendering citations and a bibliography inside markdown ☆12 · Dec 1, 2024 · Updated last year
- ☆11 · Oct 19, 2020 · Updated 5 years ago
- Java JNI binding for mujoco physics system ☆14 · Mar 18, 2025 · Updated last year
- Scripts used to recreate the results of the ISMRM 2015 Tractography Challenge ☆18 · Dec 9, 2022 · Updated 3 years ago
- RecAlpaca: A simple framework combining Alpaca and Recommendations. ☆35 · Mar 30, 2023 · Updated 2 years ago
- PyTorch implementation of different Deep RL algorithms for the LunarLander-v2 environment in OpenAI Gym ☆11 · May 20, 2018 · Updated 7 years ago
- ☆12 · Dec 8, 2022 · Updated 3 years ago
- A toy stereo visual inertial odometry (VIO) system ☆14 · Apr 28, 2023 · Updated 2 years ago
- OpenAI ROS ☆12 · Mar 7, 2019 · Updated 7 years ago
- USAD model on UCR Time Series Anomaly Archive ☆14 · Oct 22, 2021 · Updated 4 years ago
- Reinforcement Learning Benchmark ☆13 · Sep 9, 2020 · Updated 5 years ago
- Code for paper "When Can Models Learn From Explanations? A Formal Framework for Understanding the Roles of Explanation Data" ☆14 · Feb 16, 2021 · Updated 5 years ago
- DEPRECATED - See select2/docs for the new documentation website ☆11 · Sep 10, 2017 · Updated 8 years ago
- Ant Gather and Ant Maze envs, separated from RLLab ☆11 · Aug 2, 2018 · Updated 7 years ago
- Code for Latent Action Space for Offline Reinforcement Learning [CoRL 2020] ☆53 · Oct 18, 2021 · Updated 4 years ago
- Code for Policy Learning for Fairness in Ranking paper at NeurIPS 2019 ☆20 · Apr 20, 2022 · Updated 3 years ago
- Recommendation system with actor and critic ☆18 · Aug 10, 2022 · Updated 3 years ago
- Code for our paper: Hierarchical RL Using an Ensemble of Proprioceptive Periodic Policies ☆15 · Feb 21, 2019 · Updated 7 years ago
- Robotarium quadcopter simulator in Python. ☆10 · Jan 10, 2022 · Updated 4 years ago
- A library for random feature maps in Python. ☆17 · Aug 27, 2020 · Updated 5 years ago