An implementation of Constrained Policy Optimization (Achiam 2017) in PyTorch
☆25 · Apr 10, 2020 · Updated 5 years ago
Alternatives and similar repositories for cpo-pytorch
Users who are interested in cpo-pytorch compare it with the libraries listed below.
- Constrained Policy Optimization ☆336 · Jun 7, 2017 · Updated 8 years ago
- PyTorch implementation of Constrained Policy Optimization ☆56 · Oct 19, 2021 · Updated 4 years ago
- An implementation of TRPO with GAE in PyTorch ☆16 · Jul 22, 2023 · Updated 2 years ago
- Pytorch Implementation for First Order Constrained Optimization in Policy Space (FOCOPS). ☆29 · Dec 9, 2021 · Updated 4 years ago
- Basic constrained RL agents used in experiments for the "Benchmarking Safe Exploration in Deep Reinforcement Learning" paper. ☆458 · Apr 2, 2023 · Updated 2 years ago
- Constrained Exploration and Recovery from Experience Shaping ☆22 · Apr 18, 2019 · Updated 6 years ago
- Code for gradient rollback, which explains predictions of neural matrix factorization models, as for example used for knowledge base comp… ☆21 · Mar 16, 2021 · Updated 4 years ago
- An evolutionary algorithm-based optimization for tracking weights in the OpenSim Residual Reduction Algorithm (RRA). ☆11 · Jul 17, 2023 · Updated 2 years ago
- ☆11 · May 25, 2023 · Updated 2 years ago
- workspace comprising demo packages for our roscon2018 talk ☆10 · Dec 21, 2019 · Updated 6 years ago
- A pytorch implementation of Constrained Reinforcement Learning Algorithm, including Constrained Soft Actor Critic (Soft Actor Critic Lagr… ☆45 · May 30, 2023 · Updated 2 years ago
- Benchmark present methods for efficient reinforcement learning. Methods include Reptile, MAML, Residual Policy, etc. RL algorithms includ… ☆32 · Jan 19, 2023 · Updated 3 years ago
- JMLR Cover Letter Template ☆10 · Dec 15, 2021 · Updated 4 years ago
- A reinforcement learning package implemented in Torch ☆11 · Jan 24, 2016 · Updated 10 years ago
- Learning Fair Policies in Decentralized Cooperative Multi-Agent Reinforcement Learning ☆10 · Nov 14, 2021 · Updated 4 years ago
- Balanced K-means in Pytorch with strong GPU acceleration ☆12 · Apr 30, 2020 · Updated 5 years ago
- MobaXterm keygen ☆12 · Jan 3, 2024 · Updated 2 years ago
- Code for "So similar and yet incompatible: Toward the automated identification of semantically compatible words" in NAACL 2015 proceedi… ☆11 · May 11, 2015 · Updated 10 years ago
- Just a package with lots of files and testing stuff with moveit and grasping related things with REEM ☆11 · Feb 4, 2016 · Updated 10 years ago
- My Udacity Machine Learning Nanodegree capstone project in Reinforcement Learning ☆10 · Dec 1, 2017 · Updated 8 years ago
- ☆13 · May 30, 2021 · Updated 4 years ago
- Work towards creating a common JSON based format for compact network specification ☆14 · Jan 6, 2026 · Updated 2 months ago
- hierarchical deep reinforcement learning algorithms ☆43 · Dec 12, 2017 · Updated 8 years ago
- Learning Backtracking Models, ICLR'19 ☆10 · Feb 2, 2018 · Updated 8 years ago
- ros2 differential drive robot ☆10 · Jan 14, 2021 · Updated 5 years ago
- Code for ICLR 2022 publication: Who Is the Strongest Enemy? Towards Optimal and Efficient Evasion Attacks in Deep RL. https://openreview… ☆10 · Aug 31, 2024 · Updated last year
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta ☆13 · Nov 11, 2024 · Updated last year
- ☆12 · Oct 16, 2024 · Updated last year
- ☆11 · Apr 14, 2022 · Updated 3 years ago
- learn unreal engine 4 ☆12 · Feb 13, 2020 · Updated 6 years ago
- Data Driven Dynamic Hybrid Renewable Energy design and simulation framework ☆12 · May 5, 2020 · Updated 5 years ago
- SymPy based framework for optimized code generation for BSSN formulation of Einstein equation for heterogeneous platforms. ☆11 · Aug 18, 2025 · Updated 6 months ago
- Monomi designer and planner prototype ☆14 · Feb 16, 2015 · Updated 11 years ago
- Python electromagnetic cosimulation library ☆12 · Nov 5, 2025 · Updated 4 months ago
- Simulating neural network with Celery and Docker-Swarm. / Building a distributed system with Celery and Docker Swarm to simulate a neural network ☆12 · Jan 26, 2017 · Updated 9 years ago
- Assessing Disparate Impacts of Personalized Interventions: Identifiability and Bounds ☆11 · Oct 28, 2019 · Updated 6 years ago
- An implementation of the expectation-maximization algorithm for Gaussian mixture models ☆11 · Aug 24, 2018 · Updated 7 years ago
- ☆10 · Apr 13, 2023 · Updated 2 years ago
- MapReduce Streaming TSQR Implementation ☆17 · Jun 16, 2015 · Updated 10 years ago