Implementation of proximal policy optimization(PPO) with tensorflow
☆35Feb 10, 2018Updated 8 years ago
Alternatives and similar repositories for ppo_tf
Users that are interested in ppo_tf are comparing it to the libraries listed below
Sorting:
- Proximal Policy Optimization implementation with TensorFlow☆108Oct 9, 2018Updated 7 years ago
- Proximal Policy Optimization with TensorFlow and OpenAI Gym☆18Mar 31, 2018Updated 7 years ago
- Proximal Policy Optimization with Stein Control Variates:☆33Feb 12, 2018Updated 8 years ago
- My implementation of the Proximal Policy Optisation algorithm using Keras as a backend☆88Nov 15, 2019Updated 6 years ago
- Reinforcement learning algorithms with Generalized Advantage Estimation☆22Jun 6, 2018Updated 7 years ago
- Tensorflow implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".☆25Apr 20, 2017Updated 8 years ago
- Meta-Reinforcement Learning with Policy Residual Representation☆11Aug 15, 2019Updated 6 years ago
- Integrating opencv with mujoco.☆11Mar 25, 2025Updated 11 months ago
- Precision Knowledge Editing (PKE): A novel method to reduce toxicity in LLMs while preserving performance, with robust evaluations and ha…☆11Nov 26, 2024Updated last year
- A free and open-source GUI tool that simplifies combining multiple code files into one, with automatic labeling and support for various p…☆14Jan 3, 2025Updated last year
- Template for barebones flask projects☆11May 2, 2023Updated 2 years ago
- Reinforcement Learning for robotics continuous control, mainly based on Proximal Policy Optimization, extending to Interpolated Policy Gr…☆38Feb 5, 2019Updated 7 years ago
- ☆14Aug 18, 2023Updated 2 years ago
- ☆10May 28, 2018Updated 7 years ago
- Implementation of the Hierarchical and Interpretable Skill Acquisition in Multi-task Reinforcement Learning by Tianmin Shu, Caiming Xiong…☆11Jun 18, 2018Updated 7 years ago
- ☆16Mar 14, 2025Updated 11 months ago
- Simple implementation of an AABB Tree (Axis Aligned Bounding Box Tree) to optimize 3d collision detection☆10Oct 22, 2024Updated last year
- Sequential Monte Carlo sampler for PyMC2 models.☆13Apr 4, 2018Updated 7 years ago
- official implementation of RoSAS: Deep Semi-supervised Anomaly Detection with Contamination-resilient Continuous Supervision☆11Jul 18, 2023Updated 2 years ago
- ☆10Oct 15, 2020Updated 5 years ago
- QLoRA: Efficient Finetuning of Quantized LLMs☆11Jul 22, 2023Updated 2 years ago
- Reimplementation of simple policy gradient algorithms such as REINFORCE and Actor-Critic methods.☆16Aug 26, 2023Updated 2 years ago
- [Review] Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environment☆10Dec 22, 2018Updated 7 years ago
- Policy Optimization with Penalized Point Probability Distance: an Alternative to Proximal Policy Optimization☆44Nov 8, 2018Updated 7 years ago
- Implementation of safe offline bandit algorithms.☆10Oct 27, 2019Updated 6 years ago
- Meta Reinforcement Learning Experiments☆35Aug 22, 2017Updated 8 years ago
- Official codebase for our paper "Joslim: Joint Widths and Weights Optimization for Slimmable Neural Networks"☆12Jun 30, 2021Updated 4 years ago
- A Python library for parsing OSM streams.☆15May 8, 2021Updated 4 years ago
- Code repository for the paper on "Predicting the Performance of Black-Box LLMs through Self-Queries".☆12Jan 9, 2025Updated last year
- An attempt to reverse engineer custom file formats used by the game Outlaws from LucasArts.☆16Nov 3, 2018Updated 7 years ago
- Examples of published reinforcement learning algorithms in recent literature implemented in TensorFlow☆103Aug 3, 2020Updated 5 years ago
- A PyTorch implement of Dilated RNN☆11Dec 31, 2017Updated 8 years ago
- This repository contains implementation of A2C with GAE, which is used to control robot in MuJoCo environment.☆10Jan 6, 2020Updated 6 years ago
- Code for our SIGIR'2017 paper "Neural Rating Regression with Abstractive Tips Generation for Recommendation"☆14Jul 24, 2020Updated 5 years ago
- These are my learning algorithm solutions to OpenAI Gym environments.☆11May 9, 2017Updated 8 years ago
- A bottom-up model for the simulation of heat demand profiles of urban areas☆13Dec 11, 2023Updated 2 years ago
- ☆11Mar 5, 2024Updated last year
- [WIP] Playing Hard Exploration Games by Watching YouTube (Aytar et al., 2018)☆12Jan 31, 2019Updated 7 years ago
- send and receive message and file by python3 socket☆12May 24, 2018Updated 7 years ago