TensorFlow implementation of the proximal policy optimization (PPO) algorithm
☆13 · Feb 28, 2018 · Updated 7 years ago
Alternatives and similar repositories for PPO
Users that are interested in PPO are comparing it to the libraries listed below
- Proximal Policy Optimization implementation with TensorFlow ☆108 · Oct 9, 2018 · Updated 7 years ago
- ☆16 · Nov 16, 2022 · Updated 3 years ago
- Implementation of proximal policy optimization (PPO) with TensorFlow ☆35 · Feb 10, 2018 · Updated 8 years ago
- This repository contains tutorial material on Doing DeepRL with PPO in GDG DevFest 2017 Seoul. ☆22 · Nov 20, 2017 · Updated 8 years ago
- Reinforcement learning algorithms with Generalized Advantage Estimation ☆22 · Jun 6, 2018 · Updated 7 years ago
- Proximal Policy Optimization with Stein Control Variates ☆33 · Feb 12, 2018 · Updated 8 years ago
- Precision Knowledge Editing (PKE): A novel method to reduce toxicity in LLMs while preserving performance, with robust evaluations and ha… ☆11 · Nov 26, 2024 · Updated last year
- PyTorch implementation of NASA: NEURAL ARTICULATED SHAPE APPROXIMATION ☆12 · May 4, 2021 · Updated 4 years ago
- sample implementation of deeploco ☆15 · Nov 5, 2018 · Updated 7 years ago
- ☆10 · May 5, 2021 · Updated 4 years ago
- Simple implementation of an AABB Tree (Axis Aligned Bounding Box Tree) to optimize 3d collision detection ☆10 · Oct 22, 2024 · Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMs ☆11 · Jul 22, 2023 · Updated 2 years ago
- [Review] Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environment ☆10 · Dec 22, 2018 · Updated 7 years ago
- official implementation of RoSAS: Deep Semi-supervised Anomaly Detection with Contamination-resilient Continuous Supervision ☆11 · Jul 18, 2023 · Updated 2 years ago
- drafts of LSRs I intend to file, am filing, or have filed as a legislator ☆11 · Feb 3, 2026 · Updated 3 weeks ago
- Code repository for the paper on "Predicting the Performance of Black-Box LLMs through Self-Queries". ☆12 · Jan 9, 2025 · Updated last year
- Implementation of the Hierarchical and Interpretable Skill Acquisition in Multi-task Reinforcement Learning by Tianmin Shu, Caiming Xiong… ☆11 · Jun 18, 2018 · Updated 7 years ago
- Modified Beam Search with periodical restart ☆12 · Sep 12, 2024 · Updated last year
- Tools to rebuild a VOXEL-enabled server and client. ☆14 · Nov 11, 2021 · Updated 4 years ago
- Attend - to what matters. ☆17 · Feb 22, 2025 · Updated last year
- A PyTorch implementation of Pensieve (SIGCOMM'18) ☆12 · Jun 17, 2020 · Updated 5 years ago
- low-latency foveated video encoding ☆14 · Jun 21, 2023 · Updated 2 years ago
- Implementation of Trust Region Policy Optimization and Proximal Policy Optimization algorithms on the objective of Robot Walk. ☆12 · Mar 9, 2021 · Updated 4 years ago
- ☆12 · Apr 12, 2022 · Updated 3 years ago
- ☆11 · Mar 5, 2024 · Updated last year
- ☆13 · May 29, 2018 · Updated 7 years ago
- [ACL2023] Official code repository for VLN-Trans ☆14 · Sep 10, 2023 · Updated 2 years ago
- Save jpeg images in h5py ☆13 · May 1, 2019 · Updated 6 years ago
- code for the paper "ADAPT: Vision-Language Navigation with Modality-Aligned Action Prompts" (CVPR 2022) ☆10 · Jul 17, 2022 · Updated 3 years ago
- SpyGame: An interactive multi-agent framework to evaluate intelligence with large language models :D ☆15 · Nov 9, 2023 · Updated 2 years ago
- A preliminary platform for up to 1 million reinforcement learning agents ☆11 · Aug 27, 2017 · Updated 8 years ago
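- Distributed scheduling system based on ZooKeeper and Netty. The scheduling core follows Spring schedule and uses the same execution expressions; Quartz is not used. The client is configured entirely through annotations and is used the same way as Spring schedule, with minimal configuration and simple usage. ☆14 · Feb 22, 2017 · Updated 9 years ago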
- A short guide and example on how to fine-tune OpenAI's gpt-3.5-turbo for better roleplay ☆14 · Aug 26, 2023 · Updated 2 years ago
- ☆13 · Dec 16, 2024 · Updated last year
- ppo-lstm-parallel ☆49 · Mar 26, 2019 · Updated 6 years ago
- The simplest and most straightforward TensorFlow 2.0 implementation of a vanilla GAN ☆10 · Jul 6, 2019 · Updated 6 years ago
- A3C-LSTM algorithm tested on CartPole OpenAI Gym environment ☆48 · Jul 4, 2018 · Updated 7 years ago
- PyTorch implementation of Vanilla PG, TNPG, TRPO, PPO on MuJoCo environment ☆12 · Feb 22, 2019 · Updated 7 years ago
- Code repository of GreenABR for MMSys 2022 submission ☆14 · Apr 6, 2022 · Updated 3 years ago