Tensorflow implementation of proximal policy optimization (PPO) algorithm
☆13Feb 28, 2018Updated 8 years ago
Alternatives and similar repositories for PPO
Users that are interested in PPO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Proximal Policy Optimization implementation with TensorFlow☆108Oct 9, 2018Updated 7 years ago
- ☆17Nov 16, 2022Updated 3 years ago
- Proximal Policy Optimization with TensorFlow and OpenAI Gym☆18Mar 31, 2018Updated 8 years ago
- A C++ implementation of the asynchronous advantage actor-critic (A3C) algorithm☆23Mar 17, 2020Updated 6 years ago
- Deep Developmental Reinforcement Learning☆29Jul 1, 2020Updated 5 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Implementation of proximal policy optimization(PPO) with tensorflow☆35Feb 10, 2018Updated 8 years ago
- Reinforcement learning algorithms with Generalized Advantage Estimation☆22Jun 6, 2018Updated 7 years ago
- Turns FBX Animation file into a DeepMimic motion file☆12Apr 3, 2021Updated 5 years ago
- Learning how to ride a bicycle using reinforcement learning.☆13Dec 11, 2013Updated 12 years ago
- You can physically simulate a dove in this program which was developed for "Data-driven Control of Flapping Fight, ACM Transactions on Gr…☆12Dec 6, 2019Updated 6 years ago
- Dataset for Bilingual VLN☆11Dec 5, 2020Updated 5 years ago
- sample implementation of deeploco☆15Nov 5, 2018Updated 7 years ago
- Implementation of Trust Region Policy Optimization and Proximal Policy Optimization algorithms on the objective of Robot Walk.☆12Mar 9, 2021Updated 5 years ago
- Collection of Physics-based simulations☆67Jun 22, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- code for the paper "ADAPT: Vision-Language Navigation with Modality-Aligned Action Prompts" (CVPR 2022)☆10Jul 17, 2022Updated 3 years ago
- Alber Deep Learning☆12Sep 25, 2017Updated 8 years ago
- Implemenation of DDPG with numpy only (without Tensorflow)☆14Mar 4, 2018Updated 8 years ago
- Load motion capture data into Roboschool/MuJoCo☆17Oct 28, 2017Updated 8 years ago
- VSCode extension for viewing and testing a URDF file☆18Oct 12, 2018Updated 7 years ago
- Proximal Policy Optimization with Stein Control Variates:☆33Feb 12, 2018Updated 8 years ago
- Code accompanying our SIGGRAPH 2021 Technical Communications paper "Transition Motion Tensor: A Data-Driven Approach for Versatile and Co…☆12Dec 10, 2021Updated 4 years ago
- Official implementation of Why Only Text: Empowering Vision-and-Language Navigation with Multi-modal Prompts(IJCAI 2024)☆15Oct 16, 2024Updated last year
- Converting .bvh files to DeepMimic animations☆15Jan 23, 2021Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆15Nov 9, 2019Updated 6 years ago
- ☆10Nov 4, 2019Updated 6 years ago
- ITU-T Rec. P.1203 Codec Extension to VP9 and HEVC☆14Mar 16, 2020Updated 6 years ago
- Research project on reinforcement learning models for physics-based character locomotion☆21Dec 4, 2018Updated 7 years ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- A python interface for ODE to simulate robots☆21Dec 27, 2019Updated 6 years ago
- ☆10May 5, 2021Updated 5 years ago
- parse bvh file.☆14Dec 7, 2017Updated 8 years ago
- Code for CVPR22 paper One Step at a Time: Long-Horizon Vision-and-Language Navigation with Milestones☆13Jul 27, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- low-latency foveated video encoding☆14Jun 21, 2023Updated 2 years ago
- ☆13Sep 23, 2023Updated 2 years ago
- A preliminary platform for up to 1 million reinforcement learning agents☆11Aug 27, 2017Updated 8 years ago
- Automating human walk cycles using machine learning☆65Sep 26, 2014Updated 11 years ago
- drafts of LSRs I intend to file, am filing, or have filed as a legislator☆11Mar 21, 2026Updated 2 months ago
- 分布式调度系统,基于zookeeper ,netty,调度内核参考Spring schedule 执行表达式和Spring schedule一样,没有使用Quartz,客户端完全基于注解配置,使用同 Spring schedule一致,最少配置,使用简单☆14Feb 22, 2017Updated 9 years ago
- ☆13May 29, 2018Updated 7 years ago