self implementation of DPPO, Distributed Proximal Policy Optimization, by using tensorflow
☆12Sep 1, 2017Updated 8 years ago
Alternatives and similar repositories for Tensorflow-DPPO
Users that are interested in Tensorflow-DPPO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Pytorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286☆184Mar 25, 2018Updated 7 years ago
- This repository contains my MSc dissertation project. Iti s an implementation of a streaming GMM algorithm in Spark.☆11Aug 25, 2018Updated 7 years ago
- ☆30Oct 18, 2017Updated 8 years ago
- Matlab code for basic gait generator for students☆10Sep 25, 2020Updated 5 years ago
- This is an pytorch implementation of Distributed Proximal Policy Optimization(DPPO).☆63Jul 30, 2018Updated 7 years ago
- CS234 Reinforcement Learning: Keras implementation of Recurrent Deterministic Policy Gradient (https://arxiv.org/abs/1512.04455)☆10Jun 10, 2017Updated 8 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019)☆17Dec 8, 2022Updated 3 years ago
- Generate compile_commands.json and run clang-tidy with Bazel☆18Jun 23, 2019Updated 6 years ago
- Recurrent Network-based Deterministic Policy Gradient for Solving Bipedal Walking Challenge on Rugged Terrains☆12Oct 16, 2017Updated 8 years ago
- Finite State Machine Designer☆12Nov 17, 2017Updated 8 years ago
- Training a deep FCN network in PyTorch to route circuit layouts☆67Dec 7, 2022Updated 3 years ago
- Policy Optimization with Penalized Point Probability Distance: an Alternative to Proximal Policy Optimization☆44Nov 8, 2018Updated 7 years ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- DDPG on OpenAI Gym Pendulum☆17Jul 1, 2016Updated 9 years ago
- Official Implementation of paper https://arxiv.org/abs/1801.02612☆13Jun 16, 2020Updated 5 years ago
- FFT Explorations (basic implementation)☆10Aug 8, 2014Updated 11 years ago
- Replicating Imagination-Augmented Agents for Deep Reinforcement Learning☆20Dec 17, 2017Updated 8 years ago
- WuBu Nesting Playground, Inspired by XJDR Entropy, Now Hyperbolic Math Focused☆26Mar 9, 2026Updated 2 weeks ago
- GEarthView plugin for QGis☆12Apr 11, 2018Updated 7 years ago
- Qt4 & Visual Studio 2015 (vc14).☆13Nov 20, 2016Updated 9 years ago
- AC3ESBrowser is a tool for analyzing ac3/eac3 bitstreams☆12Feb 27, 2015Updated 11 years ago
- My experimentations with Reinforcement Learning in Pytorch☆20May 18, 2017Updated 8 years ago
- A free and open-source GUI tool that simplifies combining multiple code files into one, with automatic labeling and support for various p…☆14Jan 3, 2025Updated last year
- ☆10Jun 5, 2021Updated 4 years ago
- Experiments on Data Poisoning Regression Learning☆12Oct 5, 2020Updated 5 years ago
- ☆42Oct 19, 2021Updated 4 years ago
- msgpack-rpc + α for JavaScript language☆13Mar 8, 2022Updated 4 years ago
- ☆12Oct 5, 2022Updated 3 years ago
- Implementation of ICLR 2018 paper "Loss-aware Weight Quantization of Deep Networks"☆27Oct 24, 2019Updated 6 years ago
- QLoRA: Efficient Finetuning of Quantized LLMs☆11Jul 22, 2023Updated 2 years ago
- Voice Music Separation competing for 6th Huawei Cup in ZJU☆11Jun 2, 2015Updated 10 years ago
- ☆18Jan 14, 2016Updated 10 years ago
- Precision Knowledge Editing (PKE): A novel method to reduce toxicity in LLMs while preserving performance, with robust evaluations and ha…☆11Nov 26, 2024Updated last year
- Maddpg_flight code☆11Jul 4, 2018Updated 7 years ago
- An RPG Maker MZ plugin☆12Nov 2, 2023Updated 2 years ago
- ☆54Jan 31, 2019Updated 7 years ago
- Attend - to what matters.☆17Feb 22, 2025Updated last year
- Proof of concept code for DeepSteal (SP'22) Machine Learning model extraction (weight stealing) with memory side channel☆13Jun 22, 2023Updated 2 years ago
- Develop more, Code less. Propeller integration with Django. Propeller is a front-end responsive framework based on Google's Material Desi…☆31Jan 12, 2026Updated 2 months ago