Trust Region Policy Optimization (TRPO) in pure TensorFlow
☆18 · Jun 7, 2018 · Updated 7 years ago
Alternatives and similar repositories for TRPO-TensorFlow
Users that are interested in TRPO-TensorFlow are comparing it to the libraries listed below.
- Source code for the paper "Divergence-Augmented Policy Optimization" ☆37 · Nov 28, 2019 · Updated 6 years ago
- Actor Critic using Kronecker-Factored Trust Region ☆19 · Jul 3, 2018 · Updated 7 years ago
- ☆15 · Apr 20, 2020 · Updated 6 years ago
- TensorFlow input pipelines for multiple datasets for easy data fetching ☆54 · Dec 19, 2016 · Updated 9 years ago
- Maze generation & solving with Python ☆10 · Oct 2, 2021 · Updated 4 years ago
- Utilities for working with videos ☆13 · Jul 5, 2025 · Updated 10 months ago
- [IEEE TSIPN' 2022] "Scalable Perception-Action-Communication Loops with Convolutional and Graph Neural Networks", by Ting-Kuei Hu, Fernan… ☆15 · Feb 4, 2022 · Updated 4 years ago
- An Empirical Analysis of Gradient Descent Optimization in Policy Gradient Methods - EWRL Workshop 2018 ☆15 · Oct 28, 2018 · Updated 7 years ago
- ☆13 · May 16, 2019 · Updated 7 years ago
- ☆12 · Dec 7, 2017 · Updated 8 years ago
- Exploration Strategies for Deep Reinforcement Learning ☆39 · Oct 31, 2018 · Updated 7 years ago
- Panorama stitching of images or real-time video streams ☆10 · Aug 12, 2020 · Updated 5 years ago
- A Genetic Algorithms framework for Hadoop MapReduce. ☆10 · May 30, 2018 · Updated 7 years ago
- Trust Region Policy Optimization with TensorFlow and OpenAI Gym ☆362 · Jun 2, 2020 · Updated 5 years ago
- ☆11 · Nov 8, 2017 · Updated 8 years ago
- [VISAPP 2022] MdVRNet: Deep Video Restoration under Multiple Distortions ☆12 · Aug 7, 2024 · Updated last year
- a demo to transport video by UDP ☆12 · Jan 30, 2019 · Updated 7 years ago
- Annealed Importance Sampling (AIS) for generative models. ☆16 · Jul 20, 2018 · Updated 7 years ago
- Experiments from "The Description Length of Deep Learning Models" ☆10 · Aug 1, 2018 · Updated 7 years ago
- code release for the NIPS 2016 paper ☆27 · Oct 21, 2016 · Updated 9 years ago
- ICADCML 2021 A Novel Approach to Encrypt Data using Deep Neural Networks ☆13 · Mar 25, 2023 · Updated 3 years ago
- A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow ☆182 · Feb 10, 2019 · Updated 7 years ago
- Official codes for "Training Deep Q-Network via Monte Carlo Tree Search for Adaptive Bitrate Control in Video Delivery" ☆10 · Jul 21, 2023 · Updated 2 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020. ☆103 · Mar 6, 2025 · Updated last year
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations. ☆99 · Jun 22, 2020 · Updated 5 years ago
- A Chainer implementation of WGAN-GP. ☆12 · Oct 4, 2017 · Updated 8 years ago
- A fully trainable BP decoder, enabling the discovery of new parity check matrix through automatic learning ☆14 · Sep 26, 2022 · Updated 3 years ago
- ☆99 · Aug 15, 2016 · Updated 9 years ago
- Parallel Particle Swarm Optimizer on the Spark Clustering Computing Platform. ☆12 · Oct 29, 2018 · Updated 7 years ago
- Patch-wise iterative attack (accepted by ECCV 2020) to improve the transferability of adversarial examples. ☆94 · Mar 13, 2022 · Updated 4 years ago
- Tools for robustness evaluation in interpretability methods ☆10 · Jun 25, 2021 · Updated 4 years ago
- Manually transpilated C++ code from ns-3 manet routing example to Python. ☆13 · Dec 17, 2017 · Updated 8 years ago
- A TensorFlow implementation of the NIPS 2018 paper "Variational Inference with Tail-adaptive f-Divergence" ☆20 · Jan 11, 2019 · Updated 7 years ago
- Minimal implementation of the network layers of the paper "Noisy Networks for Exploration" using PyTorch. ☆13 · Mar 15, 2025 · Updated last year
- Implementation of safe offline bandit algorithms. ☆10 · Oct 27, 2019 · Updated 6 years ago
- Reinforcement Learning for Bit Flipping decoding of linear codes ☆14 · Sep 12, 2020 · Updated 5 years ago
- Code release for the paper "Calibrating Energy-based Generative Adversarial Networks" ☆24 · Oct 31, 2017 · Updated 8 years ago
- Code for the RL method MATD3 described in the paper "Reducing Overestimation Bias in Multi-Agent Domains Using Double Centralized Critics… ☆95 · Nov 30, 2020 · Updated 5 years ago
- image classification via video input, frame-by-frame ☆18 · Aug 11, 2017 · Updated 8 years ago