MahanFathi/TRPO-TensorFlow

Trust Region Policy Optimization (TRPO) in pure TensorFlow

☆18

Alternatives and similar repositories for TRPO-TensorFlow

Users who are interested in TRPO-TensorFlow are comparing it to the libraries listed below.

CSKrishna / Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-setting
We use policy gradient to help agents learn optimal policies in a competitive multi-agent contextual bandit setting
☆12 · Mar 9, 2018 · Updated 8 years ago
gd-zhang / ACKTR
Actor Critic using Kronecker-Factored Trust Region
☆19 · Jul 3, 2018 · Updated 8 years ago
CSKrishna / Recommender-Systems-for-Implicit-Feedback-datasets
Matrix Factorization augmented with customer item meta data
☆22 · Nov 2, 2017 · Updated 8 years ago
LectureTracking / trackhd
An open-source, automated, lecture recording system that tracks the presenter in 4K video streams
☆12 · Sep 24, 2018 · Updated 7 years ago
hamidaucc / Link-state-protocol-in-python
This project implements the OSPF (Open Shortest Path First) network protocol in Python using Dijkstra's algorithm. Link-State Routing pr…
☆12 · Sep 1, 2017 · Updated 8 years ago
ischlag / tensorflow-input-pipelines
TensorFlow input pipelines for multiple datasets for easy data fetching
☆54 · Dec 19, 2016 · Updated 9 years ago
alec-g / rtsp_ffmpeg_ros
Read and decode RTSP video streams using FFMPEG and publish them as ROS images
☆10 · Feb 24, 2021 · Updated 5 years ago
xiaotianliu01 / DQN-with-MCTS-on-Adaptive-Bitrate
Official codes for "Training Deep Q-Network via Monte Carlo Tree Search for Adaptive Bitrate Control in Video Delivery"
☆10 · Jul 21, 2023 · Updated 3 years ago
oowekyala / a-maze-in-python
Maze generation & solving with Python
☆10 · Oct 2, 2021 · Updated 4 years ago
nateraw / encoded-video
Utilities for working with videos
☆13 · Jul 5, 2025 · Updated last year
facebookresearch / WhereDidMyOptimumGo
An Empirical Analysis of Gradient Descent Optimization in Policy Gradient Methods - EWRL Workshop 2018
☆16 · Oct 28, 2018 · Updated 7 years ago
microsoft / jackknife-variational-inference
Demonstration of Jackknife Variational Inference for Variational Autoencoders, related to ICLR 2018 paper.
☆22 · Feb 21, 2018 · Updated 8 years ago
wangbx66 / differentially-private-q-learning
☆13 · May 16, 2019 · Updated 7 years ago
pkumusic / E-DRL
Exploration Strategies for Deep Reinforcement Learning
☆39 · Oct 31, 2018 · Updated 7 years ago
pat-coady / trpo
Trust Region Policy Optimization with TensorFlow and OpenAI Gym
☆364 · Jun 2, 2020 · Updated 6 years ago
msaroufim / vscode-pytorch-extension
☆12 · Mar 28, 2023 · Updated 3 years ago
YingzhenLi / VRbound
code release for the NIPS 2016 paper
☆28 · Oct 21, 2016 · Updated 9 years ago
landskape-ai / imagenet
ImageNet training code that implements academic defaults
☆12 · Jul 15, 2021 · Updated 5 years ago
claudiom4sir / MdVRNet
[VISAPP 2022] MdVRNet: Deep Video Restoration under Multiple Distortions
☆12 · Aug 7, 2024 · Updated last year
blgpb / streaming-udp-video
a demo to transport video by UDP
☆12 · Jan 30, 2019 · Updated 7 years ago
Orange-OpenSource / GNBP
A fully trainable BP decoder, enabling the discovery of new parity check matrix through automatic learning
☆14 · Sep 26, 2022 · Updated 3 years ago
ab-anand / Video-Encryption
Encrypting videos using Space-Filling Curves 👨🏿‍💻
☆10 · Aug 30, 2024 · Updated last year
sajanglingala / data_adaptive_recon_MRI
Matlab demos for data adaptive dynamic and diffusion MRI
☆15 · Mar 8, 2021 · Updated 5 years ago
jiamings / ais
Annealed Importance Sampling (AIS) for generative models.
☆16 · Jul 20, 2018 · Updated 8 years ago
leonardblier / descriptionlengthdeeplearning
Experiments from "The Description Length of Deep Learning Models"
☆10 · Aug 1, 2018 · Updated 7 years ago
MG2033 / A2C
A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow
☆181 · Feb 10, 2019 · Updated 7 years ago
InsaneMonster / pasqualini2020prngrl
GitHub for the article Pseudo Random Number Generation through Reinforcement Learning and Recurrent Neural Networks (Luca Pasqualini and …
☆11 · Feb 18, 2021 · Updated 5 years ago
alito / mamele
Machine learning environment over MAME-supported games
☆15 · Apr 2, 2026 · Updated 3 months ago
YuhangSong / Arena-Baselines
Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.
☆103 · Mar 6, 2025 · Updated last year
ac-93 / soft-actor-critic
Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.
☆98 · Jun 22, 2020 · Updated 6 years ago
happypu326 / CoCo-MILP
This is the code of CoCo-MILP: Inter-Variable Contrastive and Intra-Constraint Competitive MILP Solution Prediction. AAAI 2026 Oral.
☆16 · May 13, 2026 · Updated 2 months ago
dhgrs / chainer-WGAN-GP
A Chainer implementation of WGAN-GP.
☆12 · Oct 4, 2017 · Updated 8 years ago
Butanium / monte-carlo-tree-search-TSP
Monte Carlo tree search for the travelling salesman problem (MCTS for the TSP)
☆12 · Jun 18, 2022 · Updated 4 years ago
wojzaremba / trpo
☆99 · Aug 15, 2016 · Updated 9 years ago
jhayes14 / black-box-attacks
Comparison of gradient estimation techniques for black-box adversarial examples
☆11 · Oct 31, 2018 · Updated 7 years ago
psygement / DRS
Deep recommendation system
☆13 · Dec 28, 2016 · Updated 9 years ago
qilong-zhang / Patch-wise-iterative-attack
Patch-wise iterative attack (accepted by ECCV 2020) to improve the transferability of adversarial examples.
☆94 · Mar 13, 2022 · Updated 4 years ago
dmelis / robust_interpret
Tools for robustness evaluation in interpretability methods
☆10 · Jun 25, 2021 · Updated 5 years ago
thomashirtz / noisy-networks
Minimal implementation of the network layers of the paper "Noisy Networks for Exploration" using Pytorch.
☆13 · Mar 15, 2025 · Updated last year