pfnet-research/capg

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/pfnet-research/capg)

pfnet-research / capg

Implementation of clipped action policy gradient (CAPG) with PPO and TRPO

☆31

Alternatives and similar repositories for capg

Users that are interested in capg are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

suyoung-lee / Episodic-Backward-Update
View on GitHub
Implementation of "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update", NeurIPS 2019.
☆16Sep 24, 2019Updated 6 years ago
vub-ai-lab / bdpi
View on GitHub
Sample-Efficient Reinforcement Learning with Bootstrapped Dual Policy Iteration
☆25Sep 9, 2019Updated 6 years ago
ccthien / MetalWarfareML
View on GitHub
Metal Warfare game for ML-Agents challenge
☆18Feb 24, 2018Updated 8 years ago
floringogianu / snrl
View on GitHub
Code-base for the paper Spectral Normalisation for Deep Reinforcement Learning: An Optimisation Perspective.
☆11Jun 26, 2021Updated 5 years ago
illidanlab / rpg
View on GitHub
Ranking Policy Gradient
☆23Nov 27, 2019Updated 6 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
Breakend / ReproducibilityInContinuousPolicyGradientMethods
View on GitHub
These are experiments for examining reproducibility in Policy Gradient RL algorithms in Continuous domains. Mainly using the Rllab implem…
☆17Sep 20, 2017Updated 8 years ago
holken / polite
View on GitHub
code for polite
☆12Feb 28, 2024Updated 2 years ago
compsciencelab / ppo_D
View on GitHub
This is the official repository for the paper "Guided Exploration with Proximal Policy Optimization using a Single Demonstration", https:…
☆19Oct 5, 2021Updated 4 years ago
cosmoharrigan / rc-nfq
View on GitHub
RC-NFQ: Regularized Convolutional Neural Fitted Q Iteration. A batch algorithm for deep reinforcement learning. Incorporates dropout regu…
☆12Mar 17, 2021Updated 5 years ago
snt-robotics / denmpc
View on GitHub
An event-based on-line adaptable fast nonlinear model predictive control framework
☆25Oct 29, 2018Updated 7 years ago
mitmul / tfchain
View on GitHub
Run a static part of the computational graph written in Chainer with Tensorflow
☆20Jan 10, 2017Updated 9 years ago
felrock / PyRacecarSimulator
View on GitHub
MIT racecar_simulator ported to python and speeded up using GPU ray marching
☆20May 25, 2020Updated 6 years ago
voot-t / guide-actor-critic
View on GitHub
Keras implementation of guide actor-critic for continuous control
☆11Mar 12, 2018Updated 8 years ago
tonysy / CapsuleNet-PyTorch
View on GitHub
Implemention of CapsNet from the paper Dynamic Routing Between Capsules
☆10Nov 7, 2017Updated 8 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
illidanlab / cdrl
View on GitHub
Collaborative Deep Reinforcement Learning
☆32Jul 29, 2017Updated 8 years ago
musyoku / papers
View on GitHub
実装するリスト
☆10Dec 21, 2017Updated 8 years ago
ChunyuanLI / pSGLD
View on GitHub
AAAI & CVPR 2016: Preconditioned Stochastic Gradient Langevin Dynamics (pSGLD)
☆36Sep 16, 2018Updated 7 years ago
voot-t / vild_code
View on GitHub
Source code of "Variational Imitation Learning with Diverse-quality Demonstrations" in ICML 2020. This github repository includes python …
☆20Aug 16, 2021Updated 4 years ago
erfanvaredi / diabetic-retinal-classification
View on GitHub
Diabetic classification based on retinal images
☆11Aug 26, 2019Updated 6 years ago
eringrant / spirl-readings
View on GitHub
A collection of reading material for the Workshop on "Structure & Priors in Reinforcement Learning" (SPiRL) at ICLR 2019.
☆13May 5, 2021Updated 5 years ago
Breakend / DeepReinforcementLearningThatMatters
View on GitHub
Accompanying code for "Deep Reinforcement Learning that Matters"
☆154Sep 22, 2017Updated 8 years ago
initial-h / FlappyBird_DQN_with_target_network
View on GitHub
DQN with freezing target network in tensorflow on pygame FlappyBird
☆11Dec 19, 2018Updated 7 years ago
quanvuong / Supervised_Policy_Update
View on GitHub
Code to reproduce Supervised Policy Update (ICLR 2019)
☆17Dec 8, 2022Updated 3 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
sparisi / td-reg
View on GitHub
TD-Regularized Actor-Critic Methods
☆37Dec 26, 2019Updated 6 years ago
tgangwani / SelfImitationDiverse
View on GitHub
Tensorflow code for "Learning Self-Imitating Diverse Policies" (ICLR 2019)
☆20Nov 26, 2020Updated 5 years ago
mariacer / strong_dfc
View on GitHub
Minimizing Control for Credit Assignment with Strong Feedback
☆14Nov 3, 2024Updated last year
TomZahavy / CB_AE_DQN
View on GitHub
Contextual Bandits Action Elimination DQN
☆21Jun 25, 2018Updated 8 years ago
kazuki-shin / lidar-montecarlo-pathplanning
View on GitHub
CS 598 Final Project: Self Driving using Path Planning with Monte Carlo Tree Search on Lidar Data
☆26Oct 6, 2020Updated 5 years ago
banma12956 / HIPI-RL
View on GitHub
☆10Jun 22, 2020Updated 6 years ago
tyunist / memory_efficient_mish_swish
View on GitHub
A memory efficient implementation of custom SWISH and MISH activation functions in Pytorch
☆12Jun 29, 2020Updated 6 years ago
lionelblonde / sam-tf
View on GitHub
TensorFlow implementation of "Sample-efficient Imitation Learning via Generative Adversarial Nets"
☆10Dec 8, 2022Updated 3 years ago
chscheller / minerl_agent
View on GitHub
3rd placed submission to the NeurIPS MineRL competition 2019
☆10Mar 24, 2023Updated 3 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
karthikncode / Grounded-RL-Transfer
View on GitHub
☆13Dec 6, 2018Updated 7 years ago
KAIST-AILab / gmmil
View on GitHub
Contains an implementation of "Imitation Learning via Kernel Mean Embedding (2018, AAAI)"
☆11Oct 2, 2018Updated 7 years ago
ml-lab-cuny / menge_ros
View on GitHub
Crowd simulation tool for robot navigation
☆41Feb 20, 2020Updated 6 years ago
jren03 / garage
View on GitHub
⚡️ Shockingly fast imitation learning algorithms via combining online and offline data engines. ⚡️
☆14Sep 1, 2025Updated 10 months ago
pfnet-research / menoh-haskell
View on GitHub
Haskell binding for Menoh DNN inference library
☆12Nov 30, 2018Updated 7 years ago
lafmdp / HIDIL
View on GitHub
[NeurIPS'20] Code for the paper "Offline Imitation Learning with a Misspecified Simulator"
☆12Nov 24, 2021Updated 4 years ago
rll-research / finetune-vs-metarl
View on GitHub
☆14May 31, 2022Updated 4 years ago