ikostrikov/pytorch-ddpg-naf

Implementation of algorithms for continuous control (DDPG and NAF).

☆313

Alternatives and similar repositories for pytorch-ddpg-naf

Users who are interested in pytorch-ddpg-naf are comparing it to the libraries listed below.

ghliu / pytorch-ddpg
Implementation of the Deep Deterministic Policy Gradient (DDPG) using PyTorch
☆632 · Aug 13, 2018 · Updated 7 years ago
ikostrikov / pytorch-trpo
PyTorch implementation of Trust Region Policy Optimization
☆448 · Sep 13, 2018 · Updated 7 years ago
ikostrikov / pytorch-rl
☆58 · Aug 28, 2018 · Updated 7 years ago
floringogianu / categorical-dqn
A working implementation of the Categorical DQN (Distributional RL).
☆95 · Apr 7, 2018 · Updated 8 years ago
ajgupta93 / d4pg-pytorch
In Progress : State of the art Distributed Distributional Deep Deterministic Policy Gradient algorithm implementation in pytorch.
☆19 · Jun 15, 2018 · Updated 8 years ago
ikostrikov / pytorch-a3c
PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".
☆1,328 · Sep 25, 2019 · Updated 6 years ago
ikostrikov / pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinfor…
☆3,903 · May 29, 2022 · Updated 4 years ago
ikostrikov / pytorch-meta-optimizer
A PyTorch implementation of Learning to learn by gradient descent by gradient descent
☆315 · Aug 27, 2018 · Updated 7 years ago
dgriff777 / rl_a3c_pytorch
A3C LSTM Atari with Pytorch plus A3G design
☆566 · Apr 18, 2023 · Updated 3 years ago
jingweiz / pytorch-dnc
Neural Turing Machine (NTM) & Differentiable Neural Computer (DNC) with pytorch & visdom
☆279 · Feb 20, 2018 · Updated 8 years ago
wojzaremba / trpo
☆99 · Aug 15, 2016 · Updated 9 years ago
vy007vikas / PyTorch-ActorCriticRL
PyTorch implementation of DDPG algorithm for continuous action reinforcement learning problem.
☆422 · Mar 17, 2021 · Updated 5 years ago
jingweiz / pytorch-distributed
Ape-X DQN & DDPG with pytorch & tensorboard
☆102 · Jun 18, 2019 · Updated 7 years ago
Kaixhin / ACER
Actor-critic with experience replay
☆257 · Oct 9, 2022 · Updated 3 years ago
jingweiz / pytorch-rl
Deep Reinforcement Learning with pytorch & visdom
☆802 · Jul 16, 2020 · Updated 5 years ago
zoeyuchao / maddpg-pytorch
This is pytorch version of maddpg.
☆10 · Jun 23, 2020 · Updated 6 years ago
rail-berkeley / rlkit
Collection of reinforcement learning algorithms
☆2,915 · Jun 17, 2024 · Updated 2 years ago
rll / rllab
rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.
☆3,069 · Jun 10, 2023 · Updated 3 years ago
miyosuda / episodic_control
Model-Free Episodic Control
☆14 · Jan 12, 2017 · Updated 9 years ago
andrewliao11 / pytorch-a3c-mujoco
Implement A3C for Mujoco gym envs
☆73 · Nov 2, 2017 · Updated 8 years ago
floodsung / DDPG
Reimplementation of DDPG(Continuous Control with Deep Reinforcement Learning) based on OpenAI Gym + Tensorflow
☆575 · Sep 28, 2021 · Updated 4 years ago
carpedm20 / NAF-tensorflow
"Continuous Deep Q-Learning with Model-based Acceleration" in TensorFlow
☆192 · Jul 20, 2018 · Updated 7 years ago
johanobandoc / revisiting_rainbow
Revisiting Rainbow
☆76 · Jun 9, 2021 · Updated 5 years ago
ypxie / pytorch-NeuCom
Pytorch implementation of DeepMind's differentiable neural computer paper.
☆91 · Dec 4, 2017 · Updated 8 years ago
awjuliani / Meta-RL
Implementation of Meta-RL A3C algorithm
☆407 · Feb 22, 2017 · Updated 9 years ago
haarnoja / softqlearning
Reinforcement Learning with Deep Energy-Based Policies
☆438 · Nov 28, 2023 · Updated 2 years ago
denisyarats / pytorch_sac_ae
PyTorch implementation of Soft Actor-Critic + Autoencoder(SAC+AE)
☆257 · May 3, 2020 · Updated 6 years ago
Khrylx / PyTorch-RL
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Lear…
☆1,286 · Feb 9, 2021 · Updated 5 years ago
joschu / modular_rl
Implementation of TRPO and related algorithms
☆653 · May 20, 2018 · Updated 8 years ago
Kaixhin / PlaNet
Deep Planning Network: Control from pixels by latent planning with learned dynamics
☆377 · Oct 15, 2021 · Updated 4 years ago
moskomule / pytorch.rl.learning
for learning reinforcement learning using PyTorch.
☆64 · Oct 2, 2019 · Updated 6 years ago
junhyukoh / value-prediction-network
NIPS 2017 Value Prediction Network
☆166 · Jan 12, 2018 · Updated 8 years ago
tilarids / reinforcement_learning_playground
Playground for reinforcement learning algorithms implemented in TensorFlow
☆16 · Oct 18, 2016 · Updated 9 years ago
mcgillmrl / robot_learning
ROS package for robot learning
☆17 · Oct 16, 2019 · Updated 6 years ago
dgriff777 / a3c_continuous
A continuous action space version of A3C LSTM in pytorch plus A3G design
☆259 · Oct 11, 2024 · Updated last year
iassael / learning-to-communicate
Learning to Communicate with Deep Multi-Agent Reinforcement Learning
☆449 · Feb 21, 2019 · Updated 7 years ago
alexis-jacq / Pytorch-DPPO
Pytorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286
☆184 · Mar 25, 2018 · Updated 8 years ago
zuoxingdong / VIN_PyTorch_Visdom
PyTorch implementation of Value Iteration Networks (VIN): Clean, Simple and Modular. Visualization in Visdom.
☆223 · Mar 29, 2017 · Updated 9 years ago
ShangtongZhang / DeepRL
Modularized Implementation of Deep RL Algorithms in PyTorch
☆3,429 · Apr 16, 2024 · Updated 2 years ago