dnddnjs/mujoco-pg

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/dnddnjs/mujoco-pg)

dnddnjs / mujoco-pg

PyTorch implementation of Vanilla PG, TNPG, TRPO, PPO on Mujoco environment

☆15

Alternatives and similar repositories for mujoco-pg

Users that are interested in mujoco-pg are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

qingshi9974 / PPO-pytorch-Mujoco
View on GitHub
Implement PPO algorithm on mujoco environment，such as Ant-v2, Humanoid-v2, Hopper-v2, Halfcheeth-v2.
☆59Jun 30, 2020Updated 6 years ago
haje01 / distper
View on GitHub
Distributed Priortized Experience Replay
☆10Aug 8, 2018Updated 7 years ago
liloganle / GMM-EM
View on GitHub
关于混合高斯模型的期望最大算法的实现
☆11Aug 24, 2018Updated 7 years ago
zoeyuchao / MPE-pytorch
View on GitHub
This is MPE-pytorch, fix some bugs.
☆11Apr 26, 2020Updated 6 years ago
abaisero / gym-pomdps
View on GitHub
☆10Apr 13, 2023Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
zoeyuchao / maddpg-pytorch
View on GitHub
This is pytorch version of maddpg.
☆10Jun 23, 2020Updated 6 years ago
ReedZyd / GenerativeReturnDecomposition
View on GitHub
Source code for Interpretable Reward Redistribution in Reinforcement Learning: A Causal Approach (NeurIPS 2023)
☆10Dec 12, 2023Updated 2 years ago
xobx-cherif / Sumo-OpenStreetMap
View on GitHub
Sumo OSM short usage tutorial
☆15Feb 7, 2018Updated 8 years ago
maxreciprocate / offline
View on GitHub
Offline RL experiments
☆15Oct 1, 2022Updated 3 years ago
haje01 / impala
View on GitHub
Implement IMPALA architecture from Distributed Deep-RL Paper.
☆15Oct 18, 2018Updated 7 years ago
ASzot / imagination-augmented-agents-tf
View on GitHub
Imagination Augmented Agents TensorFlow
☆26Mar 30, 2020Updated 6 years ago
transcranial / stochastic-depth
View on GitHub
Implementation of Stochastic Depth Networks in Keras
☆13Sep 10, 2016Updated 9 years ago
chriscremer / Inference-Suboptimality
View on GitHub
Code for 'Inference Suboptimality in Variational Autoencoders'
☆11May 22, 2020Updated 6 years ago
RaghuHemadri / Reinforcement-Learning-Reading-List
View on GitHub
☆11Jul 14, 2021Updated 5 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
reinforcement-learning-kr / pg_travel
View on GitHub
Policy Gradient algorithms (REINFORCE, NPG, TRPO, PPO)
☆371Aug 1, 2019Updated 6 years ago
keon / cpp-pytorch
View on GitHub
C++ PyTorch Examples
☆10Aug 18, 2019Updated 6 years ago
wiseodd / compound-density-networks
View on GitHub
Implementation of: Kristiadi, Agustinus, and Asja Fischer. "Predictive Uncertainty Quantification with Compound Density Networks." (2019)…
☆16May 26, 2022Updated 4 years ago
sculd / algorithmic_intraday_trading
View on GitHub
☆11Oct 6, 2020Updated 5 years ago
holarissun / PCHID_code
View on GitHub
Code for [NeurIPS'2019 Spotlight] Policy Continuation with Hindsight Inverse Dynamics
☆15Jan 7, 2020Updated 6 years ago
tianluyuan / sphere
View on GitHub
Implementation of FB8, a generalization of the Kent (1982) and Bingham-Mardia (1978) distributions on a sphere
☆19Updated this week
ZhaozhiQIAN / SyncTwin-NeurIPS-2021
View on GitHub
Code for SyncTwin: Treatment Effect Estimation with Longitudinal Outcomes (NeurIPS 2021)
☆12Nov 30, 2021Updated 4 years ago
wut0n9 / Wechat_Stat
View on GitHub
统计微信朋友圈送出的赞票与得到的赞票人员比例
☆11May 3, 2016Updated 10 years ago
KennethanCeyer / fastcampus-mlops
View on GitHub
☆11Jun 14, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
tengxiao1 / SimPER
View on GitHub
SimPER: A Minimalist Approach to Preference Alignment without Hyperparameters (ICLR 2025)
☆17Aug 22, 2025Updated 11 months ago
shadow-robot / sr_core
View on GitHub
These are the core packages for the Shadow Robot hardware and simulation.
☆19Aug 19, 2024Updated last year
Jeonghwan-Cheon / lob-world-models
View on GitHub
Implementation of the paper <Model-based Reinforcement Learning for Predictions and Control for Limit Order Books (Wei et al., J.P. Morga…
☆12Aug 22, 2023Updated 2 years ago
srsohn / shortest-path-rl
View on GitHub
A public repo for ICML 2021 "Shortest-Path Constrained Reinforcement Learning for Sparse Reward Tasks"
☆13Jul 19, 2021Updated 5 years ago
divyahansg / RecurrentDPG
View on GitHub
CS234 Reinforcement Learning: Keras implementation of Recurrent Deterministic Policy Gradient (https://arxiv.org/abs/1512.04455)
☆10Jun 10, 2017Updated 9 years ago
YanranDing / OAMPC
View on GitHub
☆18Mar 2, 2025Updated last year
lizhuo-1994 / NECSA
View on GitHub
Official implementation of Neural Episodic Control with State Abstraction
☆13Aug 3, 2023Updated 2 years ago
ryanxhr / BEAR
View on GitHub
Pytorch implementation of BEAR in "Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction"
☆11Oct 29, 2019Updated 6 years ago
NingMiao / InteL-VAEs
View on GitHub
Codes for paper <InteL-VAEs: Adding Inductive Biases to VariationalAuto-Encoders via Intermediary Latents>.
☆18Jun 25, 2021Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
kashif / vq-tr
View on GitHub
VQ-TR repository
☆12Apr 18, 2024Updated 2 years ago
ethanliuzhuo / Neo4j_Knowledge_Graph_csv_import
View on GitHub
Neo4j 大规模三元组 CVS 导入进数据库
☆11Jul 31, 2020Updated 5 years ago
cloneofsimo / planning-with-diffusion-tutorial
View on GitHub
☆18Jun 15, 2023Updated 3 years ago
joelouismarino / variational_rl
View on GitHub
Variational Reinforcement Learning
☆18Jul 25, 2024Updated last year
holarissun / embedding-based-llm-alignment
View on GitHub
Codebase for Paper Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUs
☆22Apr 24, 2025Updated last year
AminSaqi / mydp
View on GitHub
My Data Provider: A minimal multi-exchange data providing project to feed trading algorithms/bots. Built with Python and FastAPI.
☆11May 30, 2024Updated 2 years ago
Hoshi-No-Ai / CMoE
View on GitHub
CMoE: Contrastive Mixture of Experts for Motion Control and Terrain Adaptation of Humanoid Robots
☆19Jun 24, 2026Updated last month