PyTorch implementation of Vanilla PG, TNPG, TRPO, PPO on Mujoco environment
☆15Jul 1, 2018Updated 7 years ago
Alternatives and similar repositories for mujoco-pg
Users that are interested in mujoco-pg are comparing it to the libraries listed below.
- Distributed Prioritized Experience Replay☆10Aug 8, 2018Updated 7 years ago
- MPE-pytorch, with some bugs fixed.☆11Apr 26, 2020Updated 6 years ago
- Mujoco xml model for the Fetch Robotics Freight mobile base + Panda arm☆11Nov 29, 2023Updated 2 years ago
- ☆10Apr 13, 2023Updated 3 years ago
- Robust policy search algorithms which train on model ensembles☆31Oct 26, 2016Updated 9 years ago
- Asimov Humanoid Locomotion☆53Dec 18, 2025Updated 4 months ago
- ☆47Apr 11, 2026Updated 3 weeks ago
- Semantic-Aware Fine-Grained Correspondence, at ECCV 2022 (Oral)☆14Oct 29, 2022Updated 3 years ago
- Offline RL experiments☆15Oct 1, 2022Updated 3 years ago
- Implement IMPALA architecture from Distributed Deep-RL Paper.☆15Oct 18, 2018Updated 7 years ago
- Quadrupedal locomotion for slippery terrains, Mujoco and Real Go1-Go2 Unitree's.☆18Mar 26, 2025Updated last year
- Implementation of Stochastic Depth Networks in Keras☆13Sep 10, 2016Updated 9 years ago
- Code for 'Inference Suboptimality in Variational Autoencoders'☆10May 22, 2020Updated 5 years ago
- Policy Gradient algorithms (REINFORCE, NPG, TRPO, PPO)☆372Aug 1, 2019Updated 6 years ago
- ☆18Mar 2, 2025Updated last year
- ☆11Jul 14, 2021Updated 4 years ago
- C++ PyTorch Examples☆10Aug 18, 2019Updated 6 years ago
- ☆11Oct 6, 2020Updated 5 years ago
- A Qwen .5B reasoning model trained on OpenR1-Math-220k☆14Oct 11, 2025Updated 6 months ago
- Implementation of: Kristiadi, Agustinus, and Asja Fischer. "Predictive Uncertainty Quantification with Compound Density Networks." (2019)…☆16May 26, 2022Updated 3 years ago
- ☆18Jun 15, 2023Updated 2 years ago
- An implementation of Short Horizon Actor Critic written in Jax. Core algorithm written in the style of Brax, with several bits taken from …☆22Nov 4, 2024Updated last year
- Official Codebase for "Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control" (NeurIPS 2024)☆15Oct 29, 2024Updated last year
- Statistics on the ratio of likes given to likes received per person in WeChat Moments☆11May 3, 2016Updated 10 years ago
- Code for [NeurIPS'2019 Spotlight] Policy Continuation with Hindsight Inverse Dynamics☆15Jan 7, 2020Updated 6 years ago
- LS Securities OpenApi samples☆16May 9, 2025Updated 11 months ago
- Implementation of the paper <Model-based Reinforcement Learning for Predictions and Control for Limit Order Books (Wei et al., J.P. Morga…☆11Aug 22, 2023Updated 2 years ago
- Implementation of FB8, a generalization of the Kent (1982) and Bingham-Mardia (1978) distributions on a sphere☆19Apr 8, 2026Updated 3 weeks ago
- Official repo for vidar and vidarc: video foundation model for robotics.☆40Dec 22, 2025Updated 4 months ago
- Provide full reinforcement learning benchmark on mujoco environments, including ddpg, sac, td3, pg, a2c, ppo, library☆88Apr 29, 2021Updated 5 years ago
- Code for SyncTwin: Treatment Effect Estimation with Longitudinal Outcomes (NeurIPS 2021)☆12Nov 30, 2021Updated 4 years ago
- A public repo for ICML 2021 "Shortest-Path Constrained Reinforcement Learning for Sparse Reward Tasks"☆13Jul 19, 2021Updated 4 years ago
- CS234 Reinforcement Learning: Keras implementation of Recurrent Deterministic Policy Gradient (https://arxiv.org/abs/1512.04455)☆10Jun 10, 2017Updated 8 years ago
- These are the core packages for the Shadow Robot hardware and simulation.☆19Aug 19, 2024Updated last year
- SimPER: A Minimalist Approach to Preference Alignment without Hyperparameters (ICLR 2025)☆17Aug 22, 2025Updated 8 months ago
- Deep Learning the Sorting Algorithm☆12Dec 11, 2016Updated 9 years ago
- A news based stock scalper using LLM and quant approach☆15Jan 16, 2025Updated last year
- ☆10Apr 2, 2018Updated 8 years ago
- GPU-Accelerated Trajectory Optimization☆36Feb 26, 2026Updated 2 months ago