PyTorch implementation of Vanilla PG, TNPG, TRPO, PPO on Mujoco environment
☆15Jul 1, 2018Updated 7 years ago
Alternatives and similar repositories for mujoco-pg
Users that are interested in mujoco-pg are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implement PPO algorithm on mujoco environment,such as Ant-v2, Humanoid-v2, Hopper-v2, Halfcheeth-v2.☆59Jun 30, 2020Updated 5 years ago
- Distributed Priortized Experience Replay☆10Aug 8, 2018Updated 7 years ago
- ☆10Apr 13, 2023Updated 3 years ago
- Robust policy search algorithms which train on model ensembles☆31Oct 26, 2016Updated 9 years ago
- This is pytorch version of maddpg.☆10Jun 23, 2020Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Source code for Interpretable Reward Redistribution in Reinforcement Learning: A Causal Approach (NeurIPS 2023)☆10Dec 12, 2023Updated 2 years ago
- Asimov Humanoid Locomotion☆54Dec 18, 2025Updated 5 months ago
- Sumo OSM short usage tutorial☆15Feb 7, 2018Updated 8 years ago
- ☆46Apr 11, 2026Updated last month
- Collect orderbook data from crypto exchanges and publish as GRPC☆13Jun 19, 2022Updated 3 years ago
- 关于混合高斯模型的期望最大算法的实现☆11Aug 24, 2018Updated 7 years ago
- Offline RL experiments☆15Oct 1, 2022Updated 3 years ago
- Implement IMPALA architecture from Distributed Deep-RL Paper.☆15Oct 18, 2018Updated 7 years ago
- Quadrupedal locomotion for slippery terrains, Mujoco and Real Go1-Go2 Unitree's.☆18Mar 26, 2025Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- A repository for controlling actuators☆24Sep 15, 2025Updated 8 months ago
- Code for 'Inference Suboptimality in Variational Autoencoders'☆10May 22, 2020Updated 6 years ago
- ☆18Mar 2, 2025Updated last year
- ☆11Jul 14, 2021Updated 4 years ago
- C++ PyTorch Examples☆10Aug 18, 2019Updated 6 years ago
- A Qwen .5B reasoning model trained on OpenR1-Math-220k☆14Oct 11, 2025Updated 7 months ago
- Implementation of: Kristiadi, Agustinus, and Asja Fischer. "Predictive Uncertainty Quantification with Compound Density Networks." (2019)…☆16May 26, 2022Updated 3 years ago
- An implementation of Short Horizon Actor Critic writen in Jax. Core algorithm written in the style of Brax, with several bits taken from …☆23Nov 4, 2024Updated last year
- 统计微信朋友圈送出的赞票与得到的赞票人员比例☆11May 3, 2016Updated 10 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- LS증권 OpenApi 샘플☆17May 9, 2025Updated last year
- Implementation of FB8, a generalization of the Kent (1982) and Bingham-Mardia (1978) distributions on a sphere☆19Apr 8, 2026Updated last month
- Code for SyncTwin: Treatment Effect Estimation with Longitudinal Outcomes (NeurIPS 2021)☆12Nov 30, 2021Updated 4 years ago
- Official repo for vidar and vidarc: video foundation model for robotics.☆40Dec 22, 2025Updated 5 months ago
- Provide full reinforcement learning benchmark on mujoco environments, including ddpg, sac, td3, pg, a2c, ppo, library☆88Apr 29, 2021Updated 5 years ago
- A public repo for ICML 2021 "Shortest-Path Constrained Reinforcement Learning for Sparse Reward Tasks"☆13Jul 19, 2021Updated 4 years ago
- Pytorch implementation of BEAR in "Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction"☆11Oct 29, 2019Updated 6 years ago
- CS234 Reinforcement Learning: Keras implementation of Recurrent Deterministic Policy Gradient (https://arxiv.org/abs/1512.04455)☆10Jun 10, 2017Updated 8 years ago
- These are the core packages for the Shadow Robot hardware and simulation.☆19Aug 19, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Deep Learning the Sorting Algorithm☆12Dec 11, 2016Updated 9 years ago
- A news based stock scalper using LLM and quant approach☆15Jan 16, 2025Updated last year
- ☆10Apr 2, 2018Updated 8 years ago
- Official implementation of Neural Episodic Control with State Abstraction☆13Aug 3, 2023Updated 2 years ago
- VQ-TR repository☆12Apr 18, 2024Updated 2 years ago
- Neo4j 大规模 三元组 CVS 导入进数据库☆11Jul 31, 2020Updated 5 years ago
- My Data Provider: A minimal multi-exchange data providing project to feed trading algorithms/bots. Built with Python and FastAPI.☆11May 30, 2024Updated last year