PyTorch implementation of Vanilla PG, TNPG, TRPO, PPO on Mujoco environment
☆15Jul 1, 2018Updated 7 years ago
Alternatives and similar repositories for mujoco-pg
Users that are interested in mujoco-pg are comparing it to the libraries listed below.
- Distributed Prioritized Experience Replay☆10Aug 8, 2018Updated 7 years ago
- This is MPE-pytorch, with some bugs fixed.☆11Apr 26, 2020Updated 6 years ago
- ☆10Apr 13, 2023Updated 3 years ago
- Robust policy search algorithms which train on model ensembles☆31Oct 26, 2016Updated 9 years ago
- This is a PyTorch version of MADDPG.☆10Jun 23, 2020Updated 5 years ago
- Sumo OSM short usage tutorial☆15Feb 7, 2018Updated 8 years ago
- Semantic-Aware Fine-Grained Correspondence, at ECCV 2022 (Oral)☆14Oct 29, 2022Updated 3 years ago
- Implementation of the expectation-maximization (EM) algorithm for Gaussian mixture models☆11Aug 24, 2018Updated 7 years ago
- Offline RL experiments☆15Oct 1, 2022Updated 3 years ago
- Implement IMPALA architecture from Distributed Deep-RL Paper.☆15Oct 18, 2018Updated 7 years ago
- Implementation of Stochastic Depth Networks in Keras☆13Sep 10, 2016Updated 9 years ago
- Code for 'Inference Suboptimality in Variational Autoencoders'☆10May 22, 2020Updated 6 years ago
- Policy Gradient algorithms (REINFORCE, NPG, TRPO, PPO)☆371Aug 1, 2019Updated 6 years ago
- ☆11Jul 14, 2021Updated 4 years ago
- ☆18Mar 2, 2025Updated last year
- ☆11Oct 6, 2020Updated 5 years ago
- A Qwen .5B reasoning model trained on OpenR1-Math-220k☆14Oct 11, 2025Updated 8 months ago
- Implementation of: Kristiadi, Agustinus, and Asja Fischer. "Predictive Uncertainty Quantification with Compound Density Networks." (2019)…☆16May 26, 2022Updated 4 years ago
- ☆18Jun 15, 2023Updated 2 years ago
- Official Codebase for "Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control" (NeurIPS 2024)☆15Oct 29, 2024Updated last year
- Code for [NeurIPS'2019 Spotlight] Policy Continuation with Hindsight Inverse Dynamics☆15Jan 7, 2020Updated 6 years ago
- Implementation of the paper <Model-based Reinforcement Learning for Predictions and Control for Limit Order Books (Wei et al., J.P. Morga…☆11Aug 22, 2023Updated 2 years ago
- Implementation of FB8, a generalization of the Kent (1982) and Bingham-Mardia (1978) distributions on a sphere☆19Apr 8, 2026Updated 2 months ago
- Official repo for vidar and vidarc: video foundation model for robotics.☆40Dec 22, 2025Updated 5 months ago
- A public repo for ICML 2021 "Shortest-Path Constrained Reinforcement Learning for Sparse Reward Tasks"☆13Jul 19, 2021Updated 4 years ago
- Pytorch implementation of BEAR in "Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction"☆11Oct 29, 2019Updated 6 years ago
- CS234 Reinforcement Learning: Keras implementation of Recurrent Deterministic Policy Gradient (https://arxiv.org/abs/1512.04455)☆10Jun 10, 2017Updated 9 years ago
- These are the core packages for the Shadow Robot hardware and simulation.☆19Aug 19, 2024Updated last year
- SimPER: A Minimalist Approach to Preference Alignment without Hyperparameters (ICLR 2025)☆17Aug 22, 2025Updated 9 months ago
- A news based stock scalper using LLM and quant approach☆15Jan 16, 2025Updated last year
- ☆10Apr 2, 2018Updated 8 years ago
- GPU-Accelerated Trajectory Optimization☆38May 30, 2026Updated 2 weeks ago
- VQ-TR repository☆12Apr 18, 2024Updated 2 years ago
- Neo4j: importing large-scale triples from CSV into the database☆11Jul 31, 2020Updated 5 years ago
- Mujoco environment to test GUFIC formulation☆34Updated this week
- A trading system in python with GUI extension in PYQT. Proposed accepted API : many including those in README.☆11Jun 10, 2020Updated 6 years ago
- ☆17Dec 29, 2021Updated 4 years ago
- ☆10Jun 14, 2024Updated 2 years ago
- We developed a task-driven hybrid model reduction method for solving dexterous manipulation with 5 minutes of online learning.☆18Mar 27, 2024Updated 2 years ago