PyTorch implementation of (Deep) Reinforcement Learning (RL) algorithms
☆25 · Jun 26, 2022 · Updated 3 years ago
Alternatives and similar repositories for rl_sandbox_public
Users that are interested in rl_sandbox_public are comparing it to the libraries listed below.
- Manipulation OpenAI Gym environments to simulate robots at the STARS lab, as well as compatible imitation learning tools ☆17 · Jun 21, 2024 · Updated last year
- Learning from Guided Play: A Scheduled Hierarchical Approach for Improving Exploration in Adversarial Imitation Learning Source Code ☆17 · Aug 23, 2024 · Updated last year
- Seeing All the Angles: Learning Multiview Manipulation Policies for Contact-Rich Tasks from Demonstrations ☆11 · Jun 22, 2023 · Updated 2 years ago
- Accompanying repository for Unsupervised Active Domain Randomization in Goal-Directed RL ☆12 · Aug 4, 2020 · Updated 5 years ago
- ☆23 · Aug 19, 2022 · Updated 3 years ago
- Code base for NeurIPS 2022 paper Curriculum Reinforcement Learning using Optimal Transport via Gradual Domain Adaptation. ☆11 · Aug 21, 2023 · Updated 2 years ago
- Code for "Possibility Before Utility: Learning And Using Hierarchical Affordances" (ICLR 2022) ☆14 · Mar 14, 2022 · Updated 4 years ago
- PyTorch implementation for "Discovery of Incremental Skills" (DISk) algorithm from ICLR 2022 paper "One After Another: Learning Increment… ☆20 · Mar 22, 2022 · Updated 4 years ago
- reinforcement learning from randomized simulations ☆68 · Mar 31, 2025 · Updated 11 months ago
- The official implementation of "Mind the Gap: Offline Policy Optimization for Imperfect Rewards" (ICLR2023) ☆16 · Mar 3, 2023 · Updated 3 years ago
- Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024 ☆24 · Apr 8, 2024 · Updated last year
- Official code for TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations ☆37 · Jan 24, 2026 · Updated 2 months ago
- ☆89 · Sep 28, 2021 · Updated 4 years ago
- Official PyTorch Implementation of CMLO in the paper "When to Update Your Model: Constrained Model-based Reinforcement Learning" ☆10 · Nov 2, 2023 · Updated 2 years ago
- MuJoCo models for Unitree Robots ☆12 · Nov 24, 2021 · Updated 4 years ago
- Code for AAAI 2023 paper "Hypernetworks for Zero-shot Transfer in Reinforcement Learning" ☆22 · Apr 26, 2023 · Updated 2 years ago
- Docker containers of baseline agents for the Crafter environment ☆30 · Dec 14, 2021 · Updated 4 years ago
- This repository is a collection of widely used self-supervised auxiliary losses used for learning representations in reinforcement learni… ☆14 · Feb 27, 2023 · Updated 3 years ago
- PiKV: KV Cache Management System for Mixture of Experts [Efficient ML System] ☆49 · Feb 24, 2026 · Updated last month
- [ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View. ☆23 · Apr 19, 2024 · Updated last year
- ☆134 · May 8, 2020 · Updated 5 years ago
- Code associated with our paper "Robust Domain Randomization for Reinforcement Learning" ☆12 · Nov 22, 2022 · Updated 3 years ago
- [ICRA'25] H2O+: An Improved Framework for Hybrid Offline-and-Online RL with Dynamics Gaps ☆12 · Apr 10, 2025 · Updated 11 months ago
- [NeurIPS'20] Code for the paper "Offline Imitation Learning with a Misspecified Simulator" ☆12 · Nov 24, 2021 · Updated 4 years ago
- Causal Analysis of Agent Behavior for AI Safety ☆20 · Jun 27, 2023 · Updated 2 years ago
- ☆12 · Oct 19, 2023 · Updated 2 years ago
- ☆63 · Jan 30, 2026 · Updated last month
- Source code for the paper "Policy Architectures for Compositional Generalization in Control" ☆30 · May 19, 2022 · Updated 3 years ago
- A set of environments utilizing pybullet for simulation of robotic manipulation tasks. ☆29 · Mar 8, 2021 · Updated 5 years ago
- ☆15 · Sep 16, 2025 · Updated 6 months ago
- Code-base for the paper Spectral Normalisation for Deep Reinforcement Learning: An Optimisation Perspective. ☆11 · Jun 26, 2021 · Updated 4 years ago
- Cluttered-Scene 6D Grasping with Latent Plans ☆19 · Mar 30, 2022 · Updated 3 years ago
- Lightweight control environment for Franka robot ☆12 · Mar 16, 2022 · Updated 4 years ago
- Code for "Convergence of Learning Dynamics in Stackelberg Games" ☆13 · Nov 6, 2019 · Updated 6 years ago
- An OpenAI Gym style reinforcement learning interface for Agility Robotics' biped robot Cassie ☆41 · Apr 23, 2019 · Updated 6 years ago
- Advantage weighted Actor Critic for Offline RL ☆53 · Aug 27, 2022 · Updated 3 years ago
- SpatialThinker: Reinforcing 3D Reasoning in Multimodal LLMs via Spatial Rewards ☆35 · Jan 28, 2026 · Updated 2 months ago
- Clockwork VAEs in JAX/Flax ☆32 · Jul 16, 2021 · Updated 4 years ago
- Simple maze environments using mujoco-py ☆59 · Dec 27, 2023 · Updated 2 years ago