Code for [NeurIPS'2019 Spotlight] Policy Continuation with Hindsight Inverse Dynamics
☆15Jan 7, 2020Updated 6 years ago
Alternatives and similar repositories for PCHID_code
Users that are interested in PCHID_code are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A public repo for ICML 2021 "Shortest-Path Constrained Reinforcement Learning for Sparse Reward Tasks"☆13Jul 19, 2021Updated 4 years ago
- Code for ICLR 2022 paper Rethinking Goal-Conditioned Supervised Learning and Its Connection to Offline RL.☆27Feb 21, 2022Updated 4 years ago
- Code for Efficient Continuous Control with Double Actors and Regularized Critics, AAAI 2022.☆22Mar 11, 2022Updated 4 years ago
- Code for experimenting with state and action abstractions in reinforcement learning.☆29Dec 11, 2020Updated 5 years ago
- Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier"☆28May 22, 2023Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code for 'Mapping State Space using Landmarks for Universal Goal Reaching'.☆16Dec 26, 2023Updated 2 years ago
- Code for NeurIPS 2021 paper "Offline Reinforcement Learning with Reverse Model-based Imagination"☆20Dec 22, 2021Updated 4 years ago
- ☆33Nov 21, 2022Updated 3 years ago
- Learning Individual Intrinsic Reward in MARL☆65Dec 8, 2022Updated 3 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆65Sep 6, 2023Updated 2 years ago
- [ICLR 2023] The official code for paper "Guarded Policy Optimization with Imperfect Online Demonstrations"☆14Apr 30, 2023Updated 3 years ago
- This is a project using Pytorch to fulfill reinforcement learning on a simple game - Gridworld☆14Jul 13, 2020Updated 5 years ago
- Code for the ICRA2018 paper "Trajectory-Optimized Sensing for Active Search of Tissue Abnormalities in Robotic Surgery"☆11May 22, 2018Updated 8 years ago
- ☆15Jul 23, 2025Updated 10 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆11Jan 17, 2025Updated last year
- A simple baseline for mountain-car @ gym☆12Jan 15, 2020Updated 6 years ago
- Source code for Interpretable Reward Redistribution in Reinforcement Learning: A Causal Approach (NeurIPS 2023)☆10Dec 12, 2023Updated 2 years ago
- [ICLR 2022] Official implementation of paper: Efficient Learning of Safe Driving Policy via Human-AI Copilot Optimization☆54Dec 23, 2022Updated 3 years ago
- Official repository for Paper "Offline Goal-Conditioned Reinforcement Learning via f-Advantage Regression" (NeurIPS 2022)☆38Oct 19, 2023Updated 2 years ago
- Example of android app written in Qt/Qml which uses MXNet for plant image recognition.☆10Nov 4, 2017Updated 8 years ago
- Change-Based Exploration Transfer☆35Apr 24, 2022Updated 4 years ago
- Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information.☆37Dec 7, 2020Updated 5 years ago
- Code for "Learning to Reach Goals via Iterated Supervised Learning"☆84May 16, 2022Updated 4 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Computational 3D microscopy with optical coherence refraction tomography (OCRT)☆12Jun 2, 2022Updated 4 years ago
- Convergent Policy Optimization for Safe Reinforcement Learning☆11Oct 26, 2019Updated 6 years ago
- Balanced K-means in Pytorch with strong GPU acceleration☆12Apr 30, 2020Updated 6 years ago
- An implementation of the paper "Solving the Rubik's Cube without Human Knowledge"☆14Dec 9, 2018Updated 7 years ago
- 3D feature-based image registration for neuroscience datasets☆14Aug 23, 2017Updated 8 years ago
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients☆27Sep 10, 2024Updated last year
- ☆42Mar 19, 2021Updated 5 years ago
- An implementation of effective policy ensemble.☆16Jul 5, 2023Updated 2 years ago
- Class template for dual quaternions using Eigen.☆14Aug 27, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Code for SyncTwin: Treatment Effect Estimation with Longitudinal Outcomes (NeurIPS 2021)☆12Nov 30, 2021Updated 4 years ago
- Pytorch implementation of BEAR in "Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction"☆11Oct 29, 2019Updated 6 years ago
- ☆40Nov 23, 2021Updated 4 years ago
- Code for☆15Oct 16, 2020Updated 5 years ago
- A Maximal Mutual Information Criterion for Manipulation Concept Discovery☆13Sep 26, 2024Updated last year
- SimPER: A Minimalist Approach to Preference Alignment without Hyperparameters (ICLR 2025)☆17Aug 22, 2025Updated 9 months ago
- ☆11Nov 11, 2025Updated 7 months ago