holarissun/PCHID_code

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/holarissun/PCHID_code)

holarissun / PCHID_code

Code for [NeurIPS'2019 Spotlight] Policy Continuation with Hindsight Inverse Dynamics

☆15

Alternatives and similar repositories for PCHID_code

Users that are interested in PCHID_code are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

srsohn / shortest-path-rl
View on GitHub
A public repo for ICML 2021 "Shortest-Path Constrained Reinforcement Learning for Sparse Reward Tasks"
☆13Jul 19, 2021Updated 5 years ago
YangRui2015 / AWGCSL
View on GitHub
Code for ICLR 2022 paper Rethinking Goal-Conditioned Supervised Learning and Its Connection to Offline RL.
☆27Feb 21, 2022Updated 4 years ago
dmksjfl / DARC
View on GitHub
Code for Efficient Continuous Control with Double Actors and Regularized Critics, AAAI 2022.
☆22Mar 11, 2022Updated 4 years ago
wenzhe-li / romi
View on GitHub
Code for NeurIPS 2021 paper "Offline Reinforcement Learning with Reverse Model-based Imagination"
☆20Dec 22, 2021Updated 4 years ago
FangchenLiu / map_planner
View on GitHub
Code for 'Mapping State Space using Landmarks for Universal Goal Reaching'.
☆16Dec 26, 2023Updated 2 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
gjp1203 / nui_in_madrl
View on GitHub
Negative Update Intervals in Multi-Agent Deep Reinforcement Learning
☆35May 14, 2019Updated 7 years ago
yalidu / liir
View on GitHub
Learning Individual Intrinsic Reward in MARL
☆65Dec 8, 2022Updated 3 years ago
metadriverse / TS2C
View on GitHub
[ICLR 2023] The official code for paper "Guarded Policy Optimization with Imperfect Online Demonstrations"
☆14Apr 30, 2023Updated 3 years ago
metadriverse / AIM
View on GitHub
☆14Jul 23, 2025Updated 11 months ago
Hadisalman / trajectory-optimized-active-search
View on GitHub
Code for the ICRA2018 paper "Trajectory-Optimized Sensing for Active Search of Tissue Abnormalities in Robotic Surgery"
☆12May 22, 2018Updated 8 years ago
mingen-pan / Reinforcement-Learning-Q-learning-Gridworld-Pytorch
View on GitHub
This is a project using Pytorch to fulfill reinforcement learning on a simple game - Gridworld
☆14Jul 13, 2020Updated 6 years ago
jacobleehei / genforce-streamlit
View on GitHub
☆16Oct 20, 2021Updated 4 years ago
luogongning / PlaqueRL
View on GitHub
☆11Jan 17, 2025Updated last year
greatwallet / mountain-car
View on GitHub
A simple baseline for mountain-car @ gym
☆12Jan 15, 2020Updated 6 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
decisionforce / HACO
View on GitHub
[ICLR 2022] Official implementation of paper: Efficient Learning of Safe Driving Policy via Human-AI Copilot Optimization
☆54Dec 23, 2022Updated 3 years ago
Ash1362 / ray-based-quantitative-ultrasound-tomography
View on GitHub
r-Wave: An open-source MATLAB package for quantitative ultrasound tomography via ray-Born inversion with in vitro and in vivo validation
☆15Jul 6, 2026Updated 2 weeks ago
mjamroz / PlantRecognition
View on GitHub
Example of android app written in Qt/Qml which uses MXNet for plant image recognition.
☆10Nov 4, 2017Updated 8 years ago
JasonMa2016 / GoFAR
View on GitHub
Official repository for Paper "Offline Goal-Conditioned Reinforcement Learning via f-Advantage Regression" (NeurIPS 2022)
☆39Oct 19, 2023Updated 2 years ago
snu-mllab / EMI
View on GitHub
Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information.
☆37Dec 7, 2020Updated 5 years ago
rag-h / MATLAB_UR5e_RTDE
View on GitHub
An abstraction layer allowing tcp communication between Matlab (windows) to ursim (linux in vm).
☆17Jun 1, 2026Updated last month
kevinczhou / 3d-ocrt
View on GitHub
Computational 3D microscopy with optical coherence refraction tomography (OCRT)
☆12Jun 2, 2022Updated 4 years ago
Ktakuya332C / deepcube
View on GitHub
An implementation of the paper "Solving the Rubik's Cube without Human Knowledge"
☆14Dec 9, 2018Updated 7 years ago
martius-lab / learningwithmuscles
View on GitHub
Repo for the paper: Learning with Muscles: Benefits for Data-Efficiency and Robustness in Anthropomorphic Tasks. https://al.is.mpg.de/pub…
☆16Dec 1, 2022Updated 3 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
dgoodwin208 / Registration
View on GitHub
3D feature-based image registration for neuroscience datasets
☆14Aug 23, 2017Updated 8 years ago
abaheti95 / LoL-RL
View on GitHub
Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients
☆26Sep 10, 2024Updated last year
oxwhirl / comix
View on GitHub
☆42Mar 19, 2021Updated 5 years ago
ming93 / Safe_reinforcement_learning
View on GitHub
Convergent Policy Optimization for Safe Reinforcement Learning
☆11Oct 26, 2019Updated 6 years ago
microsoft / EPPO
View on GitHub
An implementation of effective policy ensemble.
☆16Jul 5, 2023Updated 3 years ago
tomsilver / camps
View on GitHub
Code for
☆15Oct 16, 2020Updated 5 years ago
ZhaozhiQIAN / SyncTwin-NeurIPS-2021
View on GitHub
Code for SyncTwin: Treatment Effect Estimation with Longitudinal Outcomes (NeurIPS 2021)
☆12Nov 30, 2021Updated 4 years ago
tianjunz / NovelD
View on GitHub
☆40Nov 23, 2021Updated 4 years ago
ryanxhr / BEAR
View on GitHub
Pytorch implementation of BEAR in "Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction"
☆11Oct 29, 2019Updated 6 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
tengxiao1 / SimPER
View on GitHub
SimPER: A Minimalist Approach to Preference Alignment without Hyperparameters (ICLR 2025)
☆17Aug 22, 2025Updated 10 months ago
Hasenpfote / dualquat
View on GitHub
Class template for dual quaternions using Eigen.
☆14Aug 27, 2023Updated 2 years ago
Selsion / DSPMods
View on GitHub
☆11Nov 11, 2025Updated 8 months ago
gabb7 / AReS-MaRS
View on GitHub
Python 3.6 and TensorFlow implementation of the AReS and MaRS algorithms
☆11Jun 23, 2019Updated 7 years ago
prajjwal1 / rl_paradigm
View on GitHub
Code for ICLR 2024 paper "When should we prefer Decision Transformers for Offline Reinforcement Learning?"
☆17Jan 31, 2024Updated 2 years ago
PeiZhou26 / MaxMI
View on GitHub
A Maximal Mutual Information Criterion for Manipulation Concept Discovery
☆14Sep 26, 2024Updated last year
lizhuo-1994 / NECSA
View on GitHub
Official implementation of Neural Episodic Control with State Abstraction
☆13Aug 3, 2023Updated 2 years ago