jiangsy/mbpo_pytorch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/jiangsy/mbpo_pytorch)

jiangsy / mbpo_pytorch

☆30

Alternatives and similar repositories for mbpo_pytorch

Users that are interested in mbpo_pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

jiangsy / LAMDA-Beamer-Template
View on GitHub
A beamer template for LAMDA lab at NJU
☆16Oct 17, 2020Updated 5 years ago
x35f / unstable_baselines
View on GitHub
Re-implementations of SOTA RL algorithms.
☆137Sep 7, 2023Updated 2 years ago
typoverflow / UtilsRL
View on GitHub
A python module designed for agile RL algorithm developing.
☆26Jul 11, 2024Updated 2 years ago
Xingyu-Lin / mbpo_pytorch
View on GitHub
A pytorch reprelication of the model-based reinforcement learning algorithm MBPO
☆189Apr 12, 2022Updated 4 years ago
lafmdp / HIDIL
View on GitHub
[NeurIPS'20] Code for the paper "Offline Imitation Learning with a Misspecified Simulator"
☆12Nov 24, 2021Updated 4 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
xionghuichen / MAPLE
View on GitHub
The Official Code for Offline Model-based Adaptable Policy Learning (NeurIPS'21 & TPAMI)
☆25Jan 16, 2024Updated 2 years ago
AIDefender / MyDiscor
View on GitHub
Unofficial Code for NeurIPS 2021 paper "Regret Minimization Experience Replay in Off-policy Reinforcement Learning"
☆14May 24, 2021Updated 5 years ago
xionghuichen / RLAssistant
View on GitHub
RLA is a tool for managing your RL experiments automatically
☆71Feb 7, 2023Updated 3 years ago
polixir / causal-mbrl
View on GitHub
Toolkit of Causal Model-based Reinforcement Learning.
☆33Jun 5, 2023Updated 3 years ago
roosephu / boots
View on GitHub
☆11Oct 14, 2019Updated 6 years ago
yilundu / imagination_augmented_agents
View on GitHub
Replicating Imagination-Augmented Agents for Deep Reinforcement Learning
☆20Dec 17, 2017Updated 8 years ago
nnaisense / MAGE
View on GitHub
Learning Action-Value Gradients in Model-based Policy Optimization
☆32Sep 7, 2021Updated 4 years ago
polixir / RLAssistant
View on GitHub
RLA is a tool for managing your RL experiments automatically
☆31Jan 11, 2025Updated last year
apexrl / autombpo
View on GitHub
Implementation of NeurIPS2021 paper <On Effective Scheduling of Model-based Reinforcement Learning>
☆13Nov 16, 2021Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
KyunghyunLee / aes-rl
View on GitHub
☆17Dec 12, 2020Updated 5 years ago
WilsonWangTHU / mbbl-metrpo
View on GitHub
☆16Jun 30, 2019Updated 7 years ago
tianheyu927 / mopo
View on GitHub
Code for MOPO: Model-based Offline Policy Optimization
☆191May 17, 2022Updated 4 years ago
okarthikb / DPO
View on GitHub
Implementation of Direct Preference Optimization
☆17Jul 17, 2023Updated 3 years ago
icaros-usc / dqd-rl
View on GitHub
Official implementation of "Approximating Gradients for Differentiable Quality Diversity in Reinforcement Learning"
☆22Oct 3, 2022Updated 3 years ago
Improbable-AI / harness-offline-rl
View on GitHub
Official implementation of Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Reweighting
☆16Feb 14, 2024Updated 2 years ago
pairlab / vagram
View on GitHub
[ICLR 22] Value Gradient weighted Model-Based Reinforcement Learning.
☆25Apr 15, 2023Updated 3 years ago
junaiddk / transmix
View on GitHub
TransMix: Transformer-based Value Function Decomposition for Cooperative Multi-agent Reinforcement Learning
☆11Oct 18, 2022Updated 3 years ago
johnHostetter / AAMAS-2023-FCQL
View on GitHub
A systematic design process for a self-organizing neuro-fuzzy Q-network for model-free and offline reinforcement learning.
☆11May 29, 2023Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
liuxhym / EDIS
View on GitHub
EDIS: Energy-guided DIffusion Sampling
☆19Aug 10, 2024Updated last year
llan-ml / MetaTNE
View on GitHub
Source code for NeurIPS 2020 paper "Node Classification on Graphs with Few-Shot Novel Labels via Meta Transformed Network Embedding"
☆10Nov 17, 2020Updated 5 years ago
typoverflow / WiseRL
View on GitHub
PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms
☆21Mar 24, 2025Updated last year
kevinzakka / dm_env_wrappers
View on GitHub
Standalone library of frequently-used wrappers for dm_env environments.
☆19Jul 9, 2024Updated 2 years ago
watchernyu / REDQ
View on GitHub
Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.
☆185Nov 14, 2024Updated last year
anishmadan23 / MAML_Pytorch_RL
View on GitHub
☆10Aug 8, 2021Updated 4 years ago
Tusharsd123 / Load-Frequency-Control-
View on GitHub
☆12Nov 11, 2021Updated 4 years ago
sparkmxy / my-offlinerl
View on GitHub
☆26Jun 14, 2022Updated 4 years ago
jidiai / Competition_RL4Stock
View on GitHub
☆17Jan 24, 2024Updated 2 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
prisma-lab / nonprehensile-object-transp
View on GitHub
☆15Aug 8, 2023Updated 2 years ago
LanqingLi1993 / FOCAL-ICLR
View on GitHub
Code for FOCAL Paper Published at ICLR 2021
☆55Dec 4, 2023Updated 2 years ago
yihaosun1124 / mobile
View on GitHub
Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization
☆22Apr 17, 2024Updated 2 years ago
skreynolds / ENG720_matlab_models
View on GitHub
A repository of load frequency control models implemeted in matlab
☆12Feb 22, 2025Updated last year
liziniu / policy_optimization
View on GitHub
Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data)
☆29Dec 19, 2023Updated 2 years ago
pimdh / causal-confusion
View on GitHub
Code for paper Causal Confusion in Imitation Learning
☆47Dec 17, 2019Updated 6 years ago
wangzizhao / CausalDynamicsLearning
View on GitHub
☆35Oct 23, 2022Updated 3 years ago