guosyjlu/OEMA

Official PyTorch code for "Sample Efficient Offline-to-Online Reinforcement Learning" in TKDE'23.

☆16

Alternatives and similar repositories for OEMA

Users who are interested in OEMA are comparing it to the repositories listed below.

Shaokang-Agent / Awesome-Reinforcement-Learning-Papers
📚 List of Top-tier Conference Papers on Reinforcement Learning (RL), including: NeurIPS, AAAI, IJCAI, ICML, AAMAS, ICLR, ICRA, etc. | （AI…
☆11 · Aug 20, 2023 · Updated 2 years ago
Shaokang-Agent / WToE
Implementation of the paper "WToE: Learning When to Explore in Multi-Agent Reinforcement Learning"
☆21 · Aug 17, 2024 · Updated last year
Shaokang-Agent / D-F
Implementation of the paper "Egoism, Utilitarianism and Egalitarianism in Multi-Agent Reinforcement Learning"
☆21 · Aug 17, 2024 · Updated last year
Shaokang-Agent / S2L
Implementation of the paper "Multi-Agent Exploration via Self-Learning and Social Learning"
☆20 · Dec 7, 2024 · Updated last year
yifan-h / Multilingual_Space
Source Code for "Adapters for Enhanced Modeling of Multilingual Knowledge and Text"
☆12 · Oct 28, 2022 · Updated 3 years ago
yifan-h / Graph_Probe-Birds_Eye
Bird’s Eye: Probing for Linguistic Graph Structures with a Simple Information-Theoretic Approach
☆11 · Aug 1, 2021 · Updated 4 years ago
Shaokang-Agent / LLM-Agent-Paper-List
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
☆12 · May 2, 2024 · Updated 2 years ago
yifan-h / MechanisticProbe
Towards a Mechanistic Interpretation of Multi-Step Reasoning Capabilities of Language Models
☆15 · Nov 4, 2023 · Updated 2 years ago
machine-teaching-group / neurips2022_exploration-guided-reward-shaping
☆17 · Oct 11, 2022 · Updated 3 years ago
Haichao-Zhang / PEX
Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)
☆64 · Apr 4, 2023 · Updated 3 years ago
thuml / SPOT
Code release for "Supported Policy Optimization for Offline Reinforcement Learning" (NeurIPS 2022), https://arxiv.org/abs/2202.06239
☆22 · Jun 24, 2023 · Updated 3 years ago
Shaokang-Agent / DCVTD
Implementation of the paper "Decentralized Counterfactual Value with Threat Detection for Multi-Agent Reinforcement Learning in Mixed Coo…
☆17 · Dec 7, 2024 · Updated last year
shlee94 / Off2OnRL
☆61 · Feb 3, 2023 · Updated 3 years ago
AIDefender / MyDiscor
Unofficial Code for NeurIPS 2021 paper "Regret Minimization Experience Replay in Off-policy Reinforcement Learning"
☆14 · May 24, 2021 · Updated 5 years ago
rozenk30 / Quantitative-Comparison-of-RL-and-MPC
Codes for "Quantitative Comparison of Reinforcement Learning and Data-driven Model Predictive Control for Chemical and Biological Process…
☆12 · Dec 18, 2023 · Updated 2 years ago
sail-sg / offbench
☆16 · Jun 1, 2023 · Updated 3 years ago
schatty / EMAC
[IJCAI 2021] Solving Continuous Control with Episodic Memory
☆15 · Apr 10, 2022 · Updated 4 years ago
SAIC-MONTREAL / hyperzero
Code for AAAI 2023 paper "Hypernetworks for Zero-shot Transfer in Reinforcement Learning"
☆24 · Apr 26, 2023 · Updated 3 years ago
sa-and / MCD
☆12 · Mar 21, 2024 · Updated 2 years ago
haje01 / distper
Distributed Prioritized Experience Replay
☆10 · Aug 8, 2018 · Updated 7 years ago
FSLight1996 / SHER
code of IJCAI submission "Soft Hindsight Experience Replay"
☆13 · Mar 23, 2020 · Updated 6 years ago
tung-nd / cwbc
☆11 · Oct 3, 2022 · Updated 3 years ago
Mohan-Zhang-u / smpl
☆24 · Jan 25, 2023 · Updated 3 years ago
dmitrykazhdan / MARLeME
General-purpose library for extracting interpretable models from Multi-Agent Reinforcement Learning systems
☆22 · May 10, 2020 · Updated 6 years ago
YiqinYang / VEM
Codes accompanying the paper "Offline Reinforcement Learning with Value-Based Episodic Memory" (ICLR 2022 https://arxiv.org/abs/2110.0979…
☆15 · Mar 9, 2022 · Updated 4 years ago
daniellawson9999 / online-decision-transformer
An unofficial implementation for online decision transformer
☆41 · Sep 20, 2022 · Updated 3 years ago
ReedZyd / GenerativeReturnDecomposition
Source code for Interpretable Reward Redistribution in Reinforcement Learning: A Causal Approach (NeurIPS 2023)
☆10 · Dec 12, 2023 · Updated 2 years ago
Benn314 / Typora-Ben-Themes
A cyberpunk retro-style theme for Typora (includes Mac traffic-light window buttons)
☆10 · Mar 9, 2024 · Updated 2 years ago
MehranTaghian / SAC_GCN
Explainability of Deep RL algorithms using graph networks and layer-wise relevance propagation.
☆12 · Aug 20, 2024 · Updated last year
LAMDA-RL / PRDC
Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D…
☆18 · Nov 8, 2024 · Updated last year
joeegan17 / DQN-for-Electrical-Microgrid-Control
Reinforcement learning project using deep Q-learning to control the operations of an electrical microgrid
☆11 · Jan 3, 2023 · Updated 3 years ago
yifan-h / GCS_KI
What Has Been Enhanced in my Knowledge-Enhanced Language Model?
☆13 · Oct 26, 2022 · Updated 3 years ago
brewinn / Roadrunner-CellCounter
A cell counter using computer vision techniques.
☆10 · May 13, 2022 · Updated 4 years ago
Charlie0257 / T2TL
Exploiting Transformer in Reinforcement Learning for Interpretable Temporal Logic Motion Planning (RAL 2023)
☆12 · Jul 17, 2023 · Updated 3 years ago
TanguyLevent / RL4Microgrids
RL for Energy Management of Microgrids
☆11 · Mar 28, 2020 · Updated 6 years ago
lizhuo-1994 / NECSA
Official implementation of Neural Episodic Control with State Abstraction
☆13 · Aug 3, 2023 · Updated 2 years ago
hari-sikchi / offline_rl
PyTorch implementation of state-of-the-art offline reinforcement learning algorithms.
☆23 · Aug 27, 2022 · Updated 3 years ago
ammar-n-abbas / Predictive-Maintenance-BC-IOHMM-DRL
Hierarchical Framework for Interpretable Deep Reinforcement Learning-Based Predictive Maintenance (Applied to NASA Turbofan engine datas…
☆14 · Feb 9, 2024 · Updated 2 years ago
Cloud0723 / Offline-MLIRL
☆22 · Dec 18, 2023 · Updated 2 years ago