xihuai18/awesome-RL-generalization

xihuai18 / awesome-RL-generalization

A list of papers regarding generalization in (deep) reinforcement learning

☆11

Alternatives and similar repositories for awesome-RL-generalization

Users that are interested in awesome-RL-generalization are comparing it to the libraries listed below.

shichenhui / SpectraClustering
Clustering algorithms and processing methods for astronomical spectra.
☆10 · Oct 24, 2023 · Updated 2 years ago
thomyphan / scalable-marl
Scalable Multi-Agent Reinforcement Learning
☆15 · Dec 25, 2021 · Updated 4 years ago
EMI-Group / BLVG
Matlab code for the IEEE TCYB paper "Evolutionary Large-Scale Dynamic Optimization Using Bilevel Variable Grouping".
☆11 · May 16, 2022 · Updated 4 years ago
PatrickGuo / Mistify
☆10 · May 16, 2021 · Updated 5 years ago
ir-uam / kNNBandit
Software for the experiments reported in the RecSys 2019 paper "A Simple Multi-Armed Nearest-Neighbor Bandit for Interactive Recommendati…
☆21 · Apr 4, 2024 · Updated 2 years ago
PKU-RL / CORRO
[ICML 2022] Robust Task Representations for Offline Meta-Reinforcement Learning via Contrastive Learning
☆40 · Aug 17, 2022 · Updated 3 years ago
metekemertas / RobustBisimulation
Learning bisimulation metrics for control, particularly suited to sparse reward settings
☆11 · Feb 28, 2023 · Updated 3 years ago
TianhongDai / metaworld-sac
☆12 · Aug 28, 2020 · Updated 5 years ago
LikeGiver / VideoRAG
A tiny project to test the effectiveness of video QA through RAG techniques and multimodal LLMs
☆15 · Jun 2, 2024 · Updated 2 years ago
yunglau / QGFN
QGFN: Controllable Greediness with Action Values - Code
☆11 · May 17, 2024 · Updated 2 years ago
sjtu-marl / ZSC-Eval
This repository is the official implementation of ZSC-Eval: An Evaluation Toolkit and Benchmark for Multi-agent Zero-shot Coordination. P…
☆56 · Nov 22, 2025 · Updated 8 months ago
WEIRDLabUW / dispo
Distributional Successor Features Enable Zero-Shot Policy Optimization
☆15 · Apr 11, 2025 · Updated last year
AffordableGenerativeAgents / Affordable-Generative-Agents
☆57 · Aug 28, 2024 · Updated last year
RuixiaoZhang / Pensieve-Pytorch
A PyTorch implementation of Pensieve (SIGCOMM'18)
☆12 · Jun 17, 2020 · Updated 6 years ago
AlloyTools / alloytools.github.io
Website for Alloytools
☆13 · Nov 3, 2025 · Updated 8 months ago
ec2604 / ContraBAR
☆13 · May 21, 2023 · Updated 3 years ago
linjh1118 / MLLM-from-scratch
"Multimodal Large Model Deployment and Fine-Tuning Guide": quickly deploy and fine-tune multimodal large models
☆14 · Dec 4, 2024 · Updated last year
ido90 / RobustMetaRL
A variant of VariBAD that is robust to difficult tasks
☆11 · Aug 30, 2023 · Updated 2 years ago
maohangyu / PDiT
PDiT: Interleaving Perception and Decision-making Transformers for Deep Reinforcement Learning. AAMAS 2024 (full paper with oral presenta…
☆10 · Dec 27, 2023 · Updated 2 years ago
bashendixie / ml_toolset
☆15 · Jan 6, 2024 · Updated 2 years ago
jscriptcoder / Upside-Down-Reinforcement-Learning
Landing a Spaceship using Upside-Down Reinforcement Learning (a.k.a ⅂ꓤ)
☆13 · Oct 25, 2023 · Updated 2 years ago
hhdo / DaeMon
This is the official code release of the following paper: Hao Dong et al., Adaptive Path-Memory Network for Temporal Knowledge Graph Reas…
☆19 · Jan 31, 2024 · Updated 2 years ago
junaiddk / transmix
TransMix: Transformer-based Value Function Decomposition for Cooperative Multi-agent Reinforcement Learning
☆11 · Oct 18, 2022 · Updated 3 years ago
irec-org / irec
Interactive Recommender Systems Framework
☆26 · Apr 5, 2024 · Updated 2 years ago
SafeRL-Lab / Robust-RL-Baselines
Robust Reinforcement Learning Benchmark
☆13 · Sep 22, 2024 · Updated last year
EgOrlukha / MuJoCo-PyTorch
PyTorch implementation of vanilla PG, TNPG, TRPO, and PPO on MuJoCo environments
☆12 · Feb 22, 2019 · Updated 7 years ago
Ricky-Zhu / IRDEC
[IROS 2023] Learning to Solve Tasks with Exploring Prior Behaviours
☆13 · Mar 3, 2024 · Updated 2 years ago
APOIR2018 / APOIR
Adversarial Point-of-Interest Recommendation
☆26 · Feb 14, 2018 · Updated 8 years ago
awwang10 / sphinx
☆14 · Oct 23, 2025 · Updated 9 months ago
lmzintgraf / hyperx
☆16 · Aug 2, 2022 · Updated 3 years ago
sbelharbi / learning-class-invariant-features
Repository for the code of the paper "Neural Networks Regularization Through Class-wise Invariant Representation Learning".
☆12 · Oct 1, 2017 · Updated 8 years ago
hanizaidi110 / Opponent-Modeling-and-Predicting-Opponent-moves-in-Poker
Advanced_Data_Integration_Project
☆11 · Jul 31, 2018 · Updated 7 years ago
akazemipour / Distributional-RL
Implementations of several deep distributional reinforcement learning algorithms.
☆26 · Jun 17, 2025 · Updated last year
mwufi / meta-rl-bandits
A simple RNN meta-learner
☆10 · Dec 17, 2018 · Updated 7 years ago
MuyaoYuan / InfoSAM
☆17 · Jun 3, 2025 · Updated last year
subingangadharan / cmu15418
My solution code for parallel architecture and programming, Spring 2016
☆12 · Aug 15, 2016 · Updated 9 years ago
LDlabs / seqMultiTaskRNN
Sequential learning in orthogonal subspaces
☆14 · Nov 20, 2020 · Updated 5 years ago
YuxiaWu / PLSPL
[TKDE 2020] Code and data for "Personalized long- and short-term preference learning for next POI recommendation."
☆23 · Dec 18, 2023 · Updated 2 years ago
WJ2003B / mqe-release
Official Release of Multistep Quasimetric Estimation (MQE)
☆18 · Mar 13, 2026 · Updated 4 months ago