A list of papers regarding generalization in (deep) reinforcement learning
☆11Aug 13, 2023Updated 2 years ago
Alternatives and similar repositories for awesome-RL-generalization
Users that are interested in awesome-RL-generalization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Clustering algorithms processing methods on astronomical spectra.☆10Oct 24, 2023Updated 2 years ago
- Scalable Multi-Agent Reinforcement Learning☆15Dec 25, 2021Updated 4 years ago
- Matlab code for the IEEE TCYB paper "Evolutionary Large-Scale Dynamic Optimization Using Bilevel Variable Grouping".☆11May 16, 2022Updated 3 years ago
- This repository is the official implementation of ZSC-Eval: An Evaluation Toolkit and Benchmark for Multi-agent Zero-shot Coordination. P…☆55Nov 22, 2025Updated 5 months ago
- ☆10May 16, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Software for the experiments reported in the RecSys 2019 paper "A Simple Multi-Armed Nearest-Neighbor Bandit for Interactive Recommendati…☆21Apr 4, 2024Updated 2 years ago
- [ICML 2022] Robust Task Representations for Offline Meta-Reinforcement Learning via Contrastive Learning☆40Aug 17, 2022Updated 3 years ago
- Learning bisimulation metrics for control, particularly suited to sparse reward settings☆10Feb 28, 2023Updated 3 years ago
- Implementation of some of the Deep Distributional Reinforcement Learning Algorithms.☆26Jun 17, 2025Updated 10 months ago
- ☆12Aug 28, 2020Updated 5 years ago
- a tiny project to test the effectiveness of video QA through RAG techniques and multimodal LLMs☆15Jun 2, 2024Updated last year
- Repository for (for now) filing bug reports about PLAI.☆15Jul 5, 2025Updated 10 months ago
- QGFN: Controllable Greediness with Action Values - Code☆11May 17, 2024Updated last year
- ☆56Aug 28, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Distributional Successor Features Enable Zero-Shot Policy Optimization☆14Apr 11, 2025Updated last year
- A Pytorch implementation of Pensieve (SIGCOMM'18)☆12Jun 17, 2020Updated 5 years ago
- RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning☆18May 24, 2023Updated 2 years ago
- SineKAN: Kolmogorov-Arnold Networks Using Sinusoidal Activation Functions☆15Dec 19, 2024Updated last year
- ☆13May 21, 2023Updated 2 years ago
- A variant of Varibad that is robust to difficult tasks☆11Aug 30, 2023Updated 2 years ago
- 《多模态大模型部署微调指南》快速部署/微调多模态大模型☆12Dec 4, 2024Updated last year
- ☆15Jan 6, 2024Updated 2 years ago
- ☆26Oct 25, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Website for Alloytools☆13Nov 3, 2025Updated 6 months ago
- Official implementation of Attention (as discrete-time Markov) Chains☆24Nov 4, 2025Updated 6 months ago
- Landing a Spaceship using Upside-Down Reinforcement Learning (a.k.a ⅂ꓤ)☆13Oct 25, 2023Updated 2 years ago
- This is the official code release of the following paper: Hao Dong et al., Adaptive Path-Memory Network for Temporal Knowledge Graph Reas…☆19Jan 31, 2024Updated 2 years ago
- PDiT: Interleaving Perception and Decision-making Transformers for Deep Reinforcement Learning. AAMAS 2024 (full paper with oral presenta…☆10Dec 27, 2023Updated 2 years ago
- The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity".☆30Nov 12, 2024Updated last year
- PyTorch implementation of Vanilla PG, TNPG, TRPO, PPO on Mujoco environment☆12Feb 22, 2019Updated 7 years ago
- [IROS2023]Learning to Solve Tasks with Exploring Prior Behaviours☆12Mar 3, 2024Updated 2 years ago
- ☆13Oct 23, 2025Updated 6 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆16Aug 2, 2022Updated 3 years ago
- Repository for the code of the paper "Neural Networks Regularization Through Class-wise Invariant Representation Learning".