alxthm / rl-cheatsheetLinks
A summary of important concepts and algorithms in RL
☆36Updated 3 years ago
Alternatives and similar repositories for rl-cheatsheet
Users that are interested in rl-cheatsheet are comparing it to the libraries listed below
Sorting:
- Open source code for paper "Denoised MDPs: Learning World Models Better Than the World Itself"☆134Updated last year
- Avalanche fork adding RL support☆77Updated 2 years ago
- NEVIS'22: Benchmarking the next generation of never-ending learners☆102Updated 2 years ago
- PyTorch Package For Quasimetric Learning☆42Updated 8 months ago
- Plug-and-play hydra sweepers for the EA-based multifidelity method DEHB and several population-based training variations, all proven to e…☆81Updated last year
- A2C is a special case of PPO!☆22Updated 3 years ago
- ☆55Updated 2 years ago
- Code for the paper Language as a Cognitive Tool to Imagine Goals in Curiosity Driven Exploration☆32Updated 4 years ago
- Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories☆42Updated 2 years ago
- Implementation of BC-IRL and other IRL baselines☆28Updated 2 years ago
- Official code for "Reward-Free Curricula for Training Robust World Models", ICLR 2024.☆27Updated last year
- Supplementary Data for Evolving Reinforcement Learning Algorithms☆46Updated 4 years ago
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"☆31Updated last year
- ☆34Updated 2 years ago
- ☆44Updated 9 months ago
- ☆45Updated last year
- Implementation of Hierarchical Transformer Memory (HTM) for Pytorch☆75Updated 3 years ago
- Building blocks for productive research☆59Updated 5 months ago
- ☆17Updated last year
- ☆36Updated 2 years ago
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT)☆28Updated 2 years ago
- ☆36Updated this week
- Crawl & Visualize ICLR 2023 Data from OpenReview☆84Updated 2 years ago
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients☆26Updated 10 months ago
- Code for the paper "Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-Making"☆27Updated last year
- MACTA: A Multi-agent Reinforcement Learning Approach for Cache Timing Attacks and Detection☆46Updated 2 years ago
- ☆101Updated last year
- Exploration into the Scaling Value Iteration Networks paper, from Schmidhuber's group☆36Updated 9 months ago
- ☆32Updated 8 months ago
- Solving the Abstraction & Reasoning Corpus with DreamCoder☆49Updated 9 months ago