RajGhugare19/stitching-is-combinatorial-generalisation

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/RajGhugare19/stitching-is-combinatorial-generalisation)

RajGhugare19 / stitching-is-combinatorial-generalisation

[ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.

☆25

Alternatives and similar repositories for stitching-is-combinatorial-generalisation

Users that are interested in stitching-is-combinatorial-generalisation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

aliang8 / varibad_jax
View on GitHub
☆10Jun 27, 2024Updated 2 years ago
RyanNavillus / reward-surfaces
View on GitHub
☆19Apr 22, 2024Updated 2 years ago
robbycostales / HAL
View on GitHub
Code for "Possibility Before Utility: Learning And Using Hierarchical Affordances" (ICLR 2022)
☆14Mar 14, 2022Updated 4 years ago
danijar / crafter-baselines
View on GitHub
Docker containers of baseline agents for the Crafter environment
☆30Dec 14, 2021Updated 4 years ago
notmahi / disk
View on GitHub
PyTorch implementation for "Discovery of Incremental Skills" (DISk) algorithm from ICLR 2022 paper "One After Another: Learning Increment…
☆21Mar 22, 2022Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
google-deepmind / dmc_vision_benchmark
View on GitHub
☆34Jun 21, 2024Updated 2 years ago
chongyi-zheng / td_infonce
View on GitHub
Implementations of Temporal Difference InfoNCE (TD InfoNCE)
☆35Nov 13, 2023Updated 2 years ago
ldcq / ldcq
View on GitHub
☆35May 24, 2023Updated 3 years ago
HyeonwooNoh / my_ubuntu_settings
View on GitHub
☆10May 13, 2025Updated last year
clvrai / new-actions-rl
View on GitHub
☆24Aug 9, 2024Updated last year
Dragon-Zhuang / Reinformer
View on GitHub
Official code for ICML 2024 paper Reinformer: Max-Return Sequence Modeling for offline RL
☆49Oct 16, 2024Updated last year
gauthamvasan / avg
View on GitHub
Action Value Gradient Algorithm
☆28May 18, 2025Updated last year
FLAIROx / jafar
View on GitHub
JAX reimplementation of the DeepMind paper "Genie: Generative Interactive Environments"
☆107Jan 23, 2025Updated last year
quasimetric-learning / quasimetric-rl
View on GitHub
Open source code for paper "Optimal Goal-Reaching Reinforcement Learning via Quasimetric Learning" ICML 2023
☆61May 19, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
JesseFarebro / distributional-sr
View on GitHub
Official implementation of the δ-model presented in the ICML 2024 paper "A Distributional Analogue to the Successor Representation".
☆23Nov 8, 2024Updated last year
CEC-Agent / CEC
View on GitHub
Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"
☆32Oct 12, 2023Updated 2 years ago
ahjwang / messenger-emma
View on GitHub
Implements the Messenger environment and EMMA model.
☆25Jun 14, 2023Updated 3 years ago
srzer / LaMo-2023
View on GitHub
Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning".
☆53Apr 11, 2024Updated 2 years ago
twni2016 / Memory-RL
View on GitHub
When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)
☆73Apr 26, 2026Updated 3 months ago
bryanoliveira / sliding-puzzles-gym
View on GitHub
A scalable benchmark for state representation learning in visual reinforcement learning.
☆17Jun 23, 2025Updated last year
kwanyoungpark / MAC
View on GitHub
Code for Scalable Offline Model-Based RL with Action chunking
☆30Feb 20, 2026Updated 5 months ago
LAMDA-RL / ACT
View on GitHub
Official code for ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning (AAAI'24)
☆17Feb 10, 2024Updated 2 years ago
tinkoff-ai / cnf
View on GitHub
Official implementation for "Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows", NeurIPS 2022, O…
☆12Jan 31, 2023Updated 3 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
mazpie / redundancy-action-spaces
View on GitHub
[RA-L 2024] Novel action spaces leveraging redundancy in 7 DoF arms enable efficient & precise learning in robotic manipulation
☆23Jun 6, 2024Updated 2 years ago
enjeeneer / zero-shot-rl
View on GitHub
VC-FB and MC-FB algorithms from "Zero-Shot Reinforcement Learning from Low Quality Data" (NeurIPS 2024)
☆29Jan 14, 2025Updated last year
changchencc / Simple-Hierarchical-Planning-with-Diffusion
View on GitHub
☆36Jun 7, 2024Updated 2 years ago
dunnolab / xland-minigrid-datasets
View on GitHub
XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning - - — ICLR 2025
☆84Feb 13, 2025Updated last year
pcchenxi / LAPO-offlienRL
View on GitHub
☆16Apr 14, 2026Updated 3 months ago
e2crawfo / silot
View on GitHub
Original tensorflow implementation of SILOT (Spatially Invariant, Label-free Object Tracking).
☆13Mar 24, 2023Updated 3 years ago
seohongpark / METRA
View on GitHub
METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)
☆92Oct 15, 2023Updated 2 years ago
p-doom / jasmine
View on GitHub
A simple, performant and scalable JAX-based world modeling codebase.
☆155Jan 15, 2026Updated 6 months ago
heatz123 / tldr
View on GitHub
Official code for TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations
☆36Jan 24, 2026Updated 6 months ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
penn-pal-lab / peg
View on GitHub
Code for "Planning Goals for Exploration", ICLR2023 Spotlight. An unsupervised RL agent for hard exploration tasks.
☆83May 13, 2024Updated 2 years ago
Artur-Galstyan / jaxonloader
View on GitHub
A dataloader, but for JAX
☆20May 17, 2024Updated 2 years ago
VArdulov / ToMNet
View on GitHub
Reimplementation of ToMNet with some extensions for RL as well
☆14Apr 28, 2018Updated 8 years ago
seohongpark / ogbench
View on GitHub
A benchmark for offline goal-conditioned RL and offline RL
☆442Jan 14, 2026Updated 6 months ago
automl / CARL
View on GitHub
Benchmarking RL generalization in an interpretable way.
☆183Nov 20, 2025Updated 8 months ago
MichalBortkiewicz / JaxGCRL
View on GitHub
Online Goal-Conditioned Reinforcement Learning in JAX. ICLR 2025 Spotlight.
☆273Jun 6, 2026Updated last month
FLAIROx / cultural-accumulation
View on GitHub
☆16Jul 16, 2024Updated 2 years ago