huiwon-jang / RSP
Visual Representation Learning with Stochastic Frame Prediction (ICML 2024)
☆22Updated 9 months ago
Alternatives and similar repositories for RSP
Users interested in RSP are comparing it to the repositories listed below.
- Code release for ICLR 2023 paper: SlotFormer on object-centric dynamics models☆111Updated last year
- Code for Stable Control Representations☆25Updated 4 months ago
- Official Code for Neural Systematic Binder☆33Updated 2 years ago
- VP2 Benchmark (A Control-Centric Benchmark for Video Prediction, ICLR 2023)☆27Updated 5 months ago
- ☆56Updated 2 years ago
- Masked World Models for Visual Control☆129Updated 2 years ago
- (CVPR 2025) A Data-Centric Revisit of Pre-Trained Vision Models for Robot Learning☆16Updated 5 months ago
- Official code for Slot-Transformer for Videos (STEVE)☆49Updated 2 years ago
- ☆80Updated 2 weeks ago
- ☆43Updated last year
- This repository is the official implementation of Improving Object-centric Learning With Query Optimization☆51Updated 2 years ago
- PyTorch implementation of the Hiveformer research paper☆49Updated 2 years ago
- NeurIPS 2022 Paper "VLMbench: A Compositional Benchmark for Vision-and-Language Manipulation"☆96Updated 3 months ago
- ☆55Updated 8 months ago
- ☆42Updated last year
- Code for the ICLR 2024 spotlight paper: "Learning to Act without Actions" (introducing Latent Action Policies)☆117Updated last year
- Personal Python toolbox☆16Updated last year
- ☆44Updated last year
- Official repository for "VIP: Towards Universal Visual Reward and Representation via Value-Implicit Pre-Training"☆171Updated last year
- Object-Centric-Representation Library (OCRL): This repo is to explore OCR on various downstream tasks from supervised learning tasks to R…☆12Updated last year
- Official repository for "LIV: Language-Image Representations and Rewards for Robotic Control" (ICML 2023)☆118Updated last year
- An unofficial pytorch dataloader for Open X-Embodiment Datasets https://github.com/google-deepmind/open_x_embodiment☆18Updated 7 months ago
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks☆57Updated 11 months ago
- [NeurIPS 2024] GenRL: Multimodal-foundation world models enable grounding language and video prompts into embodied domains, by turning th…☆79Updated 4 months ago
- [ICML 2025] OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction☆102Updated 4 months ago
- [WIP] Code for LangToMo☆16Updated 2 months ago
- [ICRA2023] Grounding Language with Visual Affordances over Unstructured Data☆45Updated last year
- ☆39Updated 3 years ago
- ☆12Updated last year
- ☆72Updated 10 months ago