huiwon-jang / RSP
Visual Representation Learning with Stochastic Frame Prediction (ICML 2024)
☆18Updated 3 months ago
Alternatives and similar repositories for RSP:
Users that are interested in RSP are comparing it to the libraries listed below
- Official Code for Neural Systematic Binder☆32Updated last year
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks☆59Updated 5 months ago
- ☆56Updated last year
- VP2 Benchmark (A Control-Centric Benchmark for Video Prediction, ICLR 2023)☆27Updated 2 weeks ago
- 📎 + 🦾 CLIP-RT: Learning Language-Conditioned Robotic Policies from Natural Language Supervision☆14Updated 4 months ago
- Code for Stable Control Representations☆23Updated 2 months ago
- ☆42Updated 10 months ago
- Code release for ICLR 2023 paper: SlotFormer on object-centric dynamics models☆105Updated last year
- Official code for Slot-Transformer for Videos (STEVE)☆49Updated 2 years ago
- ☆43Updated last year
- Code for paper "Grounding Video Models to Actions through Goal Conditioned Exploration".☆42Updated 2 months ago
- Official Implementation of CL-ALFRED (ICLR'24)☆20Updated 4 months ago
- ☆65Updated 6 months ago
- ☆16Updated 2 years ago
- ☆46Updated 3 months ago
- Personal Python toolbox☆15Updated 8 months ago
- An unofficial pytorch dataloader for Open X-Embodiment Datasets https://github.com/google-deepmind/open_x_embodiment☆13Updated 2 months ago
- Official implementation of the CVPR'24 paper [Adaptive Slot Attention: Object Discovery with Dynamic Slot Number]☆35Updated last month
- This repository is the official implementation of Improving Object-centric Learning With Query Optimization☆50Updated last year
- Implantation of CtrlFormer☆28Updated 2 years ago
- Instruction Following Agents with Multimodal Transforemrs☆52Updated 2 years ago
- The repository of ICML2023 paper: On Pre-Training for Visuo-Motor Control: Revisiting a Learning-from-Scratch Baseline☆23Updated last year
- [ICRA2023] Grounding Language with Visual Affordances over Unstructured Data☆42Updated last year
- ☆67Updated last month
- Masked World Models for Visual Control☆120Updated last year
- NeurIPS 2022 Paper "VLMbench: A Compositional Benchmark for Vision-and-Language Manipulation"☆90Updated 2 years ago
- ☆41Updated last year
- ☆13Updated 9 months ago