manantomar / video-occupancy-models
☆11Updated 8 months ago
Alternatives and similar repositories for video-occupancy-models:
Users that are interested in video-occupancy-models are comparing it to the libraries listed below
- [NeurIPS 2022] code for "Visual Concepts Tokenization"☆21Updated 2 years ago
- ☆13Updated last month
- Slot-TTA shows that test-time adaptation using slot-centric models can improve image segmentation on out-of-distribution examples.☆26Updated last year
- [ICML 2022] Official PyTorch implementation of the paper "Unsupervised Image Representation Learning with Deep Latent Particles"☆26Updated last year
- Code for paper Background Prompting for Improved Object Depth☆29Updated last year
- A paper list of world model☆25Updated 10 months ago
- ☆21Updated 3 months ago
- ☆32Updated 3 years ago
- Compositional Object Light Fields code☆26Updated 2 years ago
- ☆10Updated 8 months ago
- Official implementation of: "Object-Centric Video Prediction via Decoupling of Object Dynamics and Interactions" by Villar-Corrales et al…☆16Updated last year
- VQVAE for video prediction☆27Updated 2 years ago
- ☆47Updated last month
- ☆88Updated 2 months ago
- [ICLR'24] GTA: A Geometry-Aware Attention Mechanism for Multi-view Transformers☆125Updated 9 months ago
- Implementation of the proposed LVMAE, from the paper, Extending Video Masked Autoencoders to 128 frames, in Pytorch☆47Updated 4 months ago
- Agent-to-Sim Learning Interactive Behavior from Casual Videos.☆42Updated 5 months ago
- Official implementation of the 2024 ECCV paper SHIC: Shape-Image Correspondences with no Keypoint Annotation☆37Updated 5 months ago
- Official implementation of the paper "EgoPet: Egomotion and Interaction Data from an Animal's Perspective".☆26Updated 8 months ago
- [ICCV 2023] Learning Fine-Grained Features for Pixel-wise Video Correspondences☆17Updated last year
- Code for NeurIPS 2022 paper "Learning Viewpoint-Agnostic Visual Representations by Recovering Tokens in 3D Space"☆20Updated last year
- ☆14Updated last year
- [ICLR 2022] RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning☆63Updated 2 years ago
- Independent PyTorch Implementation of Object Scene Representation Transformer☆48Updated last year
- [CVPR 2024 Highlight] SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers☆62Updated 9 months ago
- Source code release for "Leveraging Demonstrations with Latent Space Priors"☆40Updated 2 years ago
- An operation trying to do the opposite of F.grid_sample☆20Updated last year
- ☆28Updated last month
- PyTorch implementation of "Sample- and Parameter-Efficient Auto-Regressive Image Models" from CVPR 2025☆11Updated 2 weeks ago
- [NeurIPS 2023 Spotlight] Code for "Contrastive Lift: 3D Object Instance Segmentation by Slow-Fast Contrastive Fusion"☆67Updated last year