manantomar / video-occupancy-modelsLinks
☆11Updated last year
Alternatives and similar repositories for video-occupancy-models
Users that are interested in video-occupancy-models are comparing it to the libraries listed below
Sorting:
- Slot-TTA shows that test-time adaptation using slot-centric models can improve image segmentation on out-of-distribution examples.☆26Updated 2 years ago
- [ICML 2022] Official PyTorch implementation of the paper "Unsupervised Image Representation Learning with Deep Latent Particles"☆26Updated last year
- [CVPR 2025] Parallel Sequence Modeling via Generalized Spatial Propagation Network☆106Updated 2 months ago
- ☆53Updated last month
- Code release for DriveGAN (CVPR 2021)☆95Updated 3 years ago
- Official implementation of the paper "EgoPet: Egomotion and Interaction Data from an Animal's Perspective".☆27Updated last year
- Video Autoencoder: self-supervised disentanglement of 3D structure and motion (ICCV 2021). Website: https://zlai0.github.io/VideoAutoenco…☆181Updated 3 years ago
- [ICLR'24] GTA: A Geometry-Aware Attention Mechanism for Multi-view Transformers☆145Updated 5 months ago
- Implementation of the proposed LVMAE, from the paper, Extending Video Masked Autoencoders to 128 frames, in Pytorch☆53Updated 9 months ago
- Official implementation of DIP: Unsupervised Dense In-Context Post-training of Visual Representations☆44Updated 2 weeks ago
- Code for "Recognizing Scenes from Novel Viewpoints"☆29Updated 3 years ago
- ☆93Updated last month
- [CVPR 2024 Highlight] SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers☆69Updated last year
- ☆21Updated 9 months ago
- ☆37Updated 7 months ago
- Code for paper Background Prompting for Improved Object Depth☆29Updated 2 years ago
- [ICCV2023] EgoObjects: A Large-Scale Egocentric Dataset for Fine-Grained Object Understanding☆77Updated last year
- Implementation of Dreamcraft3D, 3D content generation in Pytorch☆80Updated last year
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks☆58Updated 11 months ago
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models☆92Updated last year
- ☆13Updated 5 months ago
- ☆121Updated 7 months ago
- ☆33Updated 3 years ago
- [ICLR 2022] RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning☆63Updated 3 years ago
- VQVAE for video prediction☆27Updated 3 years ago
- ☆26Updated 2 years ago
- [TMLR 2024] Official PyTorch Implementation of Deep Dynamic Latent Particles☆16Updated last year
- This repository hosts the code for our paper, "Simple and Effective Synthesis of Indoor 3D Scenes".☆40Updated 3 years ago
- Evaluating pre-trained navigation agents under corruptions☆30Updated 4 years ago
- Compositional Object Light Fields code☆26Updated 2 years ago