manantomar / video-occupancy-modelsLinks
☆11Updated last year
Alternatives and similar repositories for video-occupancy-models
Users that are interested in video-occupancy-models are comparing it to the libraries listed below
Sorting:
- [ICML 2022] Official PyTorch implementation of the paper "Unsupervised Image Representation Learning with Deep Latent Particles"☆26Updated 2 years ago
- Slot-TTA shows that test-time adaptation using slot-centric models can improve image segmentation on out-of-distribution examples.☆26Updated 2 years ago
- ☆54Updated 3 months ago
- [CVPR 2025] Parallel Sequence Modeling via Generalized Spatial Propagation Network☆106Updated 3 months ago
- Code release for DriveGAN (CVPR 2021)☆96Updated 3 years ago
- Official implementation of DIP: Unsupervised Dense In-Context Post-training of Visual Representations☆45Updated last month
- [ICLR'24] GTA: A Geometry-Aware Attention Mechanism for Multi-view Transformers☆147Updated 6 months ago
- Official implementation of the paper "EgoPet: Egomotion and Interaction Data from an Animal's Perspective".☆28Updated last year
- ☆22Updated 10 months ago
- ☆37Updated 8 months ago
- ☆33Updated 3 years ago
- Video Autoencoder: self-supervised disentanglement of 3D structure and motion (ICCV 2021). Website: https://zlai0.github.io/VideoAutoenco…☆181Updated 4 years ago
- Code for paper Background Prompting for Improved Object Depth☆29Updated 2 years ago
- A paper list of world model☆29Updated 6 months ago
- This repository hosts the code for our paper, "Simple and Effective Synthesis of Indoor 3D Scenes".☆41Updated 3 years ago
- This repository is a collection of research papers on World Models.☆41Updated 2 years ago
- ☆93Updated 3 months ago
- ☆14Updated 6 months ago
- Implementation of Danijar's latest iteration for his Dreamer line of work☆88Updated this week
- Scaling Properties of Diffusion Models For Perceptual Tasks (CVPR 2025)☆43Updated 6 months ago
- [CVPR 2024 Highlight] SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers☆69Updated last year
- Code for "Recognizing Scenes from Novel Viewpoints"☆29Updated 3 years ago
- Cosmos-Predict2.5, the latest version of the Cosmos World Foundation Models (WFMs) family, specialized for simulating and predicting the …☆244Updated this week
- ☆122Updated 8 months ago
- [TMLR 2025] The official repository of the paper "Unsupervised Discovery of Object-Centric Neural Fields"☆17Updated 8 months ago
- [CVPR 2023] Code for "3D Concept Learning and Reasoning from Multi-View Images"☆80Updated last year
- [ICCV2023] EgoObjects: A Large-Scale Egocentric Dataset for Fine-Grained Object Understanding☆77Updated 2 years ago
- VQVAE for video prediction☆29Updated 3 years ago
- A Transformer made of Rotation-equivariant Attention using Vector Neurons☆93Updated 2 years ago
- official implementation of the paper: Towards End-to-End Generative Modeling of Long Videos with Memory-Efficient Bidirectional Transform…☆29Updated 2 years ago