manantomar / video-occupancy-modelsLinks
☆11Updated last year
Alternatives and similar repositories for video-occupancy-models
Users that are interested in video-occupancy-models are comparing it to the libraries listed below
Sorting:
- Code release for DriveGAN (CVPR 2021)☆96Updated 3 years ago
- [ICML 2022] Official PyTorch implementation of the paper "Unsupervised Image Representation Learning with Deep Latent Particles"☆27Updated last year
- Slot-TTA shows that test-time adaptation using slot-centric models can improve image segmentation on out-of-distribution examples.☆26Updated 2 years ago
- [CVPR 2025] Parallel Sequence Modeling via Generalized Spatial Propagation Network☆100Updated 3 weeks ago
- This repository hosts the code for our paper, "Simple and Effective Synthesis of Indoor 3D Scenes".☆40Updated 3 years ago
- Code for paper Background Prompting for Improved Object Depth☆29Updated last year
- Official implementation of the paper "EgoPet: Egomotion and Interaction Data from an Animal's Perspective".☆26Updated last year
- ☆10Updated last year
- ☆13Updated 3 months ago
- ☆44Updated last week
- ☆21Updated 7 months ago
- Single-pass Adaptive Image Tokenization for Minimum Program Search | What's the Kolmogorov Complexity of an Image?☆23Updated last week
- ☆32Updated 3 years ago
- Video Autoencoder: self-supervised disentanglement of 3D structure and motion (ICCV 2021). Website: https://zlai0.github.io/VideoAutoenco…☆181Updated 3 years ago
- [NeurIPS 2022] code for "Visual Concepts Tokenization"☆22Updated 2 years ago
- VQVAE for video prediction☆27Updated 3 years ago
- Scaling Properties of Diffusion Models For Perceptual Tasks (CVPR 2025)☆40Updated 2 months ago
- [CVPR 2024 Highlight] SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers☆66Updated last year
- Code for "Recognizing Scenes from Novel Viewpoints"☆29Updated 2 years ago
- ☆14Updated 9 months ago
- A paper list of world model☆28Updated 3 months ago
- ☆37Updated 5 months ago
- Code for NeurIPS 2022 paper "Learning Viewpoint-Agnostic Visual Representations by Recovering Tokens in 3D Space"☆20Updated 2 years ago
- [ICLR'24] GTA: A Geometry-Aware Attention Mechanism for Multi-view Transformers☆135Updated 3 months ago
- ☆22Updated 3 months ago
- [NeurIPS 2023 Spotlight] Code for "Contrastive Lift: 3D Object Instance Segmentation by Slow-Fast Contrastive Fusion"☆71Updated last year
- Code release for the paper "Egocentric Video Task Translation" (CVPR 2023 Highlight)☆33Updated 2 years ago
- This repository is a collection of research papers on World Models.☆39Updated last year
- Official Implementation of Nabla-GFlowNet (ICLR 2025)☆24Updated 2 months ago
- An operation trying to do the opposite of F.grid_sample☆20Updated last year