manantomar / video-occupancy-modelsLinks
☆12Updated last year
Alternatives and similar repositories for video-occupancy-models
Users that are interested in video-occupancy-models are comparing it to the libraries listed below
Sorting:
- Slot-TTA shows that test-time adaptation using slot-centric models can improve image segmentation on out-of-distribution examples.☆26Updated 2 years ago
- Code release for DriveGAN (CVPR 2021)☆98Updated 4 years ago
- [CVPR 2025] Parallel Sequence Modeling via Generalized Spatial Propagation Network☆111Updated 6 months ago
- [ICML 2022] Official PyTorch implementation of the paper "Unsupervised Image Representation Learning with Deep Latent Particles"☆26Updated 2 years ago
- ☆14Updated 10 months ago
- This repository hosts the code for our paper, "Simple and Effective Synthesis of Indoor 3D Scenes".☆42Updated 3 years ago
- Code for "Recognizing Scenes from Novel Viewpoints"☆29Updated 3 years ago
- [ICLR'24] GTA: A Geometry-Aware Attention Mechanism for Multi-view Transformers☆151Updated last month
- Code for paper Background Prompting for Improved Object Depth☆29Updated 2 years ago
- Video Autoencoder: self-supervised disentanglement of 3D structure and motion (ICCV 2021). Website: https://zlai0.github.io/VideoAutoenco…☆182Updated 4 years ago
- Implementation of the proposed LVMAE, from the paper, Extending Video Masked Autoencoders to 128 frames, in Pytorch☆55Updated last year
- Implementation of Dreamcraft3D, 3D content generation in Pytorch☆81Updated 2 years ago
- HD-EPIC Python script to download the entire datasets or parts of it☆17Updated 4 months ago
- VQVAE for video prediction☆31Updated 3 years ago
- ☆95Updated 6 months ago
- ☆38Updated last year
- Official implementation of the paper "EgoPet: Egomotion and Interaction Data from an Animal's Perspective".☆29Updated last month
- ☆130Updated 11 months ago
- Official implementation of DIP: Unsupervised Dense In-Context Post-training of Visual Representations☆46Updated 5 months ago
- ☆22Updated last year
- A Transformer made of Rotation-equivariant Attention using Vector Neurons☆101Updated 2 years ago
- ☆33Updated 3 years ago
- Single-pass Adaptive Image Tokenization for Minimum Program Search | What's the Kolmogorov Complexity of an Image?☆42Updated 6 months ago
- (CVPR 2023) Seeing a Rose in Five Thousand Ways☆119Updated 2 years ago
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks☆58Updated last year
- Scaling Properties of Diffusion Models For Perceptual Tasks (CVPR 2025)☆44Updated 9 months ago
- [CVPR 2024 Highlight] SPOT: Self-Training with Patch-Order Permutation for Object-Centric Learning with Autoregressive Transformers☆73Updated last year
- [NeurIPS 2022] code for "Visual Concepts Tokenization"☆23Updated 3 years ago
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.☆34Updated last year
- Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth ?☆144Updated last year