x360dataset / x360dataset-kitLinks
☆30Updated 4 months ago
Alternatives and similar repositories for x360dataset-kit
Users that are interested in x360dataset-kit are comparing it to the libraries listed below
Sorting:
- Offical repo for ICCV25 Highlight Paper: "ObjectRelator: Enabling Cross-View Object Relation Understanding in Ego-Centric and Exo-Centric…☆51Updated last month
- Code for the paper "Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation", ECCV 2024☆44Updated last year
- ☆31Updated last year
- Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs☆52Updated 4 months ago
- [CVPR 2024] Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training☆42Updated last year
- [ECCV 2024] Code for Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation☆33Updated 8 months ago
- [CVPR 2024] Exploiting Diffusion Prior for Generalizable Dense Prediction☆80Updated last year
- [CVPR 2025] Open-World Amodal Appearance Completion☆41Updated last week
- [NeurlPS 2024] One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos☆140Updated 10 months ago
- [NeurIPS 2023] Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animator☆96Updated last year
- Official PyTorch Implementation of "Latent Denoising Makes Good Visual Tokenizers"☆157Updated last month
- Visual Spatial Tuning☆133Updated this week
- Implementation of CamTrol: Training-free Camera Control for Video Generation☆28Updated last month
- Video-Panda: Parameter-efficient Alignment for Encoder-free Video-Language Models [CVPR 2025]☆75Updated 4 months ago
- [ECCV2024, Oral, Best Paper Finalist] This is the official implementation of the paper "LEGO: Learning EGOcentric Action Frame Generation…☆39Updated 8 months ago
- [WACV2025] Official PyTorch implementation of TrackDiffusion (https://arxiv.org/abs/2312.00651)☆79Updated last year
- ☆40Updated 4 months ago
- ☆80Updated 5 months ago
- SpatialScore: Towards Unified Evaluation for Multimodal Spatial Understanding☆58Updated 4 months ago
- ☆51Updated 6 months ago
- FQGAN: Factorized Visual Tokenization and Generation☆54Updated 7 months ago
- [ICCV 2025 Oral] Official implementation of Learning Streaming Video Representation via Multitask Training.☆66Updated last month
- Official Implementation of Paper Transfer between Modalities with MetaQueries☆269Updated last month
- Official code for MotionBench (CVPR 2025)☆59Updated 8 months ago
- VideoDirector [CVPR 2025]☆32Updated 7 months ago
- [ECCV2024] PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects☆54Updated last year
- Official Implementation of paper "A Tale of Two Features: Stable Diffusion Complements DINO for Zero-Shot Semantic Correspondence"☆341Updated last year
- Official PyTorch Implementation for Diffusion Hyperfeatures, NeurIPS 2023☆109Updated last year
- [CVPR 2025] Official PyTorch Implementation of GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmenta…☆60Updated 4 months ago
- Official implementation for "Diffusion Model is Secretly a Training-free Open Vocabulary Semantic Segmenter"☆49Updated last month