x360dataset / x360dataset-kit
☆14Updated last month
Related projects: ⓘ
- ☆57Updated last year
- [ECCV 2024] Code for Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation☆24Updated 2 months ago
- [CVPR 2024] Exploiting Diffusion Prior for Generalizable Dense Prediction☆53Updated 5 months ago
- ☆38Updated 9 months ago
- [CVPR 2024] The official implementation of paper "Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training"☆24Updated 4 months ago
- Official implementation of the CVPR'24 paper [Adaptive Slot Attention: Object Discovery with Dynamic Slot Number]☆19Updated 3 weeks ago
- ☆52Updated 2 months ago
- [TIP 2023] Co-Learning Meets Stitch-Up for Noisy Multi-label Visual Recognition.☆13Updated last year
- The offical implemention of JM3D.☆27Updated 11 months ago
- [NeurIPS2023] Implementation of the paper: Explore In-Context Learning for 3D Point Cloud Understanding☆62Updated 5 months ago
- ☆26Updated last week
- ☆28Updated 9 months ago
- 😎 Awesome lists of papers and codes about open-vocabulary perception, including both 3D and 2D☆20Updated 2 months ago
- Code Release of F-LMM: Grounding Frozen Large Multimodal Models☆35Updated last month
- ☆41Updated this week
- [ECCV 2024] Official implementation of the paper "Towards Latent Masked Image Modeling for Self-Supervised Visual Representation Learning…☆19Updated last month
- A PyTorch implementation of TVC☆24Updated 9 months ago
- Official code repo of PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs☆22Updated 3 months ago
- Official PyTorch Implementation for Diffusion Hyperfeatures, NeurIPS 2023☆85Updated last week
- 😎 Awesome lists of papers and codes about Large Vision-Language Models☆12Updated 5 months ago
- The official repository for ICLR2024 paper "FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition"☆55Updated 5 months ago
- official repository of CVPR 2024 paper, RMem: Restricted Memory Banks Improve Video Object Segmentation☆29Updated 3 weeks ago
- ☆36Updated 4 months ago
- [CVPR2022 Oral] 3DJCG: A Unified Framework for Joint Dense Captioning and Visual Grounding on 3D Point Clouds☆51Updated last year
- ☆45Updated last year
- ☆25Updated 11 months ago
- [ICCV’23] Official repository for "TextManiA: Enriching Visual Feature by Text-driven Manifold Augmentation"☆19Updated 10 months ago
- [CVPR 2024] BIVDiff: A Training-free Framework for General-Purpose Video Synthesis via Bridging Image and Video Diffusion Models☆49Updated last week
- Code for the paper Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models @ CVPR 2024☆53Updated 3 months ago
- Not All Steps are Created Equal: Selective Diffusion Distillation for Image Manipulation (ICCV 2023)☆61Updated 11 months ago