Official repo for ICCV25 Highlight Paper: "ObjectRelator: Enabling Cross-View Object Relation Understanding in Ego-Centric and Exo-Centric Perspectives"
☆54Oct 7, 2025Updated 4 months ago
Alternatives and similar repositories for ObjectRelator
Users that are interested in ObjectRelator are comparing it to the libraries listed below
- Repository for the paper: ME-D2N: Multi-Expert Domain Decompositional Network for Cross-Domain Few-Shot Learning☆22Mar 10, 2024Updated last year
- Repository for the CVPR-2023 paper: StyleAdv: Meta Style Adversarial Training for Cross-Domain Few-Shot Learning☆64Jun 2, 2025Updated 9 months ago
- 🚀 CCF DDL Tracker: a lightweight chrome extension for tracking CCF deadlines (Ongoing...)☆22Feb 16, 2026Updated 2 weeks ago
- Reasoning in Space via Grounding in the World☆50Nov 3, 2025Updated 4 months ago
- ☆14Dec 21, 2023Updated 2 years ago
- A toolbox of compositional scene representation learning methods and benchmark datasets.☆12Mar 2, 2024Updated 2 years ago
- [BMVC 2025] Occam’s LGS: An Efficient Approach for Language Gaussian Splatting☆60Nov 18, 2025Updated 3 months ago
- ☆20Mar 2, 2025Updated last year
- [NeurIPS 2025] Domain-RAG: Retrieval-Guided Compositional Image Generation for Cross-Domain Few-Shot Object Detection☆60Feb 2, 2026Updated last month
- Rui Qian, Xin Yin, Dejing Dou†: Reasoning to Attend: Try to Understand How <SEG> Token Works (CVPR 2025)☆51Feb 4, 2026Updated 3 weeks ago
- The champion solution for Ego4D Natural Language Queries Challenge in CVPR 2023☆18Jan 23, 2024Updated 2 years ago
- Official Implementation of VideoRFSplat: Direct Scene-Level Text-to-3D Gaussian Splatting Generation with Flexible Pose and Multi-View Jo…☆23Jun 27, 2025Updated 8 months ago
- [ICCV 2025] RAGNet: Large-scale Reasoning-based Affordance Segmentation Benchmark towards General Grasping☆36Nov 21, 2025Updated 3 months ago
- 🎨Official Repo for Every Painting Awakened: A Training-free Framework for Painting-to-Animation Generation☆55Apr 10, 2025Updated 10 months ago
- [CVPR 2025 highlight] Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision☆36Dec 2, 2025Updated 3 months ago
- Unifying 2D and 3D Vision-Language Understanding☆121Jul 23, 2025Updated 7 months ago
- A benchmark for cross-domain few-shot object detection (ECCV24 paper: Cross-Domain Few-Shot Object Detection via Enhanced Open-Set Object…☆199Dec 10, 2025Updated 2 months ago
- Paper: UniGS: Unified Language-Image-3D Pretraining with Gaussian Splatting☆31Jun 5, 2025Updated 8 months ago
- Consistent Autoregressive Video Generation with Long Context☆67Feb 6, 2026Updated 3 weeks ago
- Code for Open3DTrack: Towards Open-Vocabulary 3D Multi-Object Tracking☆33Mar 14, 2025Updated 11 months ago
- The official implementation of our work Hawkeye: Discovering and Grounding Implicit Anomalous Sentiment in Recon-videos via Scene-enhanc…☆12Oct 14, 2024Updated last year
- ☆28Aug 6, 2025Updated 6 months ago
- Official repo for "Streaming Video Understanding and Multi-round Interaction with Memory-enhanced Knowledge" ICLR2025☆101Mar 14, 2025Updated 11 months ago
- (CVPRW2025) Solution of the NTIRE 2025 Challenge on Efficient Super-Resolution☆48Sep 7, 2025Updated 5 months ago
- [CVPR2024] Open-Vocabulary Semantic Segmentation with Image Embedding Balancing☆40Jan 12, 2026Updated last month
- [ICCV 2025] MagicMotion: Controllable Video Generation with Dense-to-Sparse Trajectory Guidance☆178Feb 11, 2026Updated 2 weeks ago
- Finetuning & extending DiffusionDet to video & pedestrian multi-object-tracking☆13Apr 12, 2023Updated 2 years ago
- The repository of VG-Refiner paper☆17Dec 9, 2025Updated 2 months ago
- [ICRA 2026] StereoAdapter: Adapting Stereo Depth Estimation to Underwater Scenes☆20Feb 17, 2026Updated 2 weeks ago
- ☆41Dec 10, 2024Updated last year
- [CVPRW'25] Official Code for “Enhance Then Search: An Augmentation-Search Strategy with Foundation Models for Cross-Domain Few-Shot Objec…☆51Oct 24, 2025Updated 4 months ago
- [ICCV 2025 Workshop Outstanding Paper Award] VChain: Chain-of-Visual-Thought for Reasoning in Video Generation☆115Oct 7, 2025Updated 4 months ago
- (ICCV 2025) ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations☆129Nov 14, 2025Updated 3 months ago
- Video-o3: Native Interleaved Clue Seeking for Long Video Multi-Hop Reasoning☆80Feb 16, 2026Updated 2 weeks ago
- Code for paper "LLMs Can Evolve Continually on Modality for X-Modal Reasoning" NeurIPS2024☆41Dec 18, 2024Updated last year
- [ICCV 2025 Oral] SceneSplat - Gaussian Splatting-based Scene Understanding with Vision-Language Pretraining☆311Feb 11, 2026Updated 2 weeks ago
- ☆10Apr 7, 2025Updated 10 months ago
- DisTime: Distribution-based Time Representation for Video Large Language Models.☆18Jul 10, 2025Updated 7 months ago
- ☆44Jan 19, 2026Updated last month