[ECCV2022] A PyTorch implementation of the paper "Spatial and Visual Perspective-Taking via View Rotation and Relation Reasoning for Embodied Reference Understanding"
☆13Mar 20, 2023Updated 3 years ago
Alternatives and similar repositories for REP-ERU
Users that are interested in REP-ERU are comparing it to the libraries listed below
- [ICLR 2024] The official implementation of Zip-Your-Clip☆35Mar 14, 2024Updated 2 years ago
- [ECCV 2024] The official PyTorch implementation of the "Part2Object: Hierarchical Unsupervised 3D Instance Segmentation".☆25Sep 12, 2024Updated last year
- [ECCV 2024] The official PyTorch implementation of the "Plain-Det: A Plain Multi-Dataset Object Detector".☆30Dec 8, 2024Updated last year
- [CVPR2025] Rethinking Query-based Transformer for Continual Image Segmentation☆44Jul 16, 2025Updated 8 months ago
- The official PyTorch implementation of the CVPR 2023 paper "Contrastive Grouping with Transformer for Referring Image Segmentation".☆50Apr 17, 2024Updated last year
- [NeurIPS 2023] Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animator☆98Mar 18, 2024Updated 2 years ago
- [ICCV2023] CoTDet: Affordance Knowledge Prompting for Task Driven Object Detection☆19Apr 23, 2025Updated 10 months ago
- ☆19Jul 5, 2023Updated 2 years ago
- [ICLR 2025] MVTokenFlow: High-quality 4D Content Generation using Multiview Token Flow☆26Apr 9, 2025Updated 11 months ago
- [NeurIPS 2023] DDCoT: Duty-Distinct Chain-of-Thought Prompting for Multimodal Reasoning in Language Models☆49Mar 18, 2024Updated 2 years ago
- ☆15Feb 26, 2026Updated 3 weeks ago
- SLSDeep: Skin Lesion Segmentation Based on Dilated Residual and Pyramid Pooling Networks☆14Jun 28, 2018Updated 7 years ago
- Animated dismissible alerts.☆11Jul 3, 2023Updated 2 years ago
- 2022 PC build notes: Lian Li L216 case with a 13600K and a 4090 graphics card☆12Jan 17, 2023Updated 3 years ago
- Code for "Visual Spatial Description: Controlled Spatial-Oriented Image-to-Text Generation"☆26Mar 9, 2024Updated 2 years ago
- ☆20Nov 11, 2019Updated 6 years ago
- [TOG 2025] Order Matters: Learning Element Ordering for Graphic Design Generation☆24Aug 5, 2025Updated 7 months ago
- Learning notes for implementing OpenCV image processing in Python on the Raspberry Pi☆11Apr 11, 2019Updated 6 years ago
- Code associated with paper "Wandering Within a World: Online Contextualized Few-Shot Learning"☆25Jul 18, 2021Updated 4 years ago
- ☆16Nov 12, 2024Updated last year
- ☆21Dec 23, 2025Updated 2 months ago
- [NeurIPS 2025] The official PyTorch implementation of the "Vision Function Layer in MLLM".☆28Dec 18, 2025Updated 3 months ago
- Towards Video Text Visual Question Answering: Benchmark and Baseline☆40Feb 26, 2024Updated 2 years ago
- This is the official repository of OCL (ICCV 2023).☆26Mar 28, 2024Updated last year
- [ECCV2024] Nonverbal Interaction Detection☆29Oct 30, 2024Updated last year
- ☆13Jul 20, 2022Updated 3 years ago
- Code for "NVUM: Non-volatile Unbiased Memory for Robust Medical Classification" [MICCAI 2022 Early Accept]☆12Sep 6, 2022Updated 3 years ago
- Official repo of the paper “AL-GTD: Deep Active Learning for Gaze Target Detection” (ACMMM2024)☆12Nov 29, 2024Updated last year
- Task-Focused Few-Shot Object Detection Benchmark☆14Jun 24, 2025Updated 8 months ago
- Code for paper "Masked Pre-training Enables Universal Zero-shot Denoiser" [NeurIPS 2024].☆35Nov 20, 2024Updated last year
- Official repo for our ICML 23 paper: "Multi-Modal Classifiers for Open-Vocabulary Object Detection"☆95Jun 22, 2023Updated 2 years ago
- Repository for Nature Communications paper entitled "Sleep-like Unsupervised Replay Reduces Catastrophic Forgetting in Artificial Neural …☆14Oct 28, 2022Updated 3 years ago
- This repository contains codes for AortaSeg24 Grand-Challenge.☆19Nov 22, 2024Updated last year
- CMIVQA☆18Jun 3, 2024Updated last year
- Multi-model video-to-text by combining embeddings from Flan-T5 + CLIP + Whisper + SceneGraph. The 'backbone LLM' is pre-trained from scra…☆54Apr 21, 2023Updated 2 years ago
- The Continual Learning App☆13Nov 3, 2021Updated 4 years ago
- Modality Gap–Driven Subspace Alignment Training Paradigm For Multimodal Large Language Models☆53Feb 23, 2026Updated 3 weeks ago
- Homepage:☆36Mar 13, 2026Updated last week
- [AAAI 2025] More Text, Less Point: Towards 3D Data-Efficient Point-Language Understanding☆26May 27, 2025Updated 9 months ago