[ECCV2024] Nonverbal Interaction Detection
☆29Oct 30, 2024Updated last year
Alternatives and similar repositories for NVI
Users that are interested in NVI are comparing it to the libraries listed below
Sorting:
- This is the official implementation of "GvSeg: General and Task-Oriented Video Segmentation" (Accepted at ECCV 2024).☆18Jul 15, 2024Updated last year
- (ICCV23 Oral) LOGICSEG: Parsing Visual Semantics with Neural Logic Learning and Reasoning☆23Apr 11, 2024Updated last year
- [NeurIPS 2025] VideoRFT: Incentivizing Video Reasoning Capability in MLLMs via Reinforced Fine-Tuning☆64Jan 6, 2026Updated 2 months ago
- This is the official implementation of "Clustering Propagation for Universal Medical Image Segmentation" (Accepted at CVPR 2024).☆42Apr 11, 2024Updated last year
- This is the official implementation of "Interpretable3D: An Ad-Hoc Interpretable Classifier for 3D Point Clouds" (Accepted at AAAI 2024).☆11May 4, 2024Updated last year
- Repository of our accepted CVPR2022 paper "Counterfactual Cycle-Consistent Learning for Instruction Following and Generation in Vision-La…☆28Mar 4, 2022Updated 4 years ago
- ☆99Sep 5, 2023Updated 2 years ago
- [NeurIPS2023] Neural-Logic Human-Object Interaction Detection☆14Aug 24, 2024Updated last year
- [WACV 2024] Instruct Me More! Random Prompting for Visual In-Context Learning☆18May 7, 2025Updated 10 months ago
- ☆18Apr 20, 2025Updated 10 months ago
- [NeurIPS 2022 Spotlight] GMMSeg: Gaussian Mixture based Generative Semantic Segmentation Models☆184Jan 20, 2024Updated 2 years ago
- ☆14Nov 28, 2024Updated last year
- ☆17May 19, 2023Updated 2 years ago
- 🔘Official codes for "SocialCircle+: Learning the Angle-based Conditioned Interaction Representation for Pedestrian Trajectory Prediction…☆18Nov 17, 2025Updated 3 months ago
- ☆17Jun 21, 2022Updated 3 years ago
- Repository of our CVPR2023 paper "Lana: A Language-Capable Navigator for Instruction Following and Generation"☆94Apr 27, 2023Updated 2 years ago
- CVPR 2023 Accepted Paper HOICLIP: Efficient Knowledge Transfer for HOI Detection with Vision-Language Models☆69Mar 14, 2024Updated last year
- ☆35Aug 26, 2024Updated last year
- The repository of ECCV 2020 paper `Active Visual Information Gathering for Vision-Language Navigation`☆44Apr 9, 2022Updated 3 years ago
- ☆16Sep 17, 2025Updated 5 months ago
- [NeurIPS'2023] Zero-shot Visual Relation Detection via Composite Visual Cues from Large Language Models☆22Oct 21, 2025Updated 4 months ago
- (ICCV2019) Learning Compositional Neural Infomation Fusion for Human Parsing☆37Jul 25, 2024Updated last year
- [TPAMI 2023] Local-Global Context Aware Transformer for Language-Guided Video Segmentation☆48Jan 20, 2024Updated 2 years ago
- Official code of ACM MM2024 paper- Unseen No More: Unlocking the Potential of CLIP for Generative Zero-shot HOI Detection☆24Aug 15, 2024Updated last year
- (ICLR25 Oral) Do as We Do, Not as You Think: the Conformity of Large Language Models☆40Feb 6, 2026Updated last month
- [CVPR24] Volumetric Environment Representation for Vision-Language Navigation☆137Sep 9, 2024Updated last year
- Official PyTorch implementation of the ICML 2024 paper "Hyperbolic Active Learning for Semantic Segmentation under Domain Shift"☆26Nov 26, 2024Updated last year
- Semi-supervised Semantic Segmentation on the ImageNet-S dataset☆21Mar 20, 2023Updated 2 years ago
- (TMI-2024) Video-Instrument Synergistic Network for Referring Video Instrument Segmentation in Robotic Surgery☆25Nov 13, 2024Updated last year
- Official code of the paper ORacle: Large Vision-Language Models for Knowledge-Guided Holistic OR Domain Modeling accepted at MICCAI 2024.☆24Jan 6, 2025Updated last year
- [TMI'22]Exploring Intra- and Inter-Video Relation for Surgical Semantic Scene Segmentation☆23Dec 20, 2022Updated 3 years ago
- Official code of the paper 4D-OR: Semantic Scene Graphs for OR Domain Modeling accepted at MICCAI 2022. This repo includes both the datas…☆63Mar 29, 2025Updated 11 months ago
- [NIPS2023] This is an official implementation of paper "DAC-DETR: Divide the Attention Layers and Conquer".☆65Jun 26, 2024Updated last year
- ☆25Apr 16, 2022Updated 3 years ago
- Video Feature Enhancement with PyTorch☆32Nov 28, 2024Updated last year
- [ECCV24] Navigation Instruction Generation with BEV Perception and Large Language Models☆30Jul 16, 2024Updated last year
- [ICCV'23] Official PyTorch implementation for paper "Exploring Predicate Visual Context in Detecting Human-Object Interactions"☆89Jul 4, 2024Updated last year
- Segment-Anything-2 (SAM 2) fine tune with COCO data☆14Aug 20, 2024Updated last year
- [AAAI 2025] Official implementation of the paper "EOV-Seg: Efficient Open-Vocabulary Panoptic Segmentation"☆37Dec 17, 2024Updated last year