[CVPR 2022] Visual Abductive Reasoning
☆124Oct 22, 2024Updated last year
Alternatives and similar repositories for VAR
Users that are interested in VAR are comparing it to the libraries listed below
Sorting:
- Official Pytorch implementation of "Visual Recognition with Deep Nearest Centroids". (ICLR2023 Spotlight)☆69Feb 1, 2023Updated 3 years ago
- [ECCV2024] Nonverbal Interaction Detection☆29Oct 30, 2024Updated last year
- Implementation for "DeltaPhi: Learning Physical Trajectory Residual for PDE Solving"☆13Jun 17, 2024Updated last year
- [NeurIPS 2022 Spotlight] GMMSeg: Gaussian Mixture based Generative Semantic Segmentation Models☆184Jan 20, 2024Updated 2 years ago
- Repository of our accepted CVPR2022 paper "Counterfactual Cycle-Consistent Learning for Instruction Following and Generation in Vision-La…☆28Mar 4, 2022Updated 3 years ago
- [TPAMI 2023] Local-Global Context Aware Transformer for Language-Guided Video Segmentation☆48Jan 20, 2024Updated 2 years ago
- This is the official implementation of "GvSeg: General and Task-Oriented Video Segmentation" (Accepted at ECCV 2024).☆18Jul 15, 2024Updated last year
- (ICCV23 Oral) LOGICSEG: Parsing Visual Semantics with Neural Logic Learning and Reasoning☆23Apr 11, 2024Updated last year
- [CVPR'24] Neural Clustering based Visual Representation Learning☆44Oct 6, 2025Updated 4 months ago
- ☆14Dec 11, 2024Updated last year
- [NeurIPS 2022 Spotlight] Learning Equivariant Segmentation with Instance-Unique Querying☆22Dec 17, 2022Updated 3 years ago
- Repository of our CVPR2023 paper "Lana: A Language-Capable Navigator for Instruction Following and Generation"☆94Apr 27, 2023Updated 2 years ago
- Official repository of DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models☆89Feb 6, 2026Updated 3 weeks ago
- This is the official implementation of "Interpretable3D: An Ad-Hoc Interpretable Classifier for 3D Point Clouds" (Accepted at AAAI 2024).☆11May 4, 2024Updated last year
- Implementation of the paper Knowledge-Enhanced Dual-stream Zero-shot Composed Image Retrieval (CVPR 2024)☆20Nov 4, 2024Updated last year
- [ICCV23] Bird’s-Eye-View Scene Graph for Vision-Language Navigation☆123Apr 12, 2024Updated last year
- ICLR 2023 DeCap: Decoding CLIP Latents for Zero-shot Captioning☆138Mar 16, 2023Updated 2 years ago
- (NeurIPS 2024 Spotlight) TOPA: Extend Large Language Models for Video Understanding via Text-Only Pre-Alignment☆29Sep 27, 2024Updated last year
- Self-supervised Point Cloud Representation Learning via Separating Mixed Shapes☆21May 23, 2023Updated 2 years ago
- The official code for [ACM MM 2022] 'In-N-Out Generative Learning for Dense Unsupervised Video Segmentation'.☆20Feb 22, 2023Updated 3 years ago
- Placeholder for code of BSP.☆11Aug 13, 2021Updated 4 years ago
- [TIP 2023] Co-Learning Meets Stitch-Up for Noisy Multi-label Visual Recognition.☆13Aug 19, 2023Updated 2 years ago
- Compress conventional Vision-Language Pre-training data☆53Sep 22, 2023Updated 2 years ago
- [SIGIR 2022] CenterCLIP: Token Clustering for Efficient Text-Video Retrieval.☆134May 4, 2022Updated 3 years ago
- Abstract Spatial-Temporal Reasoning via Probabilistic Abduction and Execution☆25Mar 18, 2021Updated 4 years ago
- The repository of ECCV 2020 paper `Active Visual Information Gathering for Vision-Language Navigation`☆44Apr 9, 2022Updated 3 years ago
- Cross Modal Retrieval with Querybank Normalisation☆57Nov 21, 2023Updated 2 years ago
- Code for Point-Calibrated Spectral Neural Operators☆20Oct 15, 2024Updated last year
- ☆14Jan 16, 2024Updated 2 years ago
- Research code for CVPR 2022 paper: "EMScore: Evaluating Video Captioning via Coarse-Grained and Fine-Grained Embedding Matching"☆26Oct 20, 2022Updated 3 years ago
- Code and data for the paper "Emergent Visual-Semantic Hierarchies in Image-Text Representations" (ECCV 2024)☆34Aug 12, 2024Updated last year
- [TCSVT 2024] Temporally Consistent Referring Video Object Segmentation with Hybrid Memory☆19Apr 9, 2025Updated 10 months ago
- ☆17Jun 21, 2022Updated 3 years ago
- 📄 A curated list of visual reasoning papers.☆31Nov 1, 2025Updated 4 months ago
- [AAAI 2022] Negative Sample Matters: A Renaissance of Metric Learning for Temporal Grounding☆91Nov 16, 2022Updated 3 years ago
- ICLR 2023 - FedFA: Federated Feature Augmentation☆59Mar 28, 2023Updated 2 years ago
- CVPR2022 - Deep Hierarchical Semantic Segmentation - A structured, pixel-wise description of visual scenes in terms of the class hierarch…☆253Apr 24, 2023Updated 2 years ago
- [ECCV24] Navigation Instruction Generation with BEV Perception and Large Language Models☆30Jul 16, 2024Updated last year
- The 1st place solution of 2022 Ego4d Natural Language Queries.☆32Sep 5, 2022Updated 3 years ago