☆17Nov 29, 2024Updated last year
Alternatives and similar repositories for Hi-Mapper
Users that are interested in Hi-Mapper are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official pytorch repository for "Knowing Where to Focus: Event-aware Transformer for Video Grounding" (ICCV 2023)☆55Sep 7, 2023Updated 2 years ago
- Pin the Memory (CVPR 2022): Official Project Webpage☆44Nov 3, 2022Updated 3 years ago
- Enhancing Multimodal Compositional Reasoning of Visual Language Models with Generative Negative Mining, WACV 2024☆13Jan 3, 2024Updated 2 years ago
- [ECCV 2024 Oral] Code for our paper "A Fair Ranking and New Model for Panoptic Scene Graph Generation"☆16Dec 2, 2025Updated 3 months ago
- Code and data for the paper "Emergent Visual-Semantic Hierarchies in Image-Text Representations" (ECCV 2024)☆34Aug 12, 2024Updated last year
- Official PyTorch Implementation of HIER: Metric Learning Beyond Class Labels via Hierarchical Regularization, CVPR 2023☆67Sep 3, 2023Updated 2 years ago
- [AAAI2024] Code Release of CLIM: Contrastive Language-Image Mosaic for Region Representation☆30Feb 4, 2024Updated 2 years ago
- Official Pytorch implementation of "Improved Probabilistic Image-Text Representations" (ICLR 2024)☆60May 26, 2024Updated last year
- [EMNLP 2024] Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality☆21Oct 8, 2024Updated last year
- This repository is related to 'Intriguing Properties of Hyperbolic Embeddings in Vision-Language Models', published at TMLR (2024), https…☆22Jul 5, 2024Updated last year
- [AAAI 2024] Official code for "Hyp-OW: Exploiting Hierarchical Structure Learning with Hyperbolic Distance Enhances Op…☆16Feb 14, 2024Updated 2 years ago
- Code for "HyperSDFusion: Bridging Hierarchical Structures in Language and Geometry for Enhanced 3D Text2Shape Generation" CVPR2024☆10Apr 19, 2024Updated last year
- ☆16Dec 11, 2024Updated last year
- ☆36May 12, 2025Updated 10 months ago
- [ICLR'25] Official repository for "AVHBench: A Cross-Modal Hallucination Evaluation for Audio-Visual Large Language Models"☆20Mar 8, 2026Updated 2 weeks ago
- [ECCV 2024] OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models☆49Jan 8, 2025Updated last year
- Official Implementation of ISR-DPO:Aligning Large Multimodal Models for Videos by Iterative Self-Retrospective DPO (AAAI'25)☆23Nov 25, 2025Updated 3 months ago
- Code for "CLIP Behaves like a Bag-of-Words Model Cross-modally but not Uni-modally"☆25Feb 27, 2026Updated 3 weeks ago
- 📄 A curated list of visual reasoning papers.☆31Mar 4, 2026Updated 2 weeks ago
- [AAAI-24] VVS : Video-to-Video Retrieval With Irrelevant Frame Suppression☆20May 14, 2024Updated last year
- Manifold aware pytorch layers☆13Apr 25, 2020Updated 5 years ago
- PU-Transformer: Point Cloud Upsampling Transformer (ACCV 2022)☆12Nov 24, 2022Updated 3 years ago
- [ECCV'24] Official PyTorch implementation of In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation☆50Sep 24, 2024Updated last year
- Code Implementation of "Simple Image-level Classification Improves Open-vocabulary Object Detection" (AAAI'24)☆29Jan 12, 2024Updated 2 years ago
- [NeurIPS 2023] Understanding and Improving Feature Learning for Out-of-Distribution Generalization☆29May 27, 2025Updated 9 months ago
- [CVPR 2025] OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts