360CVGroup / LMM-Det
Make Large Multimodal Models excel in object detection, ICCV 2025
☆31Updated this week
Alternatives and similar repositories for LMM-Det
Users who are interested in LMM-Det are comparing it to the libraries listed below.
- [NeurIPS 2024 Spotlight ⭐️ & TPAMI 2025] Parameter-Inverted Image Pyramid Networks (PIIP)☆96Updated this week
- [ICCV 2025] Unbiased Region-Language Alignment for Open-Vocabulary Dense Prediction☆35Updated 3 weeks ago
- ☆45Updated 7 months ago
- (NeurIPS 2024) Official repository of paper "Frozen-DETR: Enhancing DETR with Image Understanding from Frozen Foundation Models"☆29Updated 4 months ago
- Official repository of the paper "High-Quality Mask Tuning Matters for Open-Vocabulary Segmentation"☆37Updated 4 months ago
- Official implementation of "Re-Aligning Language to Visual Objects with an Agentic Workflow"☆27Updated 3 months ago
- [CVPR 2025] DynRefer: Delving into Region-level Multimodal Tasks via Dynamic Resolution☆51Updated 5 months ago
- Official implementation of ICML 2024 Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation☆51Updated 11 months ago
- ☆77Updated 2 months ago
- [ECCV 2024] The official PyTorch implementation of the "Plain-Det: A Plain Multi-Dataset Object Detector".☆28Updated 7 months ago
- [ICCV2023] DETRDistill: A Universal Knowledge Distillation Framework for DETR-families☆58Updated last year
- [ICCV 2025] Official implementation of LLaVA-KD: A Framework of Distilling Multimodal Large Language Models☆91Updated last month
- Code release for "Weakly Supervised Open-Vocabulary Object Detection", AAAI2024☆35Updated 10 months ago
- [AAAI2025 selected as oral] - Multi-task Visual Grounding with Coarse-to-Fine Consistency Constraints☆31Updated last month
- ☆12Updated 7 months ago
- The official implementation of "PixelThink: Towards Efficient Chain-of-Pixel Reasoning" (arXiv 2025)☆36Updated 2 months ago
- [CVPR 2024] The official implementation for "MS-DETR: Efficient DETR Training with Mixed Supervision"☆113Updated last year
- [ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference☆86Updated 4 months ago
- [NeurIPS'24] A Simple Image Segmentation Framework via In-Context Examples☆58Updated 9 months ago
- Project Page for "Multi-Task Dense Prediction via Mixture of Low-Rank Experts"☆82Updated last month
- ☆86Updated last year
- [AAAI 2025] Official implementation of the paper "EOV-Seg: Efficient Open-Vocabulary Panoptic Segmentation"☆29Updated 7 months ago
- ☆31Updated 7 months ago
- "Visual Prompt Selection for In-Context Learning Segmentation Framework"☆15Updated 7 months ago
- [NeurIPS2024] - SimVG: A Simple Framework for Visual Grounding with Decoupled Multi-modal Fusion☆85Updated 2 months ago
- [AAAI2024] Code Release of CLIM: Contrastive Language-Image Mosaic for Region Representation☆29Updated last year
- HiMTok: Learning Hierarchical Mask Tokens for Image Segmentation with Large Multimodal Model☆58Updated 3 weeks ago
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition☆80Updated 4 months ago
- [CVPR2025] Project for "HyperSeg: Towards Universal Visual Segmentation with Large Language Model".☆160Updated 7 months ago
- [CVPR 2024] The repository contains the official implementation of "Open-Vocabulary Segmentation with Semantic-Assisted Calibration"☆72Updated 10 months ago