hhaAndroid / awesome-mm-chat
多模态 MM +Chat 合集
☆187Updated 2 weeks ago
Related projects: ⓘ
- [CVPR 2024] Aligning and Prompting Everything All at Once for Universal Visual Perception☆474Updated 4 months ago
- Official PyTorch implementation of "Multi-modal Queried Object Detection in the Wild" (accepted by NeurIPS 2023)☆256Updated 6 months ago
- PromptDet: Towards Open-vocabulary Detection using Uncurated Images, ECCV2022☆159Updated 2 years ago
- The code of the paper "NExT-Chat: An LMM for Chat, Detection and Segmentation".☆204Updated 7 months ago
- Official implementation of OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion☆207Updated this week
- A DETR-style framework for open-vocabulary detection (OVD). CVPR 2023☆166Updated last year
- (CVPR2023/TPAMI2024) Integrally Pre-Trained Transformer Pyramid Networks -- A Hierarchical Vision Transformer for Masked Image Modeling☆168Updated last month
- ☆123Updated 8 months ago
- The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoi…☆86Updated last year
- A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring E…☆176Updated last month
- [TMM 2023] Self-paced Curriculum Adapting of CLIP for Visual Grounding.☆104Updated 2 months ago
- [ICCV2023] DETR Doesn’t Need Multi-Scale or Locality Design☆192Updated 10 months ago
- ☆111Updated last year
- A curated list of papers, datasets and resources pertaining to open vocabulary object detection.☆273Updated 2 months ago
- Collection of image and video datasets for generative AI and multimodal visual AI☆17Updated 4 months ago
- Fine tuning grounding Dino☆41Updated 3 weeks ago
- Awesome OVD-OVS - A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future☆88Updated 3 weeks ago
- Code release for our CVPR 2023 paper "Detecting Everything in the Open World: Towards Universal Object Detection".☆526Updated last year
- [CVPR2024] Generative Region-Language Pretraining for Open-Ended Object Detection☆128Updated 5 months ago
- Open-vocabulary Semantic Segmentation☆296Updated 4 months ago
- OvarNet official implement of the paper "OvarNet: Towards Open-vocabulary Object Attribute Recognition"☆98Updated last year
- ☆102Updated last year
- 这是一个DiT-pytorch的代码,主要用于学习DiT结构。☆59Updated 6 months ago
- A unified evaluation library for multiple machine learning libraries☆251Updated 5 months ago
- Research Code for Multimodal-Cognition Team in Ant Group☆111Updated 2 months ago
- Dense Distinct Query for End-to-End Object Detection (CVPR2023)☆244Updated last year
- PixelLM is an effective and efficient LMM for pixel-level reasoning and understanding. PixelLM is accepted by CVPR 2024.☆174Updated 3 months ago
- Pink: Unveiling the Power of Referential Comprehension for Multi-modal LLMs☆72Updated 3 months ago
- CV算法工程师面试知识点整理☆26Updated last year
- [CVPR2023] This is an official implementation of paper "DETRs with Hybrid Matching".☆256Updated last year