better-chao / perceptual_abilities_evaluation
Vision-Language Model for Object Detection and Segmentation: A Review and Evaluation
☆112 Updated 2 weeks ago
Alternatives and similar repositories for perceptual_abilities_evaluation
Users who are interested in perceptual_abilities_evaluation are comparing it to the repositories listed below.
- ☆89 Updated last year
- Official implementation of ECCV2022 paper End-to-End Weakly Supervised Object Detection with Sparse Proposal Evolution ☆101 Updated 2 years ago
- Domain-Controlled Prompt Learning (AAAI2024) ☆88 Updated 9 months ago
- Official Implementation of AttentionShift: Iteratively Estimated Part-based Attention Map for Pointly Supervised Instance Segmentation ☆157 Updated 10 months ago
- Domain Prompt Learning with Quaternion Networks (CVPR2024 Highlight) ☆79 Updated 7 months ago
- [CVPR 2024] Interactive continual learning: Fast and slow thinking ☆102 Updated last year
- [ICLR 25] The implementation of paper Diff-Prompt: Diffusion-Driven Prompt Generator with Mask Supervision. ☆52 Updated last month
- (TCSVT 2024) Unsupervised Domain Adaption Harnessing Vision-Language Pre-training ☆38 Updated last year
- [NeurIPS'24] Leveraging Hallucinations to Reduce Manual Prompt Dependency in Promptable Segmentation ☆62 Updated 8 months ago
- Official Code of "GeReA: Question-Aware Prompt Captions for Knowledge-based Visual Question Answering" ☆111 Updated 10 months ago
- ☆27 Updated 3 weeks ago
- [NeurIPS' 24] Official implementation of the paper "Cloud Object Detector Adaptation by Integrating Different Source Knowledge" ☆37 Updated 5 months ago
- A comprehensive collection of resources focused on addressing and understanding hallucination phenomena in MLLMs. ☆34 Updated last year
- ☆71 Updated 2 years ago
- A Unified Baseline Tracker for Multimodal Single and Multiple Object Tracking ☆48 Updated 10 months ago
- UniInst ☆98 Updated last year
- Official repository of Expert-Controlled Classifier-Free Guidance for Reliable Medical Visual Question Answering. ☆41 Updated last month
- ☆37 Updated last year
- Image and video Tokenizer/VAE selection guide, text and face reconstruction evaluation. ☆121 Updated 2 months ago
- ☆80 Updated 9 months ago
- The official project website of "Augmentation-free Dense Contrastive Distillation for Efficient Semantic Segmentation" (Af-DCD for short,… ☆17 Updated last year
- 3D-RCNet: A 3D Relational Convolutional Network for Hyperspectral Image Classification ☆92 Updated 4 months ago
- The official project website of "ScaleKD: Strong Vision Transformers Could Be Excellent Teachers" (ScaleKD for short, accepted to NeurIPS… ☆63 Updated 7 months ago
- Official Code of Logits-Based-Finetuning ☆87 Updated 2 months ago
- ☆44 Updated 4 months ago
- The code repository of TBAC-UniImage ☆114 Updated last week
- Camouflaged Object Detection ☆20 Updated last month
- ☆60 Updated last week
- CheX-Phi3.5V is a vision-language model (VLM) for chest X-ray interpretation. ☆23 Updated 4 months ago
- This is the source code for paper "Unsupervised Adversarial Domain Adaptation for Cross-domain Face Presentation Attack Detection" ☆77 Updated 4 years ago