YeeZ93 / Awesome-Object-Centric-Learning
A curated list of researches in object-centric learning
☆11Updated 7 months ago
Alternatives and similar repositories for Awesome-Object-Centric-Learning
Users that are interested in Awesome-Object-Centric-Learning are comparing it to the libraries listed below
Sorting:
- Diffusion-TTA improves pre-trained discriminative models such as image classifiers or segmentors using pre-trained generative models.☆72Updated last year
- Emerging Pixel Grounding in Large Multimodal Models Without Grounding Supervision☆41Updated last month
- Distilling Large Vision-Language Model with Out-of-Distribution Generalizability (ICCV 2023)☆56Updated last year
- Awesome paper for multi-modal llm with grounding ability☆17Updated 9 months ago
- ✌ CLoG: Benchmarking Continual Learning of Image Generation Models☆18Updated 11 months ago
- ☆41Updated 4 months ago
- This repository houses the code for the paper - "The Neglected of VLMs"☆28Updated last week
- [NeurIPS 2022] Revisiting Realistic Test-Time Training: Sequential Inference and Adaptation by Anchored Clustering☆47Updated last year
- [CVPR 2025] Few-shot Recognition via Stage-Wise Retrieval-Augmented Finetuning☆17Updated last month
- Continual Forgetting for Pre-trained Vision Models (CVPR 2024)☆64Updated last month
- The efficient tuning method for VLMs☆81Updated last year
- ☆16Updated 6 months ago
- Official code for "pi-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation", ICML 2023.☆32Updated last year
- Towards a Unified View on Visual Parameter-Efficient Transfer Learning☆26Updated 2 years ago
- Code for CVPR2025 "MMRL: Multi-Modal Representation Learning for Vision-Language Models".☆33Updated last month
- Compress conventional Vision-Language Pre-training data☆51Updated last year
- [NeurIPS 2024] Official Repository of Multi-Object Hallucination in Vision-Language Models☆29Updated 6 months ago
- Official code repo of PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs☆26Updated 4 months ago
- 🔥 [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)"☆38Updated 11 months ago
- [NeurIPS'24] SpatialEval: a benchmark to evaluate spatial reasoning abilities of MLLMs and LLMs☆35Updated 3 months ago
- Implementation of "VL-Mamba: Exploring State Space Models for Multimodal Learning"☆81Updated last year
- Collection of awesome Continual Test-Time Adaptation methods☆17Updated 11 months ago
- [CVPR 2025] Official PyTorch Implementation of GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmenta…☆36Updated last month
- [CVPR2024 Highlight] Official implementation for Transferable Visual Prompting. The paper "Exploring the Transferability of Visual Prompt…☆40Updated 4 months ago
- [ECCV 2024] Mind the Interference: Retaining Pre-trained Knowledge in Parameter Efficient Continual Learning of Vision-Language Models☆47Updated 10 months ago
- [ICLR 2024] ViDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation☆64Updated last year
- Official implementation of "Why are Visually-Grounded Language Models Bad at Image Classification?" (NeurIPS 2024)☆81Updated 6 months ago
- [CVPR2025] Code Release of F-LMM: Grounding Frozen Large Multimodal Models☆89Updated 9 months ago
- [AAAI2023] Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task (Oral)☆39Updated last year
- Repository for the paper: Teaching VLMs to Localize Specific Objects from In-context Examples☆22Updated 5 months ago