[ICCV2023] PyTorch implementation of ''Spatial-Aware Token for Weakly Supervised Object Localization''.
☆23Oct 24, 2023Updated 2 years ago
Alternatives and similar repositories for SAT
Users that are interested in SAT are comparing it to the libraries listed below
Sorting:
- [CVPR2022] PyTorch implementation of ''Background Activation Suppression for Weakly Supervised Object Localization''.☆44Sep 25, 2023Updated 2 years ago
- ViTOL☆32Jun 28, 2022Updated 3 years ago
- [AAAI 2022] Pytorch implementation of "LCTR: On Awakening the Local Continuity of Transformer for Weakly Supervised Object Localization".☆22Jul 13, 2022Updated 3 years ago
- Weakly Supervised Object Localization via Class RE-Activation Mapping☆12Sep 19, 2022Updated 3 years ago
- ☆29Mar 15, 2023Updated 2 years ago
- QWEN 2.5VL-R1: Multimodal reasoning model for action recognition in videos (Experimental GRPO with LoRA support)☆22Oct 9, 2025Updated 4 months ago
- [MICCAI 2024] Repository for "ASPS: Augmented Segment Anything Model for Polyp Segmentation"☆46Mar 3, 2025Updated 11 months ago
- [NeurIPS'24] MemVLT: Vision-Language Tracking with Adaptive Memory-based Prompts☆18Oct 7, 2024Updated last year
- Weakly Supervised Object Localization Paper List☆40Dec 6, 2024Updated last year
- [AAAI 2025] Explore In-Context Segmentation via Latent Diffusion Models☆22Mar 25, 2025Updated 11 months ago
- ☆20Oct 26, 2024Updated last year
- This is the official implementation for our NeurIPS 2023 paper "Focus on Query: Adversarial Mining Transformer for Few-Shot Segmentation"…☆22Mar 26, 2024Updated last year
- [ICCV 2023] Generative Prompt Model for Weakly Supervised Object Localization☆57Nov 10, 2023Updated 2 years ago
- Implementation of Multiple Instance Detection Network with Online Instance Classifier Refinement with PyTorch☆24Dec 7, 2022Updated 3 years ago
- TRT for WSOL☆30Oct 31, 2023Updated 2 years ago
- Code release for paper "Pseudo-label Alignment for Semi-supervised Instance Segmentation" [ICCV 2023]☆30Dec 21, 2023Updated 2 years ago
- [CVPR 2024] Guided Slot Attention for Unsupervised Video Object Segmentation☆63Dec 23, 2024Updated last year
- The official implementation of our work Hawkeye: Discovering and Grounding Implicit Anomalous Sentiment in Recon-videos via Scene-enhanc…☆12Oct 14, 2024Updated last year
- [ICCV 2023] Integrally Migrating Pre-trained Transformer Encoder-decoders for Visual Object Detection☆73Oct 15, 2024Updated last year
- Finetuning & extending DiffusionDet to video & pedestrian multi-object-tracking☆13Apr 12, 2023Updated 2 years ago
- The repository of VG-Refiner paper☆17Dec 9, 2025Updated 2 months ago
- Official implementation of "Diffusion-Driven Two-Stage Active Learning for Low-Budget Semantic Segmentation" (NeurIPS 2025)☆18Dec 2, 2025Updated 3 months ago
- Codes for TS-CAM: Token Semantic Coupled Attention Map for Weakly Supervised Object Localization.☆143Feb 16, 2023Updated 3 years ago
- code repository of “Rethinking the Route Towards Weakly Supervised Object Localization” in CVPR 2020☆69Sep 14, 2020Updated 5 years ago
- [ECCV 2024] Learning Video Context as Interleaved Multimodal Sequences☆43Mar 11, 2025Updated 11 months ago
- DisTime: Distribution-based Time Representation for Video Large Language Models.☆18Jul 10, 2025Updated 7 months ago
- [CVPR 2024] LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation☆13Jun 17, 2024Updated last year
- Official Code for Large-vocabulary forensic pathological analyses via prototypical cross-modal contrastive learning☆16Jul 24, 2025Updated 7 months ago
- A large scale inpainting & t2i anime image dataset☆14Oct 18, 2025Updated 4 months ago
- ☆11Jan 18, 2025Updated last year
- Third-party PyTorch implementation of DDT(Unsupervised object discovery and co-localization by deep descriptor transformation)☆37Dec 27, 2018Updated 7 years ago
- Benchmark for Multi-domain Evaluation of Semantic Segmentation☆44Aug 25, 2024Updated last year
- Official repository of the "Shatter and Gather: Learning Referring Image Segmentation with Text Supervision (ICCV'23)"☆42Jan 29, 2024Updated 2 years ago
- ☆18Aug 7, 2025Updated 6 months ago
- Unveiling the Potential of Segment Anything Model 2 for RGB-Thermal Semantic Segmentation with Language Guidance☆13Nov 27, 2025Updated 3 months ago
- Qwen-SAM is a reasoning-based segmentation model that integrates Qwen 2.5 VL 7B with the Segment Anything Model (SAM), enabling fine-grai…☆24Jun 4, 2025Updated 8 months ago
- [IJCV 2025] The official implementation of "AnyPattern: Towards In-context Image Copy Detection"☆10Oct 24, 2025Updated 4 months ago
- 为视障人群生成电影,输入是电影剧本和mkv格式电影,输出为带有解说的电影☆12Jul 28, 2019Updated 6 years ago
- Image Manipulation Detection and Localization☆10Aug 10, 2023Updated 2 years ago