LiBingyu01 / FGA-segView external linksLinks
Fine-Grained Pixel-Text Alignment for Open-Vocabulary Semantic Segmentation
☆15Sep 24, 2025Updated 4 months ago
Alternatives and similar repositories for FGA-seg
Users that are interested in FGA-seg are comparing it to the libraries listed below
Sorting:
- Official Implementation for ACM MM2024 paper "VrdONE: One-stage Video Visual Relation Detection".☆11Nov 13, 2024Updated last year
- [CVPR2024] Open-Vocabulary Semantic Segmentation with Image Embedding Balancing☆40Jan 12, 2026Updated last month
- [CVPR'25] Official implementation of "Semantic Library Adaptation: LoRA Retrieval and Fusion for Open-Vocabulary Semantic Segmentation"☆43Oct 4, 2025Updated 4 months ago
- ☆22May 12, 2025Updated 9 months ago
- Related papers about Referring Image Segmentation (RIS)☆16Dec 26, 2023Updated 2 years ago
- ☆23Aug 20, 2024Updated last year
- [MICCAI 2024] VLSM-Adapter: Finetuning Vision-Language Segmentation Efficiently with Lightweight Blocks☆27Jan 13, 2026Updated last month
- ☆31Mar 5, 2025Updated 11 months ago
- Segment This Thing is an efficient image segmentation models that uses a biologically-inspired foveated tokenization to reduce inference …☆56Jun 16, 2025Updated 8 months ago
- The official implementation of our work Hawkeye: Discovering and Grounding Implicit Anomalous Sentiment in Recon-videos via Scene-enhanc…☆12Oct 14, 2024Updated last year
- Patch-free 3D Medical Image Segmentation☆36Dec 6, 2021Updated 4 years ago
- The official repository of paper named 'A Refer-and-Ground Multimodal Large Language Model for Biomedicine'☆34Nov 5, 2024Updated last year
- (TIP 2024) Towards Robust Referring Image Segmentation☆36Mar 2, 2024Updated last year
- The repository of VG-Refiner paper☆17Dec 9, 2025Updated 2 months ago
- A vision-language model with bidirectional progressive fusion and global-local alignment for enhanced medical image segmentation.☆17Dec 25, 2025Updated last month
- Official Code for All-in-One Medical Image Re-Identification (CVPR2025)☆18Jan 11, 2026Updated last month
- Finetuning & extending DiffusionDet to video & pedestrian multi-object-tracking☆13Apr 12, 2023Updated 2 years ago
- DisTime: Distribution-based Time Representation for Video Large Language Models.☆18Jul 10, 2025Updated 7 months ago
- Official implementation of LLaVa-Rad, a small multimodal model for chest X-ray findings generation.☆53Jan 22, 2026Updated 3 weeks ago
- [CVPR 2025] Official Pytorch Code for Distilling Spectral Graph for Object-Context Aware Open-Vocabulary Semantic Segmentation☆46Mar 27, 2025Updated 10 months ago
- [ECCV 2024] SAM4MLLM: Enhance Multi-Modal Large Language Model for Referring Expression Segmentation,☆49Mar 20, 2025Updated 10 months ago
- (ICCV 2023) MasQCLIP for Open-Vocabulary Universal Image Segmentation☆37Oct 18, 2023Updated 2 years ago
- Build your rail application environment in a handy way☆12Dec 4, 2018Updated 7 years ago
- Multi-Organ Foundation Model for Universal Ultrasound Image Segmentation with Task Prompt and Anatomical Prior☆16Sep 30, 2024Updated last year
- The implementation codes of paper: Multimodal Sentiment Analysis with Mutual Information-based Disentangled Representation Learning☆18May 8, 2025Updated 9 months ago
- [CVPR 2024] LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation☆13Jun 17, 2024Updated last year
- PyNoetic: A Modular Python Framework for No-Code Development of EEG Brain-Computer Interfaces☆25Jun 26, 2025Updated 7 months ago
- ☆11Jan 18, 2025Updated last year
- ☆10Apr 7, 2025Updated 10 months ago
- Dual-domain attention-guided low-dose CBCT reconstruction☆10Jun 17, 2022Updated 3 years ago
- ☆18Sep 5, 2024Updated last year
- pytorch版损失函数,改写自科学空间文章,【通过互信息思想来缓解类别不平衡问题】、【将“softmax+交叉熵”推广到多标签分类问题】☆12Aug 22, 2021Updated 4 years ago
- [ACM MM2024] The code for HMLLM.☆11Oct 27, 2024Updated last year
- Project Page for CoPRS, offering training overview, inference code, and downloadable links.☆20Oct 27, 2025Updated 3 months ago
- accepted by MICCAI2024☆44Nov 28, 2024Updated last year
- Official repository of the paper "High-Quality Mask Tuning Matters for Open-Vocabulary Segmentation"☆45Mar 25, 2025Updated 10 months ago
- ECCV24 "ReMamber: Referring Image Segmentation with Mamba Twister" official repository.☆44Jul 11, 2024Updated last year
- Aggregate and Discriminate: Pseudo Clips-Guided Boundary Perception for Video Moment Retrieval☆12Nov 25, 2024Updated last year
- [ECCV 2024] OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models☆49Jan 8, 2025Updated last year