tue-mps / eomtLinks
[CVPR 2025 Highlight] Official code and models for Encoder-only Mask Transformer (EoMT).
☆439Updated last week
Alternatives and similar repositories for eomt
Users that are interested in eomt are comparing it to the libraries listed below
Sorting:
- [CVPR 2025 Highlight] Official repository for the paper: "SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation"☆331Updated last month
- Official code of Franca: Nested Matryoshka Clustering for Scalable Visual Representation Learning☆249Updated last month
- Testing adaptation of the DINOv2/3 encoders for vision tasks with Low-Rank Adaptation (LoRA)☆290Updated 3 weeks ago
- [NeurIPS 2024] Code release for "Segment Anything without Supervision"☆485Updated last month
- [NeurIPS 2025 Spotlight] "SANSA: Unleashing the Hidden Semantics in SAM2 for Few-Shot Segmentation."☆123Updated 2 weeks ago
- This repo aims to include materials (papers, codes, slides) about SAM2 (segment anything in images and videos). We are continuously impro…☆115Updated 3 weeks ago
- Repository of the paper "AnyUp: Universal Feature Upsampling".☆282Updated last week
- Muggled SAM: Segmentation without the magic☆166Updated last month
- [CVPR 2025] "A Distractor-Aware Memory for Visual Object Tracking with SAM2"☆425Updated 3 weeks ago
- [CVPR 2024] Code release for "Unsupervised Universal Image Segmentation"☆221Updated last year
- One summary of efficient segment anything models☆109Updated last year
- Official implementation of 'CLIP-DINOiser: Teaching CLIP a few DINO tricks' paper.☆261Updated last year
- RobustSAM: Segment Anything Robustly on Degraded Images (CVPR 2024 Highlight)☆361Updated last year
- Includes the code for training and testing the CountGD model from the paper CountGD: Multi-Modal Open-World Counting.☆278Updated 4 months ago
- Downstream-Dino-V2: A GitHub repository featuring an easy-to-use implementation of the DINOv2 model by Facebook for downstream tasks such…☆263Updated 2 years ago
- Official code of "EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model"☆476Updated 7 months ago
- [ICLR 2025 oral] RMP-SAM: Towards Real-Time Multi-Purpose Segment Anything☆261Updated 6 months ago
- [ICML 2025] Official Implementation for SimDINO/SimDINOv2☆176Updated 7 months ago
- [ICCV2025] Official repository of the paper "Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabulary…☆118Updated last week
- The Missing Point in Vision Transformers for Universal Image Segmentation☆54Updated 4 months ago
- [ICCV'25 oral] Official Code for "LoftUp: Learning a Coordinate-Based Feature Upsampler for Vision Foundation Models"☆196Updated 3 months ago
- Loss Functions in the Era of Semantic Segmentation: A Survey and Outlook☆65Updated last year
- Official Repo for PosSAM: Panoptic Open-vocabulary Segment Anything☆69Updated last year
- [CVPR 2025] Official repository of the paper "Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation"☆110Updated this week
- [NeurIPS 2024] SlimSAM: 0.1% Data Makes Segment Anything Slim☆343Updated last month
- (CVPR 2025 highlight✨) Official repository of paper "LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of La…☆451Updated 2 months ago
- [NeurIPS 2023] This repo contains the code for our paper Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convoluti…☆330Updated last year
- Scaling Vision Pre-Training to 4K Resolution☆209Updated last month
- This is the official code release for our work, Denoising Vision Transformers.☆381Updated 11 months ago
- ☆128Updated last year