deepglint / MLCD-SegLinks
MLCD-Seg is a zero-shot segmentation model from DeepGlint.
☆17Updated 7 months ago
Alternatives and similar repositories for MLCD-Seg
Users that are interested in MLCD-Seg are comparing it to the libraries listed below
Sorting:
- [ACM MM2025] The official repository for the RealSyn dataset☆40Updated last month
- Rex-Thinker: Grounded Object Refering via Chain-of-Thought Reasoning☆142Updated 7 months ago
- Video Benchmark Suite: Rapid Evaluation of Video Foundation Models☆15Updated last year
- Train InternViT-6B in MMSegmentation and MMDetection with DeepSpeed☆109Updated last year
- ☆26Updated 2 years ago
- An official code for MogFace☆86Updated 2 years ago
- BTS: A Bi-lingual Benchmark for Text Segmentation in the Wild☆33Updated last year
- [EMNLP25 Main]The official code of "Gradient-Attention Guided Dual-Masking Synergetic Framework for Robust Text-based Person Retrieval"☆20Updated 4 months ago
- Fully Open Framework for Democratized Multimodal Reinforcement Learning.☆39Updated last month
- The official codes and datasets for Artistic Text Segmentation (ECCV 2024).☆28Updated 4 months ago
- ☆22Updated 2 years ago
- This is the official implementation of "Vec2Face: Scaling Face Dataset Generation with Loosely Constrained Vectors", which is accepted at…☆86Updated 3 weeks ago
- Official code for fast face classification☆103Updated 2 months ago
- ☆78Updated 10 months ago
- ☆22Updated last year
- DAA: A Delta Age AdaIN operation for age estimation via binary code transformer (CVPR2023)☆37Updated 11 months ago
- Official implementation of Faceptor: A Generalist Model for Face Perception.☆49Updated last year
- Includes the VideoCount dataset and CountVid code for the paper Open-World Object Counting in Videos.☆89Updated last month
- [WACV 2026] Official implementation of the paper: “CountingDINO: A Training-free Pipeline for Exemplar-based Class-Agnostic Counting”☆44Updated 3 months ago
- Official repository for "ARoFace: Alignment Robustness to Improve Low-Quality Face Recognition" ECCV24☆62Updated last year
- ☆17Updated 2 years ago
- ☆53Updated last year
- A Dead Simple and Modularized Multi-Modal Training and Finetune Framework. Compatible to any LLaVA/Flamingo/QwenVL/MiniGemini etc series …☆19Updated last year
- [ICME 2023] FlowText: Synthesizing Realistic Scene Text Video with Optical Flow Estimation☆13Updated 2 years ago
- ☆118Updated last month
- ChineseCLIP using online learning☆13Updated 3 years ago
- Zero-label image classification via OpenCLIP knowledge distillation☆142Updated 2 years ago
- ☆25Updated last year
- Scaling Multi-modal Instruction Fine-tuning with Tens of Thousands Vision Task Types☆33Updated 6 months ago
- This is a PyTorch implementation of "VirFace: Enhancing Face Recognition via Unlabeled Shallow Data" (CVPR 2021).☆22Updated 3 years ago