Ryoo72 / UNALinks
Universal-Noise Annotation
☆23Updated last year
Alternatives and similar repositories for UNA
Users that are interested in UNA are comparing it to the libraries listed below
Sorting:
- Official implementation of "URECA : Unique Region Caption Anything"☆53Updated 3 months ago
- Official implementation of "MoA: Mixture-of-Adapters" (WACV 2025)☆28Updated last year
- Official Implementation of "Towards Open-Vocabulary Semantic Segmentation without Semantic Labels" (NeurIPS 2024)☆53Updated last year
- Official repository for CATs++: Boosting Cost Aggregation with Convolutions and Transformers (TPAMI'22)☆49Updated last year
- [ECCV’24] Official repository for "BEAF: Observing Before-AFter Changes to Evaluate Hallucination in Vision-language Models"☆21Updated 7 months ago
- ☆26Updated last year
- ☆20Updated 8 months ago
- Official pytorch implementation of "Towards Practical Plug-and-Play Diffusion Models" in CVPR2023☆22Updated 2 years ago
- [ECCV'24] Official PyTorch implementation of In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation☆46Updated last year
- Official implementation of "Referring Video Object Segmentation via Language Aligned Track Selection".☆40Updated 5 months ago
- ☆14Updated 2 years ago
- Official Implementation of "Multi-Granularity Video Object Segmentation" (AAAI 2025)☆24Updated 10 months ago
- Official implementation of "VIRAL: Visual Representation Alignment for MLLMs".☆134Updated last month
- [CVPR'2025] EntitySAM: Segment Everything in Video☆51Updated 3 months ago
- DiverGen (CVPR 2024) & BSGAL (ICML 2024)☆49Updated 4 months ago
- Official implementation of "S⁴M: Boosting Semi-Supervised Instance Segmentation with SAM" (ICCV 2025)☆30Updated 3 months ago
- [CVPR 2025] Fine-Grained Image-Text Correspondence with Cost Aggregation for Open-Vocabulary Part Segmentation☆18Updated 3 months ago
- [ECCV 2024] VISAGE: Video Instance Segmentation with Appearance-Guided Enhancement☆35Updated last year
- This repository is for the paper "Is BERT Blind? Exploring the Effect of Vision-and-Language Pretraining on Visual Language Understanding…☆20Updated 2 years ago
- Official repository for SuperCATs : Cost Aggregation with Transformers for Sparse Correspondence (ICCE-Asia'22)☆18Updated 2 years ago
- ☆32Updated last year
- Official implementation of "Emergent Temporal Correspondences from Video Diffusion Models"☆84Updated 4 months ago
- Official implementation of "Retrieval-Augmented Score Distillation for Text-to-3D Generation"☆54Updated 10 months ago
- [NeurIPS 2024] Understanding Multi-Granularity for Open-Vocabulary Part Segmentation☆54Updated 10 months ago
- [CVPR 2024 Highlight] ImageNet-D☆44Updated last year
- (ICCV 2023) MasQCLIP for Open-Vocabulary Universal Image Segmentation☆37Updated 2 years ago
- ☆13Updated 2 years ago
- Code for "VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement"☆50Updated 11 months ago
- Official code implementation of NeMF (NeurIPS'22)☆81Updated 2 years ago
- Benchmarking Multi-Image Understanding in Vision and Language Models☆12Updated last year