☆201Feb 12, 2026Updated last month
Alternatives and similar repositories for coconut_cvpr2024
Users that are interested in coconut_cvpr2024 are comparing it to the libraries listed below
Sorting:
- a PyTorch re-implementation of ECCV 2022 paper based on Detectron2: k-means mask Transformer.☆81Jul 28, 2023Updated 2 years ago
- This repo contains the code for our paper Towards Open-Ended Visual Recognition with Large Language Model☆100Jul 15, 2024Updated last year
- This repo contains the code for our TMLR paper: A Simple Video Segmenter by Tracking Objects Along Axial Trajectories☆27Mar 20, 2025Updated last year
- [CVPR 2024] Official implementation of "ViTamin: Designing Scalable Vision Models in the Vision-language Era"☆211Jun 9, 2024Updated last year
- [NeurIPS 2023] This repo contains the code for our paper Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convoluti…☆339Feb 5, 2024Updated 2 years ago
- [ICCV 2025] GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding☆75Jun 26, 2025Updated 8 months ago
- ☆28Apr 4, 2025Updated 11 months ago
- ViCaS: A Dataset for Combining Holistic and Pixel-level Video Understanding using Captions with Grounded Segmentation (CVPR'25)☆20Apr 2, 2025Updated 11 months ago
- ☆136Jul 4, 2024Updated last year
- A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting [ECCV 2024]☆104Jan 30, 2024Updated 2 years ago
- [ECCV 2024] The official code of paper "Open-Vocabulary SAM".☆1,029Aug 4, 2025Updated 7 months ago
- [ICCV2023] VLPart: Going Denser with Open-Vocabulary Part Segmentation☆393Sep 19, 2023Updated 2 years ago
- [ICLR 2024 & ECCV 2024] The All-Seeing Projects: Towards Panoptic Visual Recognition&Understanding and General Relation Comprehension of …☆506Aug 9, 2024Updated last year
- This repo contains the code for our paper Compositor: Bottom-Up Clustering and Compositing for Robust Part and Object Segmentation☆18Mar 20, 2025Updated last year
- Official code of "EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model"☆497Mar 17, 2025Updated last year
- Official Repo For OMG-LLaVA and OMG-Seg codebase [CVPR-24 and NeurIPS-24]☆1,344Oct 15, 2025Updated 5 months ago
- [CVPR 2024] Official implementation of the paper "Visual In-context Learning"☆531Apr 8, 2024Updated last year
- DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception☆159Dec 6, 2024Updated last year
- InstaGen: Enhancing Object Detection by Training on Synthetic Dataset, CVPR2024☆91Apr 9, 2024Updated last year
- [NeurIPS 2023] FreeMask: Synthetic Images with Dense Annotations Make Stronger Segmentation Models☆132Dec 3, 2023Updated 2 years ago
- [CVPR 2023] Official implementation of the paper "Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segme…☆1,504Dec 20, 2023Updated 2 years ago
- [ECCV 2024] Tokenize Anything via Prompting☆602Dec 11, 2024Updated last year
- This repo contains documentation and code needed to use PACO dataset: data loaders and training and evaluation scripts for objects, parts…☆293Feb 12, 2024Updated 2 years ago
- ☆10Jul 5, 2024Updated last year
- ☆71Dec 6, 2023Updated 2 years ago
- Taxonomy-aware Multi-dataset Joint Training for Video Instance Segmentation (NeurIPS 23)☆12May 7, 2025Updated 10 months ago
- ☆13Jul 20, 2024Updated last year
- Fast and general video object segmentation evaluation.☆36Jan 30, 2024Updated 2 years ago
- 🔥 [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)"☆47Jun 16, 2024Updated last year
- This repo contains the code for 1D tokenizer and generator☆1,129Mar 20, 2025Updated last year
- ☆28Jul 30, 2024Updated last year
- ☆17Feb 26, 2024Updated 2 years ago
- [ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"☆2,813Jul 10, 2025Updated 8 months ago
- [ECCV2024] This is an official implementation for "PSALM: Pixelwise SegmentAtion with Large Multi-Modal Model"☆270Dec 30, 2024Updated last year
- [CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language☆1,341Oct 5, 2023Updated 2 years ago
- Official implementation of "Perturbed-Attention Guidance"☆60Jul 2, 2024Updated last year
- [ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction☆201Feb 5, 2024Updated 2 years ago
- [ICLR 2025 oral] RMP-SAM: Towards Real-Time Multi-Purpose Segment Anything☆268Apr 11, 2025Updated 11 months ago
- [ICCV 2023] Official implementation of the paper "A Simple Framework for Open-Vocabulary Segmentation and Detection"☆749Jan 22, 2024Updated 2 years ago