bytedance / coconut_cvpr2024View external linksLinks
☆201May 19, 2025Updated 8 months ago
Alternatives and similar repositories for coconut_cvpr2024
Users that are interested in coconut_cvpr2024 are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2023] This repo contains the code for our paper Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convoluti…☆337Feb 5, 2024Updated 2 years ago
- This repo contains the code for our paper Towards Open-Ended Visual Recognition with Large Language Model☆99Jul 15, 2024Updated last year
- [CVPR 2024] Official implementation of "ViTamin: Designing Scalable Vision Models in the Vision-language Era"☆211Jun 9, 2024Updated last year
- a PyTorch re-implementation of ECCV 2022 paper based on Detectron2: k-means mask Transformer.☆81Jul 28, 2023Updated 2 years ago
- [ICCV 2025] GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding☆73Jun 26, 2025Updated 7 months ago
- [ICCV2023] VLPart: Going Denser with Open-Vocabulary Part Segmentation☆393Sep 19, 2023Updated 2 years ago
- [ICLR 2024 & ECCV 2024] The All-Seeing Projects: Towards Panoptic Visual Recognition&Understanding and General Relation Comprehension of …☆504Aug 9, 2024Updated last year
- [ECCV 2024] The official code of paper "Open-Vocabulary SAM".☆1,028Aug 4, 2025Updated 6 months ago
- This repo contains the code for our TMLR paper: A Simple Video Segmenter by Tracking Objects Along Axial Trajectories☆27Mar 20, 2025Updated 10 months ago
- A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting [ECCV 2024]☆103Jan 30, 2024Updated 2 years ago
- DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception☆159Dec 6, 2024Updated last year
- ☆13Jul 20, 2024Updated last year
- [CVPR 2024] Official implementation of the paper "Visual In-context Learning"☆529Apr 8, 2024Updated last year
- Official Repo For OMG-LLaVA and OMG-Seg codebase [CVPR-24 and NeurIPS-24]☆1,342Oct 15, 2025Updated 4 months ago
- [ECCV 2024] Tokenize Anything via Prompting☆603Dec 11, 2024Updated last year
- [NeurIPS 2023] FreeMask: Synthetic Images with Dense Annotations Make Stronger Segmentation Models☆131Dec 3, 2023Updated 2 years ago
- ☆28Apr 4, 2025Updated 10 months ago
- Official code of "EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model"☆496Mar 17, 2025Updated 10 months ago
- Code release for the CVPR'23 paper titled "PartDistillation Learning part from Instance Segmentation"☆60Dec 17, 2023Updated 2 years ago
- InstaGen: Enhancing Object Detection by Training on Synthetic Dataset, CVPR2024☆91Apr 9, 2024Updated last year
- ☆134Jul 4, 2024Updated last year
- [ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction☆201Feb 5, 2024Updated 2 years ago
- This repo contains the code for our paper Compositor: Bottom-Up Clustering and Compositing for Robust Part and Object Segmentation☆17Mar 20, 2025Updated 10 months ago
- Boundary IoU API (Beta version)☆234Aug 13, 2024Updated last year
- [ICCV 2023] Official implementation of the paper "A Simple Framework for Open-Vocabulary Segmentation and Detection"☆748Jan 22, 2024Updated 2 years ago
- ☆10Jul 5, 2024Updated last year
- ☆11Oct 20, 2023Updated 2 years ago
- ViCaS: A Dataset for Combining Holistic and Pixel-level Video Understanding using Captions with Grounded Segmentation (CVPR'25)☆18Apr 2, 2025Updated 10 months ago
- This repo contains documentation and code needed to use PACO dataset: data loaders and training and evaluation scripts for objects, parts…☆290Feb 12, 2024Updated 2 years ago
- [CVPR 2023] Official implementation of the paper "Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segme…☆1,494Dec 20, 2023Updated 2 years ago
- 🔥 [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)"☆47Jun 16, 2024Updated last year
- Official Implementation of ICCV 2023 Paper - SegPrompt: Boosting Open-World Segmentation via Category-level Prompt Learning☆111May 28, 2025Updated 8 months ago
- [ICLR 2025] SAMRefiner: Taming Segment Anything Model for Universal Mask Refinement☆83Apr 19, 2025Updated 9 months ago
- ☆22Jun 30, 2023Updated 2 years ago
- Official code for CAVIS: Context-Aware Video Instance Segmentation☆95Sep 17, 2025Updated 4 months ago
- [ECCV2024] This is an official implementation for "PSALM: Pixelwise SegmentAtion with Large Multi-Modal Model"☆269Dec 30, 2024Updated last year
- [ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"☆2,808Jul 10, 2025Updated 7 months ago
- ☆71Dec 6, 2023Updated 2 years ago
- [CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language☆1,342Oct 5, 2023Updated 2 years ago