bytedance/coconut_cvpr2024

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/bytedance/coconut_cvpr2024)

bytedance / coconut_cvpr2024

☆206

Alternatives and similar repositories for coconut_cvpr2024

Users that are interested in coconut_cvpr2024 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

bytedance / kmax-deeplab
View on GitHub
a PyTorch re-implementation of ECCV 2022 paper based on Detectron2: k-means mask Transformer.
☆80Jul 28, 2023Updated 2 years ago
bytedance / OmniScient-Model
View on GitHub
This repo contains the code for our paper Towards Open-Ended Visual Recognition with Large Language Model
☆102Jul 15, 2024Updated 2 years ago
TACJu / Axial-VS
View on GitHub
This repo contains the code for our TMLR paper: A Simple Video Segmenter by Tracking Objects Along Axial Trajectories
☆27Mar 20, 2025Updated last year
Beckschen / ViTamin
View on GitHub
[CVPR 2024] Official implementation of "ViTamin: Designing Scalable Vision Models in the Vision-language Era"
☆211Jun 9, 2024Updated 2 years ago
bytedance / fc-clip
View on GitHub
[NeurIPS 2023] This repo contains the code for our paper Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convoluti…
☆345Feb 5, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
hustvl / GroundingSuite
View on GitHub
[ICCV 2025] GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding
☆77Jun 26, 2025Updated last year
zhang-tao-whu / DVIS_Plus
View on GitHub
☆140Jul 4, 2024Updated 2 years ago
dlsrbgg33 / Video-3DGS
View on GitHub
☆28Apr 4, 2025Updated last year
Ali2500 / ViCaS
View on GitHub
ViCaS: A Dataset for Combining Holistic and Pixel-level Video Understanding using Captions with Grounded Segmentation (CVPR'25)
☆21Apr 2, 2025Updated last year
segments-ai / latent-diffusion-segmentation
View on GitHub
A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting [ECCV 2024]
☆108Jan 30, 2024Updated 2 years ago
HarborYuan / ovsam
View on GitHub
[ECCV 2024] The official code of paper "Open-Vocabulary SAM".
☆1,031Aug 4, 2025Updated 11 months ago
facebookresearch / VLPart
View on GitHub
[ICCV2023] VLPart: Going Denser with Open-Vocabulary Part Segmentation
☆395Sep 19, 2023Updated 2 years ago
OpenGVLab / all-seeing
View on GitHub
[ICLR 2024 & ECCV 2024] The All-Seeing Projects: Towards Panoptic Visual Recognition&Understanding and General Relation Comprehension of …
☆507Aug 9, 2024Updated last year
TACJu / Compositor
View on GitHub
This repo contains the code for our paper Compositor: Bottom-Up Clustering and Compositing for Robust Part and Object Segmentation
☆18Mar 20, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
SkyworkAI / DAQ-VS
View on GitHub
Code For Our Work: DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries [ECCV-2024]
☆15Jul 11, 2024Updated 2 years ago
hkchengrex / vos-benchmark
View on GitHub
Fast and general video object segmentation evaluation.
☆36Jan 30, 2024Updated 2 years ago
hustvl / EVF-SAM
View on GitHub
Official code of "EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model"
☆505Mar 17, 2025Updated last year
lxtGH / OMG-Seg
View on GitHub
Official Repo For OMG-LLaVA and OMG-Seg codebase [CVPR-24 and NeurIPS-24]
☆1,350Oct 15, 2025Updated 9 months ago
UX-Decoder / DINOv
View on GitHub
[CVPR 2024] Official implementation of the paper "Visual In-context Learning"
☆542Apr 8, 2024Updated 2 years ago
fcjian / InstaGen
View on GitHub
InstaGen: Enhancing Object Detection by Training on Synthetic Dataset, CVPR2024
☆92Apr 9, 2024Updated 2 years ago
ByteDance-Seed / DeepFlow
View on GitHub
[ICCV 2025] Deeply Supervised Flow-Based Generative Models
☆38Jun 26, 2025Updated last year
baaivision / DenseFusion
View on GitHub
DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception
☆159Dec 6, 2024Updated last year
LiheYoung / FreeMask
View on GitHub
[NeurIPS 2023] FreeMask: Synthetic Images with Dense Annotations Make Stronger Segmentation Models
☆133Dec 3, 2023Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
facebookresearch / paco
View on GitHub
This repo contains documentation and code needed to use PACO dataset: data loaders and training and evaluation scripts for objects, parts…
☆300Feb 12, 2024Updated 2 years ago
baaivision / tokenize-anything
View on GitHub
[ECCV 2024] Tokenize Anything via Prompting
☆601Dec 11, 2024Updated last year
RobertLuo1 / CoHD
View on GitHub
The official implementation of A Counting-Aware Hierarchical Decoding Framework for Generalized Referring Expression Segmentation
☆27Aug 17, 2025Updated 11 months ago
IDEA-Research / MaskDINO
View on GitHub
[CVPR 2023] Official implementation of the paper "Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segme…
☆1,543Dec 20, 2023Updated 2 years ago
ytaek-oh / vl_compo
View on GitHub
☆10Jul 5, 2024Updated 2 years ago
amazon-far / BAR
View on GitHub
[ICML 2026] code & model for arxiv paper "Autoregressive Image Generation with Masked Bit Modeling"
☆59May 1, 2026Updated 2 months ago
rkzheng99 / TMT-VIS
View on GitHub
Taxonomy-aware Multi-dataset Joint Training for Video Instance Segmentation (NeurIPS 23)
☆12May 7, 2025Updated last year
fanq15 / Stable-SAM
View on GitHub
☆73Dec 6, 2023Updated 2 years ago
lxtGH / Tube-Link
View on GitHub
[ICCV-2023]-Universal Video Segmentaion For VSS, VPS and VIS
☆109Mar 18, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
bytedance / 1d-tokenizer
View on GitHub
This repo contains the code for 1D tokenizer and generator
☆1,166Mar 20, 2025Updated last year
wusize / CLIPSelf
View on GitHub
[ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction
☆207Feb 5, 2024Updated 2 years ago
xushilin1 / RMP-SAM
View on GitHub
[ICLR 2025 oral] RMP-SAM: Towards Real-Time Multi-Purpose Segment Anything
☆270Apr 11, 2025Updated last year
shuheikurita / RefEgo
View on GitHub
☆13Jul 20, 2024Updated 2 years ago
lxtGH / DenseWorld-1M
View on GitHub
Code and dataset link for "DenseWorld-1M: Towards Detailed Dense Grounded Caption in the Real World"
☆129Oct 2, 2025Updated 9 months ago
UX-Decoder / Semantic-SAM
View on GitHub
[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"
☆2,853Jul 10, 2025Updated last year
see-say-segment / sesame
View on GitHub
🔥 [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)"
☆47Jun 16, 2024Updated 2 years ago