QianWangX / VidSeg_diffusion
Implementation of Zero-Shot Video Semantic Segmentation [CVPR 2025]
☆53 Updated 6 months ago
Alternatives and similar repositories for VidSeg_diffusion
Users who are interested in VidSeg_diffusion are comparing it to the libraries listed below.
- Official code for "JAFAR: Jack up Any Feature at Any Resolution" ☆151 Updated last week
- Scene-Centric Unsupervised Panoptic Segmentation (CVPR 2025 Highlight) ☆66 Updated 2 months ago
- ☆24 Updated 5 months ago
- [ECCV 2024] Pytorch code for our ECCV'24 paper NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Ra… ☆103 Updated 5 months ago
- [NeurIPS2023] 3D-OWIS is capable of detecting unknown instances in inference, and progressively learning novel classes in the process of … ☆67 Updated last year
- LangSplatV2: High-dimensional 3D Language Gaussian Splatting with 450+ FPS ☆96 Updated last month
- Official Implementation of DINO-Foresight: Looking into the Future with DINO ☆60 Updated last week
- ☆101 Updated last week
- [CVPR'2025] EntitySAM: Segment Everything in Video ☆41 Updated last month
- [ECCV 2024] Decomposition Betters Tracking Everything Everywhere ☆113 Updated last year
- PrimeDepth: Efficient Monocular Depth Estimation with a Stable Diffusion Preimage ☆48 Updated 2 months ago
- ☆34 Updated last year
- 4D Panoptic Scene Graph Generation (NeurIPS'23 Spotlight) ☆112 Updated 5 months ago
- Scaling Properties of Diffusion Models For Perceptual Tasks (CVPR 2025) ☆42 Updated 3 months ago
- [CVPR 2024] 🏡Know Your Neighbors: Improving Single-View Reconstruction via Spatial Vision-Language Reasoning ☆80 Updated last year
- [ECCV 2024] DGInStyle: Domain-Generalizable Semantic Segmentation with Image Diffusion Models and Stylized Semantic Control ☆82 Updated 9 months ago
- [ECCV 2024] Improving 2D Feature Representations by 3D-Aware Fine-Tuning ☆290 Updated 6 months ago
- [EMNLP 2025 Findings] 3D-Aware Vision-Language Models Fine-Tuning with Geometric Distillation ☆18 Updated 2 months ago
- [ICCV'25 oral] Official Code for "LoftUp: Learning a Coordinate-Based Feature Upsampler for Vision Foundation Models" ☆181 Updated last month
- ☆33 Updated 3 months ago
- ☆93 Updated last month
- [ICLR 25'] InstantSplamp: Fast and Generalizable Stenography Framework for Generative Gaussian Splatting ☆20 Updated 4 months ago
- [CVPR2024] SANeRF-HQ: Segment Anything for NeRF in High Quality. ☆50 Updated last year
- [NeurIPS 2023 Spotlight] Code for "Contrastive Lift: 3D Object Instance Segmentation by Slow-Fast Contrastive Fusion" ☆71 Updated last year
- (ICCV 2025) ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations ☆66 Updated 2 weeks ago
- Pytorch implementation of GaussianToken: An Effective Image Tokenizer with 2D Gaussian Splatting ☆88 Updated 4 months ago
- [CVPR 2024] Exploiting Diffusion Prior for Generalizable Dense Prediction ☆78 Updated last year
- [AAAI 2025] GFlow: Recovering 4D World from Monocular Video ☆49 Updated 3 months ago
- Segment This Thing is an efficient image segmentation model that uses a biologically-inspired foveated tokenization to reduce inference … ☆48 Updated 2 months ago
- [IV 2025, Oral] Official code of "6Img-to-3D: Few-Image Large-Scale Outdoor Novel View Synthesis" ☆77 Updated last month