KaiChen1998 / GeoDiffusion
Official PyTorch implementation of GeoDiffusion in ICLR 2024 (https://arxiv.org/abs/2306.04607)
☆58Updated 4 months ago
Related projects: ⓘ
- Official PyTorch implementation of TrackDiffusion (https://arxiv.org/abs/2312.00651)☆61Updated 2 months ago
- DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention☆106Updated 3 months ago
- [ICCV2023] DiffuMask: Synthesizing Images with Pixel-level Annotations for Semantic Segmentation Using Diffusion Models☆150Updated 10 months ago
- official repository of CVPR 2024 paper, RMem: Restricted Memory Banks Improve Video Object Segmentation☆29Updated 3 weeks ago
- [BMVC 2024] Official implementation of Align-DETR☆48Updated last month
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition☆67Updated last month
- [ICCV2023] DETRDistill: A Universal Knowledge Distillation Framework for DETR-families☆37Updated 10 months ago
- [NIPS2023] This is an official implementation of paper "DAC-DETR: Divide the Attention Layers and Conquer".☆51Updated 2 months ago
- Not All Steps are Created Equal: Selective Diffusion Distillation for Image Manipulation (ICCV 2023)☆61Updated 11 months ago
- 😎 Awesome lists of papers and codes about open-vocabulary perception, including both 3D and 2D☆20Updated 2 months ago
- Official Repo for PosSAM: Panoptic Open-vocabulary Segment Anything☆41Updated 5 months ago
- [NeurIPS 2023] FreeMask: Synthetic Images with Dense Annotations Make Stronger Segmentation Models☆128Updated 9 months ago
- Official implement of ICML2024 Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation☆32Updated last month
- ☆35Updated last year
- [CVPR 2024] Official implementation of "Universal Segmentation at Arbitrary Granularity with Language Instruction"☆75Updated 6 months ago
- ☆14Updated 8 months ago
- [ICCV-2023]-Universal Video Segmentaion For VSS, VPS and VIS☆109Updated 6 months ago
- ICCV'2023 | CTVIS: Consistent Training for Online Video Instance Segmentation☆70Updated 11 months ago
- ☆81Updated 3 months ago
- [ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction☆161Updated 7 months ago
- InstaGen: Enhancing Object Detection by Training on Synthetic Dataset, CVPR2024☆68Updated 5 months ago
- The official PyTorch code for "Traffic Scene Parsing through the TSP6K Dataset".☆20Updated last week
- ☆75Updated last year
- ☆41Updated this week
- [ICCV 2023] PyTorch implementation of RandBox☆51Updated 10 months ago
- [CVPR 2024] The repository contains the official implementation of "Open-Vocabulary Segmentation with Semantic-Assisted Calibration"☆45Updated 4 months ago
- [CVPR 2023 Highlight] Freestyle Layout-to-Image Synthesis☆143Updated last year
- ☆16Updated last month
- Official implementation of "Can Language Understand Depth?"☆73Updated last year
- [CVPR 2023] Official repository for paper "Stare at What You See: Masked Image Modeling without Reconstruction"☆63Updated 9 months ago