zeyuwang-zju / DiffX
Official code for "DiffX: Guide Your Layout to Cross-Modal Generative Modeling"
☆13Updated this week
Related projects: ⓘ
- [CVPR 2024] LAKE-RED: Camouflaged Images Generation by Latent Background Knowledge Retrieval-Augmented Diffusion.☆37Updated 2 months ago
- [ICCV2023] DiffuMask: Synthesizing Images with Pixel-level Annotations for Semantic Segmentation Using Diffusion Models☆150Updated 10 months ago
- vHeat: Building Vision Models upon Heat Conduction☆91Updated 3 months ago
- The official implementation of GrootVL: Tree Topology is All You Need in State Space Model☆58Updated 3 months ago
- The official implementation of "Adapter is All You Need for Tuning Visual Tasks".☆67Updated 3 weeks ago
- ☆104Updated 3 months ago
- ☆46Updated last month
- This repository is the official implementation of our Autoregressive Pretraining with Mamba in Vision☆53Updated 3 months ago
- Project Page for "Multi-Task Dense Prediction via Mixture of Low-Rank Experts"☆48Updated 4 months ago
- ☆76Updated 2 months ago
- [ICCV 2023] Phasic Content Fusing Diffusion Model with Directional Distribution Consistency for Few-Shot Model Adaption☆50Updated 9 months ago
- ☆130Updated last year
- [CVPR 2023] Explicit Visual Prompting for Low-Level Structure Segmentations☆180Updated 9 months ago
- [CVPR-W 2023] Official Implementation of One-shot Unsupervised Domain Adaptation with Personalized Diffusion Models☆72Updated 8 months ago
- CLAP: Isolating Content from Style through Contrastive Learning with Augmented Prompts☆38Updated 3 weeks ago
- Official code for "CorrMatch: Label Propagation via Correlation Matching for Semi-Supervised Semantic Segmentation"☆115Updated 3 months ago
- An open source codebase for object detection based on Jittor☆16Updated 5 months ago
- [ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction☆161Updated 7 months ago
- The official repository of the paper "Learning Correlation Structures for Vision Transformers" accepted to CVPR 2024.☆39Updated 5 months ago
- Official implement of ICML2024 Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation☆32Updated last month
- Vision Mamba: A Comprehensive Survey and Taxonomy☆72Updated 3 weeks ago
- Official PyTorch implementation for "Diffusion Models and Semi-Supervised Learners Benefit Mutually with Few Labels"☆75Updated 8 months ago
- A collection of papers on Diffusion for Image-to-Image Translation and Style Transfer☆92Updated this week
- The repository of Expanding Small-Scale Datasets with Guided Imagination (NeurIPS 2023).☆73Updated 8 months ago
- A curated list of papers on the applications of RWKV in computer vision.☆88Updated last month
- SegRefiner: Towards Model-Agnostic Segmentation Refinement with Discrete Diffusion Process☆151Updated 8 months ago
- [CVPR 2024] Official implementation of "Universal Segmentation at Arbitrary Granularity with Language Instruction"☆75Updated 6 months ago
- SAM-DiffSR: Structure-Modulated Diffusion Model for Image Super-Resolution☆95Updated 5 months ago
- Official implementation of paper titled "GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model"☆56Updated 2 months ago
- CVPR2024: Dual Memory Networks: A Versatile Adaptation Approach for Vision-Language Models☆54Updated 2 months ago