Alokia / diffusion-DDPM-pytorch
This is a pytorch implementation of Denoising Diffusion Probabilistic Models
☆12Updated last year
Related projects ⓘ
Alternatives and complementary repositories for diffusion-DDPM-pytorch
- [ICLR 2024] Test-Time Adaptation with CLIP Reward for Zero-Shot Generalization in Vision-Language Models.☆55Updated 3 months ago
- [NeurIPS'23] DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions☆60Updated 6 months ago
- [ICCV 2023] Generative Prompt Model for Weakly Supervised Object Localization☆55Updated last year
- [CVPR 2024] The official implementation of paper "Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training"☆27Updated 6 months ago
- Official PyTorch implementation for "Diffusion Models and Semi-Supervised Learners Benefit Mutually with Few Labels"☆78Updated 9 months ago
- [ECCV 2024] ControlCap: Controllable Region-level Captioning☆55Updated 2 weeks ago
- The official repository for ICLR2024 paper "FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition"☆61Updated 7 months ago
- Code for the paper "Detecting Any Human-Object Interaction Relationship: Universal HOI Detector with Spatial Prompt Learning on Foundatio…☆23Updated last year
- [ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference☆63Updated 2 months ago
- [ICLR 2023] Masked Frequency Modeling for Self-Supervised Visual Pre-Training☆68Updated last year
- [ICCV 2023 Oral] Official Implementation of "Denoising Diffusion Autoencoders are Unified Self-supervised Learners"☆143Updated 8 months ago
- ☆57Updated last year
- ICLR‘24 Offical Implementation of Composed Image Retrieval with Text Feedback via Multi-grained Uncertainty Regularization☆64Updated 9 months ago
- The offical implemention of JM3D.☆27Updated last year
- ☆38Updated last year
- ☆32Updated 11 months ago
- [ECCV 2024] Official repository for "DataDream: Few-shot Guided Dataset Generation"☆23Updated 3 months ago
- Official PyTorch implementation of Which Tokens to Use? Investigating Token Reduction in Vision Transformers presented at ICCV 2023 NIVT …☆30Updated last year
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition☆69Updated 2 months ago
- Open-vocabulary Video Instance Segmentation Codebase built upon Detectron2, which is really easy to use.☆17Updated 7 months ago
- [CVPR 2024 Best paper award candidate] EGTR: Extracting Graph from Transformer for Scene Graph Generation☆76Updated 4 months ago
- 【ICCV 2023】Diverse Data Augmentation with Diffusions for Effective Test-time Prompt Tuning☆56Updated 3 months ago
- [NeurlPS 2024] One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos☆26Updated last week
- Implementation of "VL-Mamba: Exploring State Space Models for Multimodal Learning"☆77Updated 7 months ago
- Distribution-Aware Prompt Tuning for Vision-Language Models (ICCV 2023)☆37Updated 10 months ago
- ICCV2023: Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning☆39Updated last year
- ☆21Updated last year
- This is a pytorch implementation of Denoising Diffusion Implicit Models☆58Updated last year
- ☆32Updated 7 months ago
- Code Release of F-LMM: Grounding Frozen Large Multimodal Models☆40Updated 3 months ago