bubbliiiing / DiT-pytorch
This is a DiT-pytorch codebase, intended mainly for learning the DiT architecture.
☆70Updated 11 months ago
Alternatives and similar repositories for DiT-pytorch:
Users who are interested in DiT-pytorch are comparing it to the libraries listed below:
- Fine-tune Stable Diffusion with DreamBooth, LoRA, ControlNet☆54Updated last year
- Diffusion Transformers (DiTs) trained on MNIST dataset☆83Updated 9 months ago
- This is a stable-diffusion library.☆119Updated last year
- Visualization: filter, feature map, attention map, image-mask, grad-cam, human keypoint, guided-backpro☆106Updated last year
- [NeurIPS2024 Spotlight] The official implementation of GrootVL: Tree Topology is All You Need in State Space Model☆89Updated 7 months ago
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition☆72Updated 5 months ago
- Official implementation of "MobileMamba: Lightweight Multi-Receptive Visual Mamba Network".☆74Updated last month
- [NeurIPS 2024 Spotlight ⭐️] Parameter-Inverted Image Pyramid Networks (PIIP)☆81Updated 2 weeks ago
- ☆72Updated last year
- A curated list of papers on the applications of RWKV in computer vision.☆145Updated 2 weeks ago
- The official implementation of "Adapter is All You Need for Tuning Visual Tasks".☆77Updated 5 months ago
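- Helps beginners quickly get started with, use, and get used to the official documentation of the OpenMMLab open-source libraries, so that they can run experiments on their own and freely choose to read deeper material.☆58Updated last year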
- [ICCV2023] This is an official implementation for "Scale-Aware Modulation Meet Transformer".☆197Updated last year
- This is a simplified blip-pytorch codebase, suitable for understanding the structure of Attention and Transformer.☆45Updated last year
- Multimodal MM + Chat collection☆238Updated 2 weeks ago
- Code Implementation of EfficientVMamba☆193Updated 9 months ago
- Implementation of ViTAR: Vision Transformer with Any Resolution in PyTorch☆30Updated 2 months ago
- pytorch ddpm demo☆81Updated last year
- A denoising diffusion model (DDPM) implemented in 500 lines of code, clean and dependency-free☆157Updated 10 months ago
- GroupMixAttention and GroupMixFormer☆115Updated last year
- [ICML 2024] Official PyTorch implementation of "SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-paramete…☆86Updated 5 months ago
- vHeat: Building Vision Models upon Heat Conduction☆102Updated 7 months ago
- ImageNet-1K data download, processing for using as a dataset☆80Updated last year
- [NeurIPS 2024] The official code of "U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers"☆178Updated 4 months ago
- [ECCV 2024] SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding☆50Updated 3 months ago
- [ICLR 2024 poster] Efficient Modulation for Vision Networks☆50Updated 6 months ago
- [CVPR 2024 Highlight] Official GraCo: Granularity-Controllable Interactive Segmentation.☆48Updated 3 weeks ago
- ☆81Updated last year
- The official implementation of DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis☆173Updated 7 months ago
- ☆124Updated 2 years ago