bubbliiiing / DiT-pytorch
这是一个DiT-pytorch的代码,主要用于学习DiT结构。
☆75Updated last year
Alternatives and similar repositories for DiT-pytorch:
Users that are interested in DiT-pytorch are comparing it to the libraries listed below
- A curated list of papers on the applications of RWKV in computer vision.☆163Updated last month
- finetune stable diffusion with Dreambooth、LoRA、ControlNet☆55Updated last year
- GroupMixAttention and GroupMixFormer☆115Updated last year
- [ICLR 2024 poster] Efficient Modulation for Vision Networks☆52Updated 9 months ago
- 这是一个stable-diffusion的库。☆124Updated last year
- visualization:filter、feature map、attention map、image-mask、grad-cam、human keypoint、guided-backpro☆115Updated 2 years ago
- The official implementation of "Adapter is All You Need for Tuning Visual Tasks".☆100Updated 3 weeks ago
- Code Implementation of EfficientVMamba☆203Updated 11 months ago
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition☆74Updated 7 months ago
- [NeurIPS2024 Spotlight] The official implementation of MambaTree: Tree Topology is All You Need in State Space Model☆92Updated 9 months ago
- Diffusion Transformers (DiTs) trained on MNIST dataset☆96Updated 11 months ago
- Official repository of Polarity-aware Linear Attention for Vision Transformers (ICLR 2025)☆56Updated last month
- ☆36Updated 4 months ago
- The official implementation of DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis☆186Updated 9 months ago
- [CVPR25] Official implementation of `MobileMamba: Lightweight Multi-Receptive Visual Mamba Network.'☆151Updated 2 weeks ago
- [CVPR 2024 Highlight] Official GraCo: Granularity-Controllable Interactive Segmentation.☆52Updated 3 weeks ago
- [ICCV2023] This is an official implementation for "Scale-Aware Modulation Meet Transformer".☆199Updated last year
- pytorch ddpm demo☆87Updated last year
- Implementation of ViTaR: ViTAR: Vision Transformer with Any Resolution in PyTorch☆33Updated 4 months ago
- ☆73Updated last year
- Mamba-YOLO-World: Marrying YOLO-World with Mamba for Open-Vocabulary Detection☆75Updated 3 weeks ago
- [CVPR 2024] Code for "Improving the Generalization of Segmentation Foundation Model under Distribution Shift via Weakly Supervised Adapta…☆158Updated 7 months ago
- 500 行代码实现降噪扩散模型 DDPM,干净无依赖☆165Updated last year
- [AAAI 2025] Linear-complexity Visual Sequence Learning with Gated Linear Attention☆109Updated 9 months ago
- [NeurIPS 2024 Spotlight ⭐️] Parameter-Inverted Image Pyramid Networks (PIIP)☆86Updated 2 months ago
- RepNeXt: A Fast Multi-Scale CNN using Structural Reparameterization☆37Updated 5 months ago
- ☆136Updated last year
- [CVPR 2024] SHViT: Single-Head Vision Transformer with Memory Efficient Macro Design☆94Updated 9 months ago
- 多模态 MM +Chat 合集☆249Updated last month
- Official PyTorch implementation of DiffuseMix : Label-Preserving Data Augmentation with Diffusion Models (CVPR'2024)☆108Updated 3 weeks ago