bubbliiiing / DiT-pytorchLinks

这是一个DiT-pytorch的代码，主要用于学习DiT结构。

☆78

Alternatives and similar repositories for DiT-pytorch

Users that are interested in DiT-pytorch are comparing it to the libraries listed below

Sorting:

Berry-Wu / Visualization
visualization：filter、feature map、attention map、image-mask、grad-cam、human keypoint、guided-backpro
☆126Updated 2 years ago
bubbliiiing / stable-diffusion
这是一个stable-diffusion的库。
☆125Updated last year
AFeng-x / SMT
[ICCV2023] This is an official implementation for "Scale-Aware Modulation Meet Transformer".
☆210Updated 2 years ago
LeiyiHU / mona
The official implementation of [CVPR 2025] "5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks".
☆346Updated last month
Yaziwel / Awesome-RWKV-in-Vision
A curated list of papers on the applications of RWKV in computer vision.
☆200Updated last month
yangyangxu0 / DeMT
☆77Updated 2 years ago
WGS-note / finetune_stable_diffusion
finetune stable diffusion with Dreambooth、LoRA、ControlNet
☆58Updated 2 years ago
TerryPei / EfficientVMamba
Code Implementation of EfficientVMamba
☆219Updated last year
AILab-CVC / GroupMixFormer
GroupMixAttention and GroupMixFormer
☆117Updated last year
ma-xu / EfficientMod
[ICLR 2024 poster] Efficient Modulation for Vision Networks
☆55Updated last year
ysj9909 / SHViT
[CVPR 2024] SHViT: Single-Head Vision Transformer with Memory Efficient Macro Design
☆117Updated last year
EnVision-Research / MTMamba
☆45Updated last week
VISION-SJTU / QuadMamba
☆41Updated this week
Zhao-Yian / GraCo
[CVPR 2024 Highlight] Official GraCo: Granularity-Controllable Interactive Segmentation.
☆58Updated 4 months ago
ChenhongyiYang / PlainMamba
[BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition
☆80Updated 4 months ago
EasonXiao-888 / MambaTree
[NeurIPS2024 Spotlight] The official implementation of MambaTree: Tree Topology is All You Need in State Space Model
☆99Updated last year
owenliang / mnist-dits
Diffusion Transformers (DiTs) trained on MNIST dataset
☆124Updated last year
ZacharyMeng / PolaFormer
Official repository of Polarity-aware Linear Attention for Vision Transformers (ICLR 2025)
☆68Updated 2 months ago
zhang-haojie / wesam
[CVPR 2024] Code for "Improving the Generalization of Segmentation Foundation Model under Distribution Shift via Weakly Supervised Adapta…
☆173Updated 11 months ago
luckybird1994 / ASAM
☆93Updated last year
MzeroMiko / vHeat
vHeat: Building Vision Models upon Heat Conduction
☆241Updated last month
SuperBruceJia / CVPR-LaTeX-Paper-Template
These are CVPR Main Paper, Supplementary Materials, and Rebuttal LaTeX templates.
☆24Updated 2 years ago
OpenGVLab / Vision-RWKV
[ICLR 2025 Spotlight] Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures
☆485Updated 5 months ago
Jasonlee1995 / ImageNet-1K
ImageNet-1K data download, processing for using as a dataset
☆104Updated 2 years ago
xinghaochen / SLAB
[ICML 2024] Official PyTorch implementation of "SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-paramete…
☆107Updated 11 months ago
Zeyi-Lin / Stable-Diffusion-Example
Stable Diffusion模型训练样例代码
☆45Updated last year
tyshiwo1 / DiM-DiffusionMamba
The official implementation of DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis
☆205Updated last year
xxcheng0708 / pytorch-model-train-template
pytorch单精度、半精度、混合精度、单卡、多卡（DP / DDP）、FSDP、DeepSpeed模型训练代码，并对比不同方法的训练速度以及GPU内存的使用
☆114Updated last year
hhaAndroid / awesome-mm-chat
多模态 MM +Chat 合集
☆273Updated 2 months ago
LeapLabTHU / Agent-Attention
Official repository of Agent Attention (ECCV2024)
☆635Updated 8 months ago