bubbliiiing / DiT-pytorch
This is a DiT-pytorch codebase, intended mainly for learning the DiT architecture.
☆75Updated last year
Alternatives and similar repositories for DiT-pytorch:
Users who are interested in DiT-pytorch are comparing it to the libraries listed below:
- Fine-tune Stable Diffusion with DreamBooth, LoRA, ControlNet☆56Updated last year
- The official implementation of "Adapter is All You Need for Tuning Visual Tasks".☆133Updated last week
- This is a stable-diffusion library.☆124Updated last year
- ☆91Updated 9 months ago
- Diffusion Transformers (DiTs) trained on MNIST dataset☆102Updated last year
- visualization: filter, feature map, attention map, image-mask, grad-cam, human keypoint, guided-backpro☆117Updated 2 years ago
- [CVPR 2024] Code for "Improving the Generalization of Segmentation Foundation Model under Distribution Shift via Weakly Supervised Adapta…☆160Updated 8 months ago
- Official repository of Polarity-aware Linear Attention for Vision Transformers (ICLR 2025)☆61Updated 2 months ago
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition☆76Updated 2 weeks ago
- Mamba-YOLO-World: Marrying YOLO-World with Mamba for Open-Vocabulary Detection☆80Updated last month
- ☆25Updated 9 months ago
- [NeurIPS 2024 Spotlight ⭐️] Parameter-Inverted Image Pyramid Networks (PIIP)☆88Updated this week
- This is a simplified blip-pytorch codebase, suitable for understanding the structure of Attention and Transformer.☆47Updated last year
- ☆74Updated last year
- [CVPR 2024 Highlight] Official GraCo: Granularity-Controllable Interactive Segmentation.☆53Updated last month
- SegRefiner: Towards Model-Agnostic Segmentation Refinement with Discrete Diffusion Process☆185Updated last year
- ☆37Updated 5 months ago
- GroupMixAttention and GroupMixFormer☆115Updated last year
- README.md☆47Updated last year
- A curated list of papers on the applications of RWKV in computer vision.☆169Updated 2 months ago
- pytorch ddpm demo☆88Updated last year
- [ECCV2024] Official implementation of Crowd-SAM: SAM as a Smart Annotator for Object Detection in Crowded Scenes☆81Updated 7 months ago
- RepNeXt: A Fast Multi-Scale CNN using Structural Reparameterization☆38Updated 6 months ago
- ☆36Updated 2 years ago
- [NeurIPS 2024] The official code of "U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers"☆202Updated 6 months ago
- [NeurIPS 24] MoE Jetpack: From Dense Checkpoints to Adaptive Mixture of Experts for Vision Tasks☆116Updated 5 months ago
- Multimodal (MM) + Chat collection☆255Updated 2 months ago
- [ECCV 2024] SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding☆56Updated 6 months ago
- Project for "HyperSeg: Towards Universal Visual Segmentation with Large Language Model".☆128Updated 4 months ago
- ☆138Updated last year