bubbliiiing / DiT-pytorchLinks
这是一个DiT-pytorch的代码,主要用于学习DiT结构。
☆80Updated last year
Alternatives and similar repositories for DiT-pytorch
Users that are interested in DiT-pytorch are comparing it to the libraries listed below
Sorting:
- 这是一个stable-diffusion的库。☆125Updated 2 years ago
- finetune stable diffusion with Dreambooth、LoRA、ControlNet☆58Updated 2 years ago
- [CVPR 2024 Highlight] Official GraCo: Granularity-Controllable Interactive Segmentation.☆58Updated 5 months ago
- visualization:filter、feature map、attention map、image-mask、grad-cam、human keypoint、guided-backpro☆127Updated 2 years ago
- [CVPR 2024] Code for "Improving the Generalization of Segmentation Foundation Model under Distribution Shift via Weakly Supervised Adapta…☆172Updated last year
- ☆77Updated 2 years ago
- ☆93Updated last year
- [ICCV2023] This is an official implementation for "Scale-Aware Modulation Meet Transformer".☆210Updated 2 years ago
- GroupMixAttention and GroupMixFormer☆117Updated last year
- Stable Diffusion模型训练样例代码☆45Updated last year
- The official implementation of [CVPR 2025] "5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks".☆353Updated 2 months ago
- SegRefiner: Towards Model-Agnostic Segmentation Refinement with Discrete Diffusion Process☆195Updated last year
- Code Implementation of EfficientVMamba☆224Updated last year
- ☆29Updated last year
- Diffusion Transformers (DiTs) trained on MNIST dataset☆127Updated last year
- ImageNet-1K data download, processing for using as a dataset☆109Updated 2 years ago
- [NeurIPS2024 Spotlight] The official implementation of MambaTree: Tree Topology is All You Need in State Space Model☆100Updated last year
- [ECCV2024] Official implementation of Crowd-SAM: SAM as a Smart Annotator for Object Detection in Crowded Scenes☆89Updated 3 months ago
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition☆79Updated 4 months ago
- ☆42Updated 3 weeks ago
- A curated list of papers on the applications of RWKV in computer vision.☆204Updated 2 months ago
- ☆44Updated 7 months ago
- [ICLR 2025 Spotlight] Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures☆490Updated 6 months ago
- ☆132Updated 2 years ago
- [ICLR 2024 poster] Efficient Modulation for Vision Networks☆56Updated last year
- [CVPR 2024] SHViT: Single-Head Vision Transformer with Memory Efficient Macro Design☆118Updated last year
- [ICLR 2024] MogaNet: Efficient Multi-order Gated Aggregation Network☆248Updated 2 months ago
- ☆155Updated last year
- The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoi…☆95Updated 2 years ago
- [ICCV2023] DiffuMask: Synthesizing Images with Pixel-level Annotations for Semantic Segmentation Using Diffusion Models☆180Updated last year