bubbliiiing / DiT-pytorch
This is a DiT-pytorch codebase, intended mainly for learning the DiT architecture.
☆59Updated 6 months ago
Related projects:
- Fine-tune Stable Diffusion with Dreambooth, LoRA, and ControlNet☆51Updated last year
- The official implementation of "Adapter is All You Need for Tuning Visual Tasks".☆67Updated 3 weeks ago
- ☆66Updated last year
- The official implementation of GrootVL: Tree Topology is All You Need in State Space Model☆58Updated 3 months ago
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition☆67Updated last month
- InstaGen: Enhancing Object Detection by Training on Synthetic Dataset, CVPR2024☆68Updated 5 months ago
- [ICLR 2024 poster] Efficient Modulation for Vision Networks☆46Updated 2 months ago
- A paper list of some recent Mamba-based CV works.☆156Updated this week
- RepNeXt: A Fast Multi-Scale CNN using Structural Reparameterization☆31Updated this week
- Diffusion Transformers (DiTs) trained on MNIST dataset☆40Updated 5 months ago
- The official code of "U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers"☆64Updated 3 months ago
- ImageNet-1K data download, processing for using as a dataset☆55Updated last year
- [CVPR 2024] Code for "Improving the Generalization of Segmentation Foundation Model under Distribution Shift via Weakly Supervised Adapta…☆130Updated last month
- GroupMixAttention and GroupMixFormer☆108Updated 9 months ago
- One summary of efficient segment anything models☆62Updated last month
- [ECCV2024] Official implementation of Crowd-SAM: SAM as a Smart Annotator for Object Detection in Crowded Scenes☆51Updated last month
- A collection of multimodal (MM) + Chat resources☆187Updated 2 weeks ago
- ☆116Updated last year
- [CVPR 2024 Highlight] Official GraCo: Granularity-Controllable Interactive Segmentation.☆41Updated 2 months ago
- ☆123Updated 8 months ago
- ☆48Updated last year
- Code Implementation of EfficientVMamba☆172Updated 5 months ago
- ☆81Updated 3 months ago
- ☆64Updated 7 months ago
- A curated list of papers on the applications of RWKV in computer vision.☆88Updated last month
- Daily feed of this day's research articles about Computer Vision published to https://arxiv.org.☆33Updated this week
- Official implementation for paper "LightViT: Towards Light-Weight Convolution-Free Vision Transformers"☆136Updated 2 years ago
- [CVPR2023] This is an official mmdet implementation of paper "DETRs with Hybrid Matching".☆47Updated last year
- [CVPR 2024] The official implementation for "MS-DETR: Efficient DETR Training with Mixed Supervision"☆77Updated 2 months ago
- DynRefer: Delving into Region-level Multi-modality Tasks via Dynamic Resolution☆34Updated 2 months ago