Simple large-scale training of stable diffusion with multi-node support.
☆133May 8, 2023Updated 2 years ago
Alternatives and similar repositories for open-diffusion
Users that are interested in open-diffusion are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Un-*** 50 billions multimodality dataset☆23Sep 14, 2022Updated 3 years ago
- Paper List for In-context Learning 🌷☆19Jan 3, 2023Updated 3 years ago
- ☆65Oct 4, 2023Updated 2 years ago
- Patching open-vocabulary models by interpolating weights☆91Sep 28, 2023Updated 2 years ago
- JAX implementation ViT-VQGAN☆82Sep 21, 2022Updated 3 years ago
- Open reproduction of MUSE for fast text2image generation.☆359Jun 1, 2024Updated last year
- CLIP-like model evaluation☆806Jan 15, 2026Updated 2 months ago
- I have created a dataset of Image-Text-Pairs by using the cosine similarity of the CLIP embeddings of the image & it's caption derrived f…☆16Apr 22, 2021Updated 4 years ago
- DataComp: In search of the next generation of multimodal datasets☆771Apr 28, 2025Updated 10 months ago
- O-GIA is an umbrella for research, infrastructure and projects ecosystem that should provide open source, reproducible datasets, models, …☆87Feb 19, 2023Updated 3 years ago
- Get OpenAI GPT models to review your PR's☆45Jun 9, 2023Updated 2 years ago
- Train vision models using JAX and 🤗 transformers☆101Dec 14, 2025Updated 3 months ago
- COYO-700M: Large-scale Image-Text Pair Dataset☆1,251Nov 30, 2022Updated 3 years ago
- ViT trained on COYO-Labeled-300M dataset☆33Nov 24, 2022Updated 3 years ago
- Code release for SLIP Self-supervision meets Language-Image Pre-training☆787Feb 9, 2023Updated 3 years ago
- ☆29Oct 18, 2022Updated 3 years ago
- Here we collect trick questions and failed tasks for open source LLMs to improve them.☆32Apr 20, 2023Updated 2 years ago
- ☆16Oct 19, 2022Updated 3 years ago
- Karras et al. (2022) diffusion models for PyTorch☆2,575Feb 12, 2026Updated last month
- Official Code Release for Container : Context Aggregation Network☆46Oct 17, 2021Updated 4 years ago
- ☆130Mar 20, 2023Updated 3 years ago
- Teach-DETR: Better Training DETR with Teachers☆31Mar 18, 2024Updated 2 years ago
- [CVPR 2023] RILS: Masked Visual Reconstruction in Language Semantic Space (https://arxiv.org/abs/2301.06958)☆44Sep 5, 2023Updated 2 years ago
- Efficiently read embedding in streaming from any filesystem☆105Aug 9, 2025Updated 7 months ago
- Code release for Deep Incubation (https://arxiv.org/abs/2212.04129)☆92Mar 16, 2023Updated 3 years ago
- An open-source framework for training large multimodal models.☆4,079Aug 31, 2024Updated last year
- Easily compute clip embeddings and build a clip retrieval system with them☆2,733Aug 15, 2025Updated 7 months ago
- Consistency Distilled Diff VAE☆2,213Nov 7, 2023Updated 2 years ago
- See details in https://github.com/pytorch/xla/blob/r1.12/torch_xla/distributed/fsdp/README.md☆25Dec 22, 2022Updated 3 years ago
- ☆27Mar 13, 2021Updated 5 years ago
- PyTorch code for MUST☆108May 1, 2025Updated 10 months ago
- Reproducible scaling laws for contrastive language-image learning (https://arxiv.org/abs/2212.07143)☆188Jun 21, 2025Updated 9 months ago
- ☆11Jan 18, 2024Updated 2 years ago
- Unofficial implementation of Tune-A-Video☆192Jan 12, 2023Updated 3 years ago
- 一个mmcv 的logger hook, 可以用来把模型结果推送到微信上☆21Oct 11, 2022Updated 3 years ago
- [CVPR 2024] CapsFusion: Rethinking Image-Text Data at Scale☆214Feb 27, 2024Updated 2 years ago
- GPU controlled Hetzner Cloud workers swarm for Crawling@Home project☆58Oct 9, 2022Updated 3 years ago
- Unofficial implement of "Pix2seq: A Language Modeling Framework for Object Detection" on mmdetection☆34Apr 18, 2022Updated 3 years ago
- Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.☆4,385Oct 19, 2025Updated 5 months ago