Simple large-scale training of stable diffusion with multi-node support.
☆133May 8, 2023Updated 2 years ago
Alternatives and similar repositories for open-diffusion
Users that are interested in open-diffusion are comparing it to the libraries listed below
Sorting:
- Un-*** 50 billions multimodality dataset☆23Sep 14, 2022Updated 3 years ago
- ☆65Oct 4, 2023Updated 2 years ago
- Paper List for In-context Learning 🌷☆20Jan 3, 2023Updated 3 years ago
- Patching open-vocabulary models by interpolating weights☆91Sep 28, 2023Updated 2 years ago
- I have created a dataset of Image-Text-Pairs by using the cosine similarity of the CLIP embeddings of the image & it's caption derrived f…☆16Apr 22, 2021Updated 4 years ago
- ViT trained on COYO-Labeled-300M dataset☆33Nov 24, 2022Updated 3 years ago
- Open reproduction of MUSE for fast text2image generation.☆359Jun 1, 2024Updated last year
- JAX implementation ViT-VQGAN☆82Sep 21, 2022Updated 3 years ago
- Get OpenAI GPT models to review your PR's☆45Jun 9, 2023Updated 2 years ago
- CLIP-like model evaluation☆802Jan 15, 2026Updated last month
- ☆16Oct 19, 2022Updated 3 years ago
- DataComp: In search of the next generation of multimodal datasets☆772Apr 28, 2025Updated 10 months ago
- COYO-700M: Large-scale Image-Text Pair Dataset☆1,251Nov 30, 2022Updated 3 years ago
- See details in https://github.com/pytorch/xla/blob/r1.12/torch_xla/distributed/fsdp/README.md☆25Dec 22, 2022Updated 3 years ago
- O-GIA is an umbrella for research, infrastructure and projects ecosystem that should provide open source, reproducible datasets, models, …☆87Feb 19, 2023Updated 3 years ago
- ☆130Mar 20, 2023Updated 2 years ago
- Code release for SLIP Self-supervision meets Language-Image Pre-training☆787Feb 9, 2023Updated 3 years ago
- [CVPR 2023] RILS: Masked Visual Reconstruction in Language Semantic Space (https://arxiv.org/abs/2301.06958)☆44Sep 5, 2023Updated 2 years ago
- Official Code Release for Container : Context Aggregation Network☆46Oct 17, 2021Updated 4 years ago
- PyTorch code for MUST☆108May 1, 2025Updated 10 months ago
- Official Implementation of DE-CondDETR and DELA-CondDETR in "Towards Data-Efficient Detection Transformers"☆45Aug 25, 2022Updated 3 years ago
- Efficiently read embedding in streaming from any filesystem☆105Aug 9, 2025Updated 6 months ago
- Train vision models using JAX and 🤗 transformers☆100Dec 14, 2025Updated 2 months ago
- Teach-DETR: Better Training DETR with Teachers☆31Mar 18, 2024Updated last year
- OFA-Compress is a unified framework which provides OFA model finetuning, distillation and inference capabilities in Huggingface version, …☆29Sep 22, 2022Updated 3 years ago
- Karras et al. (2022) diffusion models for PyTorch☆2,566Feb 12, 2026Updated 2 weeks ago
- Unofficial implementation of Tune-A-Video☆192Jan 12, 2023Updated 3 years ago
- Easily compute clip embeddings and build a clip retrieval system with them☆2,730Aug 15, 2025Updated 6 months ago
- clip retrieval benchmark☆17May 4, 2022Updated 3 years ago
- ☆52Mar 12, 2023Updated 2 years ago
- ☆73Jun 3, 2022Updated 3 years ago
- [CVPR 2024] CapsFusion: Rethinking Image-Text Data at Scale☆213Feb 27, 2024Updated 2 years ago
- Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm☆675Sep 19, 2022Updated 3 years ago
- [ICCV 2023] You Only Look at One Partial Sequence☆343Oct 21, 2023Updated 2 years ago
- An open-source framework for training large multimodal models.☆4,068Aug 31, 2024Updated last year
- Consistency Distilled Diff VAE☆2,209Nov 7, 2023Updated 2 years ago
- Unofficial implement of "Pix2seq: A Language Modeling Framework for Object Detection" on mmdetection☆33Apr 18, 2022Updated 3 years ago
- Code release for Deep Incubation (https://arxiv.org/abs/2212.04129)☆92Mar 16, 2023Updated 2 years ago
- Official Implementation of Paella https://arxiv.org/abs/2211.07292v2☆748Oct 4, 2023Updated 2 years ago