FareedKhan-dev / create-stable-diffusion-from-scratch
Implemented a stable diffusion architecture using PyTorch.
☆35Updated 8 months ago
Related projects:
- Reproduction of DDPO paper (RLHF for diffusion)☆70Updated last year
- ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editing☆65Updated 4 months ago
- Iterable data pipelines for PyTorch training.☆78Updated 2 weeks ago
- ☆65Updated this week
- Implementation of a multimodal diffusion transformer in PyTorch☆92Updated 2 months ago
- Research work on multimodal cognitive AI☆54Updated 3 weeks ago
- Scaling Diffusion Transformers with Mixture of Experts☆178Updated last week
- A Gradio component that can be used to annotate images with bounding boxes.☆26Updated 2 weeks ago
- Just some miscellaneous utility functions / decorators / modules related to Pytorch and Accelerate to help speed up implementation of new…☆115Updated last month
- Data release for the ImageInWords (IIW) paper.☆194Updated 3 months ago
- ☆74Updated 8 months ago
- Implementation of Lumiere, SOTA text-to-video generation from Google DeepMind, in PyTorch☆241Updated last month
- Exploration of the multimodal Fuyu-8B model from Adept. 🤓 🔍☆28Updated 10 months ago
- Projects based on SigLIP (Zhai et al., 2023) and Hugging Face transformers integration 🤗☆120Updated 8 months ago
- ☆55Updated 3 months ago
- Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO).☆54Updated 3 months ago
- Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs☆53Updated last month
- VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion Models☆93Updated last month
- ☆176Updated 6 months ago
- Text to Image Latent Diffusion using a Transformer core☆124Updated 3 weeks ago
- PyTorch implementation of CLIP Maximum Mean Discrepancy (CMMD) for evaluating image generation models.☆84Updated 5 months ago
- Democratization of "PaLI: A Jointly-Scaled Multilingual Language-Image Model"☆85Updated 6 months ago
- Image Prompter for Gradio☆66Updated 9 months ago
- ☆38Updated last year
- 🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch☆49Updated last year
- LoRA fine-tuned Stable Diffusion Deployment☆31Updated last year
- ☆147Updated last year
- Educational repository for applying the main video data curation techniques presented in the Stable Video Diffusion paper.☆81Updated 8 months ago
- ☆51Updated last year
- LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via Hybrid Architecture☆115Updated 2 weeks ago