sayakpaul / single-video-curation-svd
Educational repository for applying the main video data curation techniques presented in the Stable Video Diffusion paper.
☆81Updated 8 months ago
Related projects: ⓘ
- ☆74Updated 8 months ago
- ☆147Updated last year
- Simple large-scale training of stable diffusion with multi-node support.☆122Updated last year
- PyTorch implementation of CLIP Maximum Mean Discrepancy (CMMD) for evaluating image generation models.☆84Updated 5 months ago
- [ECCV 2024] Official PyTorch implementation of "Getting it Right: Improving Spatial Consistency in Text-to-Image Models"☆90Updated 2 months ago
- ☆168Updated 2 months ago
- Official code for 'Paragraph-to-Image Generation with Information-Enriched Diffusion Model'☆93Updated 4 months ago
- Repo is required for the code of our research paper on micro-budget training of large scale diffusion model.☆146Updated last month
- Code for instruction-tuning Stable Diffusion.☆190Updated 7 months ago
- Data release for the ImageInWords (IIW) paper.☆194Updated 3 months ago
- Scaling Diffusion Transformers with Mixture of Experts☆178Updated last week
- Iterable datapipelines for pytorch training.☆78Updated 2 weeks ago
- Official Implementation of weights2weights☆98Updated last week
- Official implementation of "Controlling Text-to-Image Diffusion by Orthogonal Finetuning".☆279Updated 9 months ago
- GenEval: An object-focused framework for evaluating text-to-image alignment☆85Updated last month
- Official PyTorch implmentation of paper "T-Stitch: Accelerating Sampling in Pre-trained Diffusion Models with Trajectory Stitching"☆91Updated 6 months ago
- A one-stop library to standardize the inference and evaluation of all the conditional image generation models. (ICLR 2024)☆144Updated 2 weeks ago
- ☆65Updated last year
- Official PyTorch and Diffusers Implementation of "LinFusion: 1 GPU, 1 Minute, 16K Image"☆175Updated last week
- This is the official repository of our paper "What If We Recaption Billions of Web Images with LLaMA-3 ?"☆115Updated 3 months ago
- "FreeU: Free Lunch in Diffusion U-Net" for Huggingface Diffusers☆94Updated 11 months ago
- Official implementation for "pOps: Photo-Inspired Diffusion Operators"☆70Updated last month
- ☆52Updated last year
- Official Implementation for "MyVLM: Personalizing VLMs for User-Specific Queries" (ECCV 2024)☆142Updated 2 months ago
- [Neurips 2023] T2I-CompBench: A Comprehensive Benchmark for Open-world Compositional Text-to-image Generation☆190Updated 3 weeks ago
- Implementation of Lumiere, SOTA text-to-video generation from Google Deepmind, in Pytorch☆241Updated last month
- Reproduction of DDPO paper (RLHF for diffusion)☆70Updated last year
- Official code implementation for our paper -- Direct Inversion: Optimization-Free Text-Driven Real Image Editing with Diffusion Models.☆25Updated last year
- ☆72Updated last year
- ☆99Updated 6 months ago