tmabraham / ddpo-pytorch
Reproduction of DDPO paper (RLHF for diffusion)
☆70 Updated last year
Related projects:
- ☆19 Updated 10 months ago
- AlignProp uses direct reward backpropagation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp… ☆228 Updated 6 months ago
- Simple large-scale training of stable diffusion with multi-node support. ☆122 Updated last year
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training ☆110 Updated 5 months ago
- PyTorch implementation of CLIP Maximum Mean Discrepancy (CMMD) for evaluating image generation models. ☆84 Updated 5 months ago
- Diffusion Reinforcement Learning Library ☆171 Updated 7 months ago
- Educational repository for applying the main video data curation techniques presented in the Stable Video Diffusion paper. ☆81 Updated 8 months ago
- WIP ☆76 Updated last month
- ☆74 Updated 8 months ago
- Implementation of a multimodal diffusion transformer in Pytorch ☆92 Updated 2 months ago
- ☆52 Updated last year
- Code for the paper "Training Diffusion Models with Reinforcement Learning" ☆317 Updated last year
- ☆68 Updated 2 months ago
- ☆43 Updated 4 months ago
- Some personal experiments around routing tokens to different autoregressive attention, akin to mixture-of-experts ☆101 Updated last year
- ☆106 Updated 10 months ago
- [ICML 2023] Reflected Diffusion Models (https://arxiv.org/abs/2304.04740) ☆154 Updated 11 months ago
- Just some miscellaneous utility functions / decorators / modules related to Pytorch and Accelerate to help speed up implementation of new… ☆115 Updated last month
- Official implementation of the paper The Hidden Language of Diffusion Models ☆66 Updated 7 months ago
- ☆147 Updated last year
- ☆72 Updated last year
- Pytorch implementation of the PEER block from the paper, Mixture of A Million Experts, by Xu Owen He at Deepmind ☆105 Updated 3 weeks ago
- Implementation of Infini-Transformer in Pytorch ☆100 Updated last month
- 🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch ☆49 Updated last year
- Implementation of a single layer of the MMDiT, proposed in Stable Diffusion 3, in Pytorch ☆236 Updated 3 weeks ago
- DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support ☆397 Updated 5 months ago
- Official PyTorch implementation of paper "T-Stitch: Accelerating Sampling in Pre-trained Diffusion Models with Trajectory Stitching" ☆91 Updated 6 months ago
- Implementation of the video diffusion model and training scheme presented in the paper, Flexible Diffusion Modeling of Long Videos, in Py… ☆84 Updated 2 years ago
- Implementation of TiTok, proposed by Bytedance in "An Image is Worth 32 Tokens for Reconstruction and Generation" ☆159 Updated 3 months ago
- ☆48 Updated 8 months ago