tmabraham / ddpo-pytorchLinks
Reproduction of DDPO paper (RLHF for diffusion)
☆89Updated last year
Alternatives and similar repositories for ddpo-pytorch
Users that are interested in ddpo-pytorch are comparing it to the libraries listed below
Sorting:
- Diffusion Reinforcement Learning Library☆188Updated last year
- [ICML 2023] Reflected Diffusion Models (https://arxiv.org/abs/2304.04740)☆155Updated last year
- Simple large-scale training of stable diffusion with multi-node support.☆133Updated 2 years ago
- Implementation of Recurrent Interface Network (RIN), for highly efficient generation of images and video without cascading networks, in P…☆205Updated last year
- A JAX implementation of the continuous time formulation of Consistency Models☆85Updated 2 years ago
- ☆86Updated last year
- Implementation of a multimodal diffusion transformer in Pytorch☆102Updated last year
- GENIE: Higher-Order Denoising Diffusion Solvers☆95Updated last year
- Educational repository for applying the main video data curation techniques presented in the Stable Video Diffusion paper.☆81Updated last year
- Unofficial Implementation of Consistency Models in pytorch☆258Updated 2 years ago
- ☆52Updated 10 months ago
- ☆73Updated 2 years ago
- JAX implementation ViT-VQGAN☆83Updated 2 years ago
- PyTorch implementation for "Parallel Sampling of Diffusion Models", NeurIPS 2023 Spotlight☆144Updated last year
- ICML 2023: Reduce, Reuse, Recycle: Composing Energy-Based Diffusion Models with MCMC☆143Updated 9 months ago
- PyTorch implementation of CLIP Maximum Mean Discrepancy (CMMD) for evaluating image generation models.☆139Updated last year
- AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp…☆294Updated 9 months ago
- 🦾 EvalGIM (pronounced as "EvalGym") is an evaluation library for generative image models. It enables easy-to-use, reproducible automatic…☆82Updated 7 months ago
- Implementation of Lumiere, SOTA text-to-video generation from Google Deepmind, in Pytorch☆279Updated last year
- Train VAE like a boss☆287Updated 9 months ago
- ☆52Updated 2 years ago
- My take on Flow Matching☆69Updated 6 months ago
- This is the official repo for the paper "Zeroth-Order Optimization Meets Human Feedback: Provable Learning via Ranking Oracles", Tang et…☆196Updated 2 years ago
- Implementation of the video diffusion model and training scheme presented in the paper, Flexible Diffusion Modeling of Long Videos, in Py…☆85Updated 3 years ago
- ☆23Updated last year
- Code for ICLR 2023 Paper, "Stable Target Field for Reduced Variance Score Estimation in Diffusion Models”☆76Updated 2 years ago
- ☆211Updated 2 years ago
- ☆53Updated last year
- A general framework for inference-time scaling and steering of diffusion models with arbitrary rewards.☆172Updated last month
- Code for instruction-tuning Stable Diffusion.☆237Updated last year