tmabraham / ddpo-pytorch
Reproduction of DDPO paper (RLHF for diffusion)
☆82Updated last year
Alternatives and similar repositories for ddpo-pytorch:
Users who are interested in ddpo-pytorch are comparing it to the libraries listed below:
- [ICML 2023] Reflected Diffusion Models (https://arxiv.org/abs/2304.04740)☆158Updated last year
- Diffusion Reinforcement Learning Library☆181Updated last year
- ☆51Updated last year
- ☆51Updated 6 months ago
- AlignProp uses direct reward backpropagation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp…☆278Updated 4 months ago
- ☆84Updated last year
- Implementation of Retrieval-Augmented Denoising Diffusion Probabilistic Models in Pytorch☆64Updated 2 years ago
- Unofficial Implementation of Consistency Models in pytorch☆254Updated 2 years ago
- Iterable datapipelines for pytorch training.☆81Updated 6 months ago
- JAX implementation of ViT-VQGAN☆82Updated 2 years ago
- Train VAE like a boss☆270Updated 5 months ago
- A JAX implementation of the continuous time formulation of Consistency Models☆84Updated last year
- Implementation of a multimodal diffusion transformer in Pytorch☆101Updated 8 months ago
- A general framework for inference-time scaling and steering of diffusion models with arbitrary rewards.☆112Updated last month
- ☆211Updated last year
- PyTorch implementation of CLIP Maximum Mean Discrepancy (CMMD) for evaluating image generation models.☆117Updated 11 months ago
- ☆31Updated last year
- Implementation of Recurrent Interface Network (RIN), for highly efficient generation of images and video without cascading networks, in P…☆201Updated last year
- Simple large-scale training of stable diffusion with multi-node support.☆129Updated last year
- GENIE: Higher-Order Denoising Diffusion Solvers☆94Updated last year
- DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support☆511Updated last year
- ☆21Updated last year
- Focused on fast experimentation and simplicity☆69Updated 2 months ago
- ICML 2023: Reduce, Reuse, Recycle: Composing Energy-Based Diffusion Models with MCMC☆138Updated 5 months ago
- Code for ICLR 2023 Paper, "Stable Target Field for Reduced Variance Score Estimation in Diffusion Models"☆73Updated last year
- [ICCV 2023] Unsupervised Compositional Concepts Discovery with Text-to-Image Generative Models☆82Updated last year
- This is the official repo for the paper "Zeroth-Order Optimization Meets Human Feedback: Provable Learning via Ranking Oracles", Tang et…☆196Updated 2 years ago
- ☆72Updated last year
- [ICLR 2023] DEIS: Fast Sampling of Diffusion Models with Exponential Integrator☆156Updated 2 years ago
- Official implementation of "Understanding the Latent Space of Diffusion Models through the Lens of Riemannian Geometry" (NeurIPS 2023)☆84Updated last year