tmabraham / ddpo-pytorchLinks
Reproduction of DDPO paper (RLHF for diffusion)
☆93Updated 2 years ago
Alternatives and similar repositories for ddpo-pytorch
Users that are interested in ddpo-pytorch are comparing it to the libraries listed below
Sorting:
- Diffusion Reinforcement Learning Library☆194Updated last year
- [ICML 2023] Reflected Diffusion Models (https://arxiv.org/abs/2304.04740)☆159Updated 2 years ago
- Unofficial Implementation of Consistency Models in pytorch☆260Updated 2 years ago
- This is the official repo for the paper "Zeroth-Order Optimization Meets Human Feedback: Provable Learning via Ranking Oracles", Tang et…☆196Updated 2 years ago
- ☆58Updated last year
- ☆211Updated 2 years ago
- GENIE: Higher-Order Denoising Diffusion Solvers☆96Updated 2 years ago
- Implementation of Recurrent Interface Network (RIN), for highly efficient generation of images and video without cascading networks, in P…☆207Updated last year
- JAX implementation ViT-VQGAN☆82Updated 3 years ago
- Simple large-scale training of stable diffusion with multi-node support.☆133Updated 2 years ago
- denoising diffusion models, as simple as possible☆173Updated 3 years ago
- [ICLR 2023]DEIS: Fast Sampling of Diffusion Models with Exponential Integrator☆160Updated 2 years ago
- Implementation of Key-Locked Rank One Editing, from Nvidia AI☆237Updated 2 years ago
- PyTorch implementation of CLIP Maximum Mean Discrepancy (CMMD) for evaluating image generation models.☆158Updated last year
- ☆73Updated 2 years ago
- Implementation of Lumiere, SOTA text-to-video generation from Google Deepmind, in Pytorch☆281Updated last year
- Code for NeurIPS 2023 paper "Restart Sampling for Improving Generative Processes"☆152Updated 2 years ago
- Educational repository for applying the main video data curation techniques presented in the Stable Video Diffusion paper.☆81Updated 2 years ago
- Implementation of a multimodal diffusion transformer in Pytorch☆107Updated last year
- A Toolkit for OpenAI's Consistency Models.☆207Updated 2 years ago
- ☆53Updated 2 years ago
- AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp…☆310Updated last year
- Official Implementation of understanding the latent space of diffusion models through the lens of riemannian geometry (NeurIPS 2023)☆91Updated last year
- ☆23Updated 2 years ago
- Code for instruction-tuning Stable Diffusion.☆248Updated last year
- ICML 2023: Reduce, Reuse, Recycle: Composing Energy-Based Diffusion Models with MCMC☆148Updated last year
- PyTorch implementation for "Parallel Sampling of Diffusion Models", NeurIPS 2023 Spotlight☆151Updated 2 years ago
- ☆52Updated 3 years ago
- A JAX implementation of the continuous time formulation of Consistency Models☆85Updated 2 years ago
- Implementation of the video diffusion model and training scheme presented in the paper, Flexible Diffusion Modeling of Long Videos, in Py…☆85Updated 3 years ago