tmabraham / ddpo-pytorchLinks
Reproduction of DDPO paper (RLHF for diffusion)
☆90Updated last year
Alternatives and similar repositories for ddpo-pytorch
Users that are interested in ddpo-pytorch are comparing it to the libraries listed below
Sorting:
- Diffusion Reinforcement Learning Library☆191Updated last year
- [ICML 2023] Reflected Diffusion Models (https://arxiv.org/abs/2304.04740)☆157Updated last year
- This is the official repo for the paper "Zeroth-Order Optimization Meets Human Feedback: Provable Learning via Ranking Oracles", Tang et…☆196Updated 2 years ago
- Implementation of a multimodal diffusion transformer in Pytorch☆103Updated last year
- Unofficial Implementation of Consistency Models in pytorch☆258Updated 2 years ago
- My take on Flow Matching☆72Updated 8 months ago
- ☆73Updated 2 years ago
- ☆54Updated 11 months ago
- Train VAE like a boss☆292Updated 10 months ago
- A JAX implementation of the continuous time formulation of Consistency Models☆85Updated 2 years ago
- ICML 2023: Reduce, Reuse, Recycle: Composing Energy-Based Diffusion Models with MCMC☆145Updated 11 months ago
- ☆53Updated last year
- ☆52Updated 2 years ago
- GENIE: Higher-Order Denoising Diffusion Solvers☆95Updated last year
- Implementation of Recurrent Interface Network (RIN), for highly efficient generation of images and video without cascading networks, in P…☆206Updated last year
- A mini-library for training consistency models.☆247Updated last year
- A demo for the Direct Ascent Synthesis: Hidden Generative Capabilities in Discriminative Models paper (https://arxiv.org/abs/2502.07753)☆40Updated 6 months ago
- ☆211Updated 2 years ago
- JAX implementation ViT-VQGAN☆83Updated 2 years ago
- Simple large-scale training of stable diffusion with multi-node support.☆134Updated 2 years ago
- Implementation of Key-Locked Rank One Editing, from Nvidia AI☆235Updated 2 years ago
- Code for NeurIPS 2023 paper "Restart Sampling for Improving Generative Processes"☆152Updated last year
- Text to Image Latent Diffusion using a Transformer core☆205Updated last year
- PyTorch implementation of CLIP Maximum Mean Discrepancy (CMMD) for evaluating image generation models.☆143Updated last year
- Code for ICLR 2023 Paper, "Stable Target Field for Reduced Variance Score Estimation in Diffusion Models”☆76Updated 2 years ago
- Code for instruction-tuning Stable Diffusion.☆239Updated last year
- Implementation of the video diffusion model and training scheme presented in the paper, Flexible Diffusion Modeling of Long Videos, in Py…☆85Updated 3 years ago
- AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp…☆299Updated 10 months ago
- 🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch☆50Updated 2 years ago
- Iterable datapipelines for pytorch training.☆87Updated last year