tmabraham / ddpo-pytorchLinks

Reproduction of DDPO paper (RLHF for diffusion)

☆89

Alternatives and similar repositories for ddpo-pytorch

Users that are interested in ddpo-pytorch are comparing it to the libraries listed below

Sorting:

CarperAI / DRLX
Diffusion Reinforcement Learning Library
☆188Updated last year
louaaron / Reflected-Diffusion
[ICML 2023] Reflected Diffusion Models (https://arxiv.org/abs/2304.04740)
☆155Updated last year
mlfoundations / open-diffusion
Simple large-scale training of stable diffusion with multi-node support.
☆133Updated 2 years ago
lucidrains / recurrent-interface-network-pytorch
Implementation of Recurrent Interface Network (RIN), for highly efficient generation of images and video without cascading networks, in P…
☆205Updated last year
crowsonkb / consistency-models
A JAX implementation of the continuous time formulation of Consistency Models
☆85Updated 2 years ago
huggingface / amused
☆86Updated last year
lucidrains / multimodal-dit-pytorch
Implementation of a multimodal diffusion transformer in Pytorch
☆102Updated last year
nv-tlabs / GENIE
GENIE: Higher-Order Denoising Diffusion Solvers
☆95Updated last year
sayakpaul / single-video-curation-svd
Educational repository for applying the main video data curation techniques presented in the Stable Video Diffusion paper.
☆81Updated last year
cloneofsimo / consistency_models
Unofficial Implementation of Consistency Models in pytorch
☆258Updated 2 years ago
Owen-Oertell / rlcm
☆52Updated 10 months ago
cloneofsimo / promptplusplus
☆73Updated 2 years ago
patil-suraj / vit-vqgan
JAX implementation ViT-VQGAN
☆83Updated 2 years ago
AndyShih12 / paradigms
PyTorch implementation for "Parallel Sampling of Diffusion Models", NeurIPS 2023 Spotlight
☆144Updated last year
yilundu / reduce_reuse_recycle
ICML 2023: Reduce, Reuse, Recycle: Composing Energy-Based Diffusion Models with MCMC
☆143Updated 9 months ago
sayakpaul / cmmd-pytorch
PyTorch implementation of CLIP Maximum Mean Discrepancy (CMMD) for evaluating image generation models.
☆139Updated last year
mihirp1998 / AlignProp
AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp…
☆294Updated 9 months ago
facebookresearch / EvalGIM
🦾 EvalGIM (pronounced as "EvalGym") is an evaluation library for generative image models. It enables easy-to-use, reproducible automatic…
☆82Updated 7 months ago
lucidrains / lumiere-pytorch
Implementation of Lumiere, SOTA text-to-video generation from Google Deepmind, in Pytorch
☆279Updated last year
cloneofsimo / vqgan-training
Train VAE like a boss
☆287Updated 9 months ago
johnrobinsn / diffusion_experiments
☆52Updated 2 years ago
dome272 / Flow-Matching
My take on Flow Matching
☆69Updated 6 months ago
TZW1998 / Taming-Stable-Diffusion-with-Human-Ranking-Feedback
This is the official repo for the paper "Zeroth-Order Optimization Meets Human Feedback: Provable Learning via Ranking Oracles", Tang et…
☆196Updated 2 years ago
lucidrains / flexible-diffusion-modeling-videos-pytorch
Implementation of the video diffusion model and training scheme presented in the paper, Flexible Diffusion Modeling of Long Videos, in Py…
☆85Updated 3 years ago
kvablack / LLaVA-server
☆23Updated last year
Newbeeer / stf
Code for ICLR 2023 Paper, "Stable Target Field for Reduced Variance Score Estimation in Diffusion Models”
☆76Updated 2 years ago
cvpr2023-tutorial-diffusion-models / papers
☆211Updated 2 years ago
cloneofsimo / karras-power-ema-tutorial
☆53Updated last year
zacharyhorvitz / Fk-Diffusion-Steering
A general framework for inference-time scaling and steering of diffusion models with arbitrary rewards.
☆172Updated last month
huggingface / instruction-tuned-sd
Code for instruction-tuning Stable Diffusion.
☆237Updated last year