Reproduction of DDPO paper (RLHF for diffusion)
☆94Sep 20, 2023Updated 2 years ago
Alternatives and similar repositories for ddpo-pytorch
Users that are interested in ddpo-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- RLHF for Stable Diffusion☆14Jul 9, 2023Updated 2 years ago
- Diffusion Reinforcement Learning Library☆195Feb 13, 2024Updated 2 years ago
- DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support☆766Mar 22, 2024Updated 2 years ago
- 北京理工大学2019级人工智能专业课程作业分享☆17Apr 7, 2023Updated 3 years ago
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech☆11May 14, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Controllable Text-to-Image Generation with Customized Guidance on Appearance and Position, w/ Stable Diffusion☆31Jul 3, 2024Updated last year
- Code for the paper "Training Diffusion Models with Reinforcement Learning"☆572Jul 5, 2023Updated 2 years ago
- AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp…☆320Nov 1, 2024Updated last year
- Inference code for "StylePeople: A Generative Model of Fullbody Human Avatars" paper. This code is for the part of the paper describing v…☆13Aug 18, 2023Updated 2 years ago
- A example pipeline to use InstructPix2Pix and the associated fine-tuned motion module☆31Sep 29, 2023Updated 2 years ago
- ☆594Dec 21, 2024Updated last year
- ☆59Sep 23, 2024Updated last year
- Contains code for our work on speech to singing conversion (ICASSP 2020)☆50Oct 27, 2020Updated 5 years ago
- Implementation of papers in 101 lines of code.☆18Nov 12, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code for "Diffusion Model Alignment Using Direct Preference Optimization"☆699Jun 2, 2026Updated 2 weeks ago
- A Julia package for the computation of hard, theoretically guaranteed bounds on the moments of jump-diffusion processes with polynomial d…☆15May 19, 2025Updated last year
- A collection of python utilities for mesh processing☆13Jan 20, 2023Updated 3 years ago
- ☆12Aug 25, 2022Updated 3 years ago
- Code for https://arxiv.org/abs/1811.00145☆12Feb 13, 2021Updated 5 years ago
- One stop shop for all things carp☆58Sep 9, 2022Updated 3 years ago
- Train vision models using JAX and 🤗 transformers☆102Dec 14, 2025Updated 6 months ago
- The official repository of PowersheLLM, a model for Powershell maliciousness detection using fine-tuned LLM☆14Jun 6, 2024Updated 2 years ago
- Official Implementation of wd1☆30Sep 25, 2025Updated 8 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [ICML 2023] Reflected Diffusion Models (https://arxiv.org/abs/2304.04740)☆160Oct 19, 2023Updated 2 years ago
- The dataset repo of "CLCIFAR: CIFAR-Derived Benchmark Datasets with Human Annotated Complementary Labels" paper☆17May 11, 2026Updated last month
- [NeurIPS 2025] U-REPA: Aligning Diffusion U-Nets to ViTs☆38Dec 15, 2025Updated 6 months ago
- Implementation of the Doubly Stochastic Neighbor Embedding on Spheres algorithm published by Yao Lu in Sep. 2016 (Source : https://arxiv.…☆15Apr 8, 2018Updated 8 years ago
- Experiments with effect systems☆12Apr 18, 2016Updated 10 years ago
- A Learnable LSH Framework for Efficient NN Training☆34Jul 22, 2021Updated 4 years ago
- Official implementation of the paper "Uncovering the Disentanglement Capability in Text-to-Image Diffusion Models☆175Oct 8, 2023Updated 2 years ago
- Here we provide and collect many functions to generate math problem and step by step solutions for LLM training☆19Jun 21, 2023Updated 2 years ago
- Huggingface-compatible SDXL Unet implementation that is readily hackable☆439Aug 9, 2023Updated 2 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Beyond Trivial Counterfactual Explanations with Diverse Valuable Explanations is a ServiceNow Research project that was started at Elemen…☆13Jul 31, 2023Updated 2 years ago
- [ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think☆1,640Mar 16, 2025Updated last year
- Git Deployment with PHP☆36May 16, 2014Updated 12 years ago
- Simple repository for training small reasoning models☆52Feb 17, 2026Updated 3 months ago
- Official source code for Time is Not Enough: Time-Frequency based Explanation for Time-Series Black-Box Models☆13Dec 5, 2024Updated last year
- ☆17Aug 17, 2021Updated 4 years ago
- A minimal yet resourceful implementation of diffusion models (along with pretrained models + synthetic images for nine datasets)☆311Sep 4, 2024Updated last year