zhaoyl18 / SEIKOLinks
SEIKO is a novel reinforcement learning method to efficiently fine-tune diffusion models in an online setting. Our methods outperform all baselines (PPO, classifier-based guidance, direct reward backpropagation) for fine-tuning Stable Diffusion.
☆27Updated last year
Alternatives and similar repositories for SEIKO
Users that are interested in SEIKO are comparing it to the libraries listed below
Sorting:
- Code for the tutorial/review paper for RL-based-fine-tuniing. In this code, we especially focus on the design of biological sequences li…☆152Updated last year
- Inference-Time Alignment in Protein Diffusion Models☆47Updated 10 months ago
- Derivative-Free Guidance in Diffusion Models with Soft Value-Based Decoding. For controlled generation in DNA, RNA, proteins, molecules (…☆34Updated last year
- Official codebase for Exact Energy-Guided Diffusion Sampling via Contrastive Energy Prediction☆35Updated 2 years ago
- Simple Guidance Mechanisms for Discrete Diffusion Models☆58Updated 11 months ago
- Reward fine-tuning for Stable Diffusion models based on stochastic optimal control, including Adjoint Matching☆52Updated 6 months ago
- Official Code for Local Search GFlowNets (ICLR 2024 Spotlight)☆23Updated 9 months ago
- 📰 Must-Read Papers on Offline Model-Based Optimization 🔥☆27Updated 5 months ago
- ☆38Updated last year
- Implementation of Particle Guidance: non-I.I.D. Diverse Sampling with Diffusion Models☆72Updated 2 years ago
- Code for paper: "Fine-Tuning Discrete Diffusion Models via Reward Optimization with Applications to DNA and Protein Design"☆62Updated 6 months ago
- ☆25Updated last year
- Code for the paper: "Fine-Tuning Discrete Diffusion Models with Policy Gradient Methods"☆28Updated 6 months ago
- This repository is the official implementation of Bidirectional Learning for Offline Infinite-width Model-based Optimization (NeurIPS 202…☆14Updated 2 years ago
- Official Code for Paper "Think While You Generate: Discrete Diffusion with Planned Denoising" [ICLR 2025]☆83Updated 7 months ago
- Code repository for Trajectory Flow Matching