zhaoyl18 / SEIKO
SEIKO is a novel reinforcement learning method to efficiently fine-tune diffusion models in an online setting. Our methods outperform all baselines (PPO, classifier-based guidance, direct reward backpropagation) for fine-tuning Stable Diffusion.
☆24Updated last year
Alternatives and similar repositories for SEIKO
Users who are interested in SEIKO are comparing it to the repositories listed below.
- Code for the tutorial/review paper on RL-based fine-tuning. In this code, we especially focus on the design of biological sequences li…☆147Updated last year
- Inference-Time Alignment in Protein Diffusion Models☆44Updated 8 months ago
- Derivative-Free Guidance in Diffusion Models with Soft Value-Based Decoding. For controlled generation in DNA, RNA, proteins, molecules (…☆32Updated 11 months ago
- ☆36Updated last year
- Implementation of Particle Guidance: non-I.I.D. Diverse Sampling with Diffusion Models☆71Updated last year
- Simple Guidance Mechanisms for Discrete Diffusion Models☆51Updated 9 months ago
- Official Code for Local Search GFlowNets (ICLR 2024 Spotlight)☆21Updated 6 months ago
- Official codebase for Exact Energy-Guided Diffusion Sampling via Contrastive Energy Prediction☆34Updated last year
- Reward fine-tuning for Stable Diffusion models based on stochastic optimal control, including Adjoint Matching☆43Updated 3 months ago
- Code for the paper: "Fine-Tuning Discrete Diffusion Models with Policy Gradient Methods"☆27Updated 4 months ago
- Code for paper: "Fine-Tuning Discrete Diffusion Models via Reward Optimization with Applications to DNA and Protein Design"☆58Updated 4 months ago
- 📰 Must-Read Papers on Offline Model-Based Optimization 🔥☆24Updated 2 months ago
- ☆25Updated last year
- Official Code for Paper "Think While You Generate: Discrete Diffusion with Planned Denoising" [ICLR 2025]☆74Updated 4 months ago
- Code repository for Trajectory Flow Matching☆82Updated 10 months ago
- ☆34Updated 5 months ago
- Code for the paper Iterated Denoising Energy Matching for Sampling from Boltzmann Densities.☆62Updated 5 months ago
- Official Jax Implementation of MD4 Masked Diffusion Models☆125Updated 6 months ago
- GflowNets, MCMC, Metropolis-Hasting, Gibbs sampling, Metropolis-adjusted Langevin, Inverse Transform Sampling, Acceptance-Rejection Metho…☆86Updated 2 years ago
- Entropic Optimal Transport Benchmark (NeurIPS 2023).☆25Updated last year
- A general framework for inference-time scaling and steering of diffusion models with arbitrary rewards.☆186Updated 2 months ago
- ☆32Updated 2 years ago
- ☆18Updated last year
- Graph Diffusion Policy Optimization☆39Updated last year
- Code for the paper https://arxiv.org/abs/2402.04997☆93Updated last year
- ☆34Updated last year
- Code release for "Stochastic Optimal Control Matching"☆37Updated last year
- This repository is the official implementation of Bidirectional Learning for Offline Infinite-width Model-based Optimization (NeurIPS 202…☆15Updated 2 years ago
- ☆32Updated 3 months ago
- ☆19Updated 2 years ago