CarperAI / DRLX
Diffusion Reinforcement Learning Library
☆179 Updated last year
Alternatives and similar repositories for DRLX:
Users who are interested in DRLX are comparing it to the libraries listed below.
- This is the official repo for the paper "Zeroth-Order Optimization Meets Human Feedback: Provable Learning via Ranking Oracles", Tang et… ☆196 Updated last year
- Hugging Face-compatible SDXL UNet implementation that is readily hackable ☆408 Updated last year
- Faster generation with text-to-image diffusion models. ☆210 Updated 4 months ago
- AlignProp uses direct reward backpropagation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more samp… ☆270 Updated 3 months ago
- Simple large-scale training of Stable Diffusion with multi-node support. ☆128 Updated last year
- Iterable data pipelines for PyTorch training. ☆81 Updated 5 months ago
- ☆125 Updated 4 months ago
- Reproduction of DDPO paper (RLHF for diffusion) ☆81 Updated last year
- ☆322 Updated 5 months ago
- Better Aligning Text-to-Image Models with Human Preference. ICCV 2023 ☆273 Updated last year
- Implementation of Key-Locked Rank One Editing, from Nvidia AI ☆232 Updated last year
- Code for instruction-tuning Stable Diffusion. ☆221 Updated last year
- This repository implements the idea of "caption upsampling" from DALL-E 3 with Zephyr-7B and gathers results with SDXL. ☆153 Updated last year
- Implementation of HyperDreamBooth: HyperNetworks for Fast Personalization of Text-to-Image Models ☆167 Updated last year
- [CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model" ☆190 Updated 10 months ago
- ☆416 Updated 10 months ago
- Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO). ☆68 Updated 8 months ago
- Official implementation of "Controlling Text-to-Image Diffusion by Orthogonal Finetuning". ☆285 Updated 3 months ago
- ☆474 Updated last month
- Official code for "RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control" ☆360 Updated 5 months ago
- ☆117 Updated 2 years ago
- ☆31 Updated last year
- T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free! ☆381 Updated 5 months ago
- Implementation of "SVDiff: Compact Parameter Space for Diffusion Fine-Tuning" ☆375 Updated last year
- ☆94 Updated 8 months ago
- ☆166 Updated 2 years ago
- Fast fine-tuning using a booster model that places the initial state at a local minimum ☆113 Updated last year
- Code for "Diffusion Model Alignment Using Direct Preference Optimization" ☆345 Updated 2 weeks ago
- Merge safetensor files using the technique described in "Language Models are Super Mario: Absorbing Abilities from Homologous Models as a… ☆76 Updated 4 months ago
- Implementation of Encoder-based Domain Tuning for Fast Personalization of Text-to-Image Models ☆322 Updated last year