Code for "Optimizing DDPM Sampling with Shortcut Fine-Tuning" (https://arxiv.org/abs/2301.13362), ICML 2023
☆30Oct 6, 2023Updated 2 years ago
Alternatives and similar repositories for SFT-PG
Users that are interested in SFT-PG are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PyTorch implementation of JEM++: Improved Techniques for Training JEM☆13Mar 11, 2023Updated 3 years ago
- Official implementation of MINDE: Mutual Information Neural Diffusion Estimation☆23Apr 17, 2025Updated last year
- PyTorch implementation of StableMask (ICML'24)☆15Jun 27, 2024Updated last year
- ☆59Sep 23, 2024Updated last year
- PyTorch implementation of denoising diffusion probabilistic models.☆32Apr 13, 2021Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [ICLR2026] The code for "Interp3D: Correspondence-Aware Interpolation for Generative Textured 3D Morphing."☆28Jan 21, 2026Updated 4 months ago
- DreamCache: Finetuning-Free Lightweight Personalized Image Generation via Feature Caching (CVPR'25)☆20Jun 3, 2025Updated 11 months ago
- Code for the paper "Training Diffusion Models with Reinforcement Learning"☆570Jul 5, 2023Updated 2 years ago
- DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support☆763Mar 22, 2024Updated 2 years ago
- ☆21Jul 6, 2025Updated 10 months ago
- Unconditional music synthesis using a diffusion model in the STFT domain☆12May 31, 2022Updated 3 years ago
- NeurIPS'23: Energy Discrepancies: A Score-Independent Loss for Energy-Based Models☆17Oct 22, 2024Updated last year
- This is an official implementation of the Reward rAnked Fine-Tuning Algorithm (RAFT), also known as iterative best-of-n fine-tuning or re…☆40Sep 22, 2024Updated last year
- Active Learning Helps Pretrained Models Learn the Intended Task (https://arxiv.org/abs/2204.08491) by Alex Tamkin, Dat Nguyen, Salil Desh…☆11Nov 22, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Official Implementation of Avoiding spurious correlations via logit correction☆17May 6, 2023Updated 3 years ago
- Official PyTorch implementation of "Rethinking Value Function Learning for Generalization in Reinforcement Learning" (NeurIPS 2022)☆15Feb 20, 2023Updated 3 years ago
- Official codebase for the NeurIPS 2023 paper: Towards Last-layer Retraining for Group Robustness with Fewer Annotations. https://arxiv.or…☆12May 15, 2024Updated 2 years ago
- SprintSeoul Homepage☆15Feb 23, 2022Updated 4 years ago
- Official code for the paper "Does CLIP's Generalization Performance Mainly Stem from High Train-Test Similarity?" (ICLR 2024)☆11Aug 26, 2024Updated last year
- Background Subtraction for complex scenes such as intersections from surveillance cameras☆10Jul 15, 2022Updated 3 years ago
- A GUI which enables drawing using Python scripts☆10Mar 27, 2024Updated 2 years ago
- ☆13Aug 14, 2022Updated 3 years ago
- Have an LLM write your biography, probably incorrectly☆14Dec 26, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Code release for the paper "Calibrating Energy-based Generative Adversarial Networks"☆24Oct 31, 2017Updated 8 years ago
- Deep Generative Models (Chainer)☆10Oct 12, 2017Updated 8 years ago
- The official implementation of Diffusion-KTO: Aligning Diffusion Models by Optimizing Human Utility☆69Aug 16, 2025Updated 9 months ago
- [ECCV 2024] Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning☆50Jun 17, 2025Updated 11 months ago
- [NeurIPS 23] Characterizing OOD Error via Optimal Transport☆13Nov 19, 2023Updated 2 years ago
- The PyTorch implementation of the GLF☆22Oct 12, 2021Updated 4 years ago
- A latent flow-based diffusion model trained on the 2012 ImageNet dataset from scratch.☆25May 21, 2025Updated last year
- RichHF-18K dataset contains rich human feedback labels we collected for our CVPR'24 paper: https://arxiv.org/pdf/2312.10240, along with t…☆155Jun 25, 2024Updated last year
- ☆15Apr 3, 2026Updated last month
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Realistic gramophone noise synthesis using a diffusion model☆18Aug 28, 2022Updated 3 years ago
- We propose a theoretically motivated method, Adversarial Training with informative Outlier Mining (ATOM), which improves the robustness o…☆57Feb 17, 2022Updated 4 years ago
- [CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"☆243Apr 6, 2024Updated 2 years ago
- Physics-based Zero-Shot Video Generation☆31Oct 4, 2024Updated last year
- Code for abstracting, evaluating, and visualizing Markov Decision Processes.☆10Jan 12, 2017Updated 9 years ago
- Code for "VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement [ACL 2026 Findings]"☆50Apr 7, 2026Updated last month
- Reproduction of DDPO paper (RLHF for diffusion)☆93Sep 20, 2023Updated 2 years ago