cloneofsimo / karras-power-ema-tutorialLinks
☆51Updated last year
Alternatives and similar repositories for karras-power-ema-tutorial
Users that are interested in karras-power-ema-tutorial are comparing it to the libraries listed below
Sorting:
- Simple implementation of muP, based on Spectral Condition for Feature Learning. The implementation is SGD only, dont use it for Adam☆80Updated 10 months ago
- These papers will provide unique insightful concepts that will broaden your perspective on neural networks and deep learning☆48Updated last year
- ☆27Updated last year
- ☆34Updated 9 months ago
- WIP☆93Updated 10 months ago
- ☆13Updated last year
- 🦾 EvalGIM (pronounced as "EvalGym") is an evaluation library for generative image models. It enables easy-to-use, reproducible automatic…☆81Updated 6 months ago
- JAX implementation ViT-VQGAN☆83Updated 2 years ago
- ☆39Updated last year
- ☆23Updated last year
- ☆33Updated 7 months ago
- Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO).☆78Updated last year
- Minimal multi-gpu implementation of EDM2: "Analyzing and Improving the Training Dynamics of Diffusion Models"☆32Updated last year
- [ICML 2023] Reflected Diffusion Models (https://arxiv.org/abs/2304.04740)☆157Updated last year
- Tiny re-implementation of MDM in style of LLaDA and nano-gpt speedrun☆52Updated 3 months ago
- ☆17Updated 7 months ago
- ☆24Updated last year
- A general framework for inference-time scaling and steering of diffusion models with arbitrary rewards.☆155Updated last week
- Focused on fast experimentation and simplicity☆74Updated 6 months ago
- Exploration into the proposed "Self Reasoning Tokens" by Felipe Bonetto☆56Updated last year
- Synthetic Alphabet Dataset☆19Updated 2 months ago
- A JAX implementation of the continuous time formulation of Consistency Models☆85Updated 2 years ago
- ☆24Updated last month
- ☆73Updated 2 years ago
- ☆20Updated 8 months ago
- Official code for "On Calibrating Diffusion Probabilistic Models"☆29Updated 2 years ago
- A demo for the Direct Ascent Synthesis: Hidden Generative Capabilities in Discriminative Models paper (https://arxiv.org/abs/2502.07753)☆39Updated 3 months ago
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆127Updated last year
- Official implementation of the paper The Hidden Language of Diffusion Models☆73Updated last year
- ☆42Updated 9 months ago