plutonium-239 / memsave_torch
Lowering PyTorch's Memory Consumption for Selective Differentiation
☆10Updated 7 months ago
Alternatives and similar repositories for memsave_torch:
Users that are interested in memsave_torch are comparing it to the libraries listed below
- A general framework for inference-time scaling and steering of diffusion models with arbitrary rewards.☆115Updated last month
- ☆30Updated 2 months ago
- ☆32Updated 4 months ago
- Remasking Discrete Diffusion Models with Inference-Time Scaling☆16Updated 3 weeks ago
- Generative Equilibrium Transformer☆17Updated last year
- Official Code for Paper "Think While You Generate: Discrete Diffusion with Planned Denoising" [ICLR 2025]☆54Updated last week
- The codebase of our paper "Improving the Training of Rectified Flows", NeurIPS 2024☆106Updated 5 months ago
- The Superposition of Diffusion Models Using the Itô Density Estimator☆33Updated last week
- ☆17Updated 2 months ago
- Official Implementation of the paper: A Complete Recipe for Diffusion Generative Models☆30Updated 5 months ago
- ☆51Updated last year
- [ICML 2024]: Official implementation for the paper: "Consistent Diffusion Meets Tweedie"☆53Updated 11 months ago
- [ICLR 2025] Official PyTorch implementation of "Forgetting Transformer: Softmax Attention with a Forget Gate"☆80Updated last week
- ☆41Updated 10 months ago
- Code for "Journey to the BAOAB-limit: finding effective MCMC samplers for score-based models". See more at https://ajayj.com/journey.☆12Updated 2 years ago
- [NeurIPS 2023] Formulating Discrete Probability Flow Through Optimal Transport☆20Updated last year
- ☆13Updated 10 months ago
- Code for paper "Principal Components" Enable A New Language of Images☆23Updated last week
- Official Code Repository for the paper "Continuous Diffusion Model for Language Modeling".☆23Updated 2 weeks ago
- Minimum implementation of EDM (Elucidating the Design Space of Diffusion-Based Generative Models) on cifar10 and mnist☆50Updated last year
- Official implementation of the paper The Hidden Language of Diffusion Models☆72Updated last year
- Official code for "On Calibrating Diffusion Probabilistic Models"☆29Updated 2 years ago
- ☆33Updated 6 months ago
- Tiny re-implementation of MDM in style of LLaDA and nano-gpt speedrun☆44Updated 3 weeks ago
- Code for ICLR 2023 Paper, "Stable Target Field for Reduced Variance Score Estimation in Diffusion Models”☆73Updated last year
- ☆33Updated 2 months ago
- Implementation of Particle Guidance: non-I.I.D. Diverse Sampling with Diffusion Models☆67Updated last year
- Official Repository for "Unbalancedness in Neural Monge Maps Improves Unpaired Domain Translation" [ICLR 2024]☆14Updated 10 months ago
- Official code for the paper "Attention as a Hypernetwork"☆25Updated 9 months ago
- The official repo of continuous speculative decoding☆25Updated this week