brianlck / FlexMDM
☆25 Updated last month
Alternatives and similar repositories for FlexMDM
Users who are interested in FlexMDM are comparing it to the libraries listed below.
- Official Jax Implementation of MD4 Masked Diffusion Models ☆132 Updated 7 months ago
- ☆73 Updated last year
- [ICLR'25] Artificial Kuramoto Oscillatory Neurons ☆102 Updated 2 months ago
- [ICML 2025] Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction ☆71 Updated 4 months ago
- Official PyTorch Implementation of the Longhorn Deep State Space Model ☆55 Updated 10 months ago
- NF-Layers for constructing neural functionals. ☆90 Updated last year
- Flash Attention Triton kernel with support for second-order derivatives ☆101 Updated last week
- ICML 2022: Learning Iterative Reasoning through Energy Minimization ☆48 Updated 2 years ago
- ☆120 Updated 4 months ago
- ☆28 Updated 2 weeks ago
- Flow-matching algorithms in JAX ☆105 Updated last year
- Beyond Straight-Through ☆102 Updated 2 years ago
- [ICLR 2025 & COLM 2025] Official PyTorch implementation of the Forgetting Transformer and Adaptive Computation Pruning ☆131 Updated 3 weeks ago
- Official Code Repository for the paper "Continuous Diffusion Model for Language Modeling" (NeurIPS 2025). ☆44 Updated 3 weeks ago
- Official Code for Paper "Think While You Generate: Discrete Diffusion with Planned Denoising" [ICLR 2025] ☆79 Updated 5 months ago
- Code for the paper: "Fine-Tuning Discrete Diffusion Models with Policy Gradient Methods" ☆27 Updated 4 months ago
- ☆33 Updated 10 months ago
- ☆33 Updated 11 months ago
- [ICLR 2025] Code for the paper "Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning" ☆77 Updated 8 months ago
- Code accompanying the paper "Generalized Interpolating Discrete Diffusion" ☆103 Updated 4 months ago
- Exploration into the Scaling Value Iteration Networks paper, from Schmidhuber's group ☆37 Updated last year
- Source code for the paper "Positional Attention: Expressivity and Learnability of Algorithmic Computation" ☆14 Updated 4 months ago
- Stick-breaking attention ☆60 Updated 3 months ago
- Code for GFlowNet-EM, a novel algorithm for fitting latent variable models with compositional latents and an intractable true posterior. ☆41 Updated last year
- ☆14 Updated last year
- Ying Nian Wu's UCLA Statistical Machine Learning Tutorial on generative modeling. ☆61 Updated 2 years ago
- Why Do We Need Weight Decay in Modern Deep Learning? [NeurIPS 2024] ☆68 Updated last year
- ☆107 Updated 2 years ago
- Neural Optimal Transport with Lagrangian Costs ☆58 Updated 4 months ago
- Experiment with diffusion models that you can run on your local jupyter instances ☆63 Updated 11 months ago