Francesco215 / autoregressive_diffusionLinks
Video Diffusion Model. Autoregressive, long context, efficient training and inference. WIP
☆28Updated last week
Alternatives and similar repositories for autoregressive_diffusion
Users that are interested in autoregressive_diffusion are comparing it to the libraries listed below
Sorting:
- ☆98Updated 5 months ago
- WIP☆93Updated 10 months ago
- LLMs represent numbers on a helix and manipulate that helix to do addition.☆25Updated 4 months ago
- JAX Implementation of Black Forest Labs' Flux.1 family of models☆34Updated 8 months ago
- Focused on fast experimentation and simplicity☆75Updated 6 months ago
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"☆101Updated 6 months ago
- My take on Flow Matching☆63Updated 5 months ago
- Implementations of attention with the softpick function, naive and FlashAttention-2☆79Updated last month
- ☆46Updated 7 months ago
- ☆47Updated 4 months ago
- supporting pytorch FSDP for optimizers☆82Updated 6 months ago
- NeuMeta transforms neural networks by allowing a single model to adapt on the fly to different sizes, generating the right weights when n…☆43Updated 7 months ago
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆87Updated 3 months ago
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources☆140Updated last month
- ☆53Updated last year
- Synthetic Alphabet Dataset☆19Updated 2 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆101Updated 3 months ago
- ☆24Updated last month
- ☆27Updated last year
- ☆34Updated 9 months ago
- ☆51Updated last year
- σ-GPT: A New Approach to Autoregressive Models☆65Updated 10 months ago
- RS-IMLE☆40Updated 6 months ago
- ☆30Updated 8 months ago
- ☆22Updated last month
- Implementation of Gradient Agreement Filtering, from Chaubard et al. of Stanford, but for single machine microbatches, in Pytorch☆25Updated 5 months ago
- Getting crystal-like representations with harmonic loss☆190Updated 2 months ago
- ☆31Updated last year
- Code for the Fractured Entangled Representation Hypothesis position paper!☆103Updated last month
- ☆33Updated 5 months ago