dasayan05 / iclr24_blog_code
Code repo for ICLR 24 BlogPost titled "Building Diffusion Model's theory from ground up"
☆13Updated 11 months ago
Related projects ⓘ
Alternatives and complementary repositories for iclr24_blog_code
- Demo of the unit_scaling library, showing how a model can be easily adapted to train in FP8.☆35Updated 3 months ago
- FID computation in Jax/Flax.☆24Updated 3 months ago
- ☆58Updated 2 years ago
- Generate bird's-eye views of conference proceedings.☆22Updated 3 months ago
- ☆37Updated 2 years ago
- Utilities for PyTorch distributed☆23Updated last year
- ☆26Updated 2 years ago
- Multi-framework implementation of Deep Kernel Shaping and Tailored Activation Transformations, which are methods that modify neural netwo…☆64Updated 3 months ago
- AdaCat☆49Updated 2 years ago
- Contains my experiments with the `big_vision` repo to train ViTs on ImageNet-1k.☆22Updated last year
- Automatically take good care of your preemptible TPUs☆31Updated last year
- Optimization algorithm which fits a ResNet to CIFAR-10 5x faster than SGD / Adam (with terrible generalization)☆12Updated last year
- Meta-learning inductive biases in the form of useful conserved quantities.☆37Updated last year
- A dashboard for exploring timm learning rate schedulers☆18Updated last year
- Hacks for PyTorch☆17Updated last year
- PyTorch interface for TrueGrad Optimizers☆39Updated last year
- Code for minimum-entropy coupling.☆29Updated 4 months ago
- JAX implementation of Learning to learn by gradient descent by gradient descent☆25Updated 3 weeks ago
- ML/DL Math and Method notes☆57Updated 11 months ago
- ☆18Updated 6 months ago
- Fast training of unitary deep network layers from low-rank updates☆28Updated last year
- Re-implementation of 'Grokking: Generalization beyond overfitting on small algorithmic datasets'☆38Updated 2 years ago
- Texture mapping with variational auto-encoders☆40Updated 3 years ago
- Clean RL implementation using MLX☆25Updated 8 months ago
- Implementations and checkpoints for ResNet, Wide ResNet, ResNeXt, ResNet-D, and ResNeSt in JAX (Flax).☆104Updated 2 years ago
- CUDA implementation of autoregressive linear attention, with all the latest research findings☆43Updated last year
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆78Updated 9 months ago
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods.☆29Updated last week
- Flax (JAX) implementation of Progressive Growing of GANs for Improved Quality, Stability, and Variation☆12Updated 3 years ago