dasayan05 / iclr24_blog_code
Code repo for the ICLR 24 blog post titled "Building Diffusion Model's theory from ground up"
☆13 · Updated 11 months ago
Related projects
Alternatives and complementary repositories for iclr24_blog_code
- FID computation in Jax/Flax. ☆24 · Updated 4 months ago
- Code for minimum-entropy coupling. ☆30 · Updated 4 months ago
- Open source code for EigenGame. ☆28 · Updated last year
- Utilities for PyTorch distributed ☆23 · Updated last year
- Contains my experiments with the `big_vision` repo to train ViTs on ImageNet-1k. ☆22 · Updated last year
- Demo of the unit_scaling library, showing how a model can be easily adapted to train in FP8. ☆35 · Updated 4 months ago
- Clean RL implementation using MLX ☆27 · Updated 8 months ago
- ☆19 · Updated 7 months ago
- AdaCat ☆49 · Updated 2 years ago
- Generative cellular automaton-like learning environments for RL. ☆19 · Updated last month
- Multi-framework implementation of Deep Kernel Shaping and Tailored Activation Transformations, which are methods that modify neural netwo… ☆64 · Updated this week
- Optimization algorithm which fits a ResNet to CIFAR-10 5x faster than SGD / Adam (with terrible generalization) ☆12 · Updated last year
- Generate bird's-eye views of conference proceedings. ☆22 · Updated 4 months ago
- Automatically take good care of your preemptible TPUs ☆32 · Updated last year
- Implementations and checkpoints for ResNet, Wide ResNet, ResNeXt, ResNet-D, and ResNeSt in JAX (Flax). ☆104 · Updated 2 years ago
- ☆26 · Updated last year
- ☆58 · Updated 2 years ago
- Hacks for PyTorch ☆17 · Updated last year
- A port of the Mistral-7B model in JAX ☆30 · Updated 4 months ago
- Running Jax in PyTorch Lightning ☆82 · Updated 2 weeks ago
- ☆37 · Updated 2 years ago
- DiCE: The Infinitely Differentiable Monte-Carlo Estimator ☆30 · Updated last year
- A scalable implementation of diffusion and flow-matching with XGBoost models, applied to calorimeter data. ☆17 · Updated 3 weeks ago
- A system for automating selection and optimization of pre-trained models from the TAO Model Zoo ☆22 · Updated 4 months ago
- ☆40 · Updated 4 months ago
- Re-implementation of 'Grokking: Generalization beyond overfitting on small algorithmic datasets' ☆38 · Updated 2 years ago
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods. ☆29 · Updated 3 weeks ago
- ML/DL Math and Method notes ☆57 · Updated 11 months ago