knsiczarnamagia / kaggle-tutorialLinks
Kick-off repository for starting with Kaggle!
☆12Updated 8 months ago
Alternatives and similar repositories for kaggle-tutorial
Users that are interested in kaggle-tutorial are comparing it to the libraries listed below
Sorting:
- ☆10Updated 8 months ago
- ☆27Updated 7 months ago
- course.fast.ai 2022 part 2☆502Updated last year
- Efficient optimizers☆253Updated last week
- Official repository for the paper "Grokfast: Accelerated Grokking by Amplifying Slow Gradients"☆560Updated last year
- Minimal implementation of scalable rectified flow transformers, based on SD3's approach☆603Updated last year
- minLoRA: a minimal PyTorch library that allows you to apply LoRA to any PyTorch model.☆470Updated 2 years ago
- Getting started with diffusion☆660Updated last year
- A minimal PyTorch implementation of probabilistic diffusion models for 2D datasets.☆924Updated last year
- CIFAR-10 speedruns: 94% in 2.6 seconds and 96% in 27 seconds☆275Updated 3 weeks ago
- Annotated version of the Mamba paper☆487Updated last year
- Repo for "Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture"☆556Updated 7 months ago
- Self-contained, minimalistic implementation of diffusion models with Pytorch.☆1,084Updated 3 years ago
- UNet diffusion model in pure CUDA☆615Updated last year
- ☆307Updated last year
- Diffusion Reading Group at EleutherAI☆324Updated 2 years ago
- My annotated papers and meeting recordings for the EleutherAI ML Performance research paper reading group☆19Updated 3 weeks ago
- maximal update parametrization (µP)☆1,579Updated last year
- Implementation of Diffusion Transformer (DiT) in JAX☆290Updated last year
- Huggingface-compatible SDXL Unet implementation that is readily hackable☆424Updated 2 years ago
- supporting pytorch FSDP for optimizers☆84Updated 8 months ago
- Text to Image Latent Diffusion using a Transformer core☆198Updated 11 months ago
- The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training”☆965Updated last year
- Minimalistic, extremely fast, and hackable researcher's toolbench for GPT models in 307 lines of code. Reaches <3.8 validation loss on wi…☆349Updated last year
- A pytorch quantization backend for optimum☆979Updated last month
- Sparsify transformers with SAEs and transcoders☆604Updated this week
- The AdEMAMix Optimizer: Better, Faster, Older.☆184Updated 11 months ago
- Huggingface compatible implementation of RetNet (Retentive Networks, https://arxiv.org/pdf/2307.08621.pdf) including parallel, recurrent,…☆226Updated last year
- Official repository for our work on micro-budget training of large-scale diffusion models.☆1,505Updated 7 months ago
- For optimization algorithm research and development.☆524Updated last week