knsiczarnamagia / kaggle-tutorialLinks
Kick-off repository for starting with Kaggle!
☆12Updated 9 months ago
Alternatives and similar repositories for kaggle-tutorial
Users that are interested in kaggle-tutorial are comparing it to the libraries listed below
Sorting:
- ☆10Updated 10 months ago
- ☆27Updated 9 months ago
- Efficient optimizers☆261Updated this week
- Minimal implementation of scalable rectified flow transformers, based on SD3's approach☆612Updated last year
- course.fast.ai 2022 part 2☆505Updated last year
- supporting pytorch FSDP for optimizers☆84Updated 9 months ago
- CIFAR-10 speedruns: 94% in 2.6 seconds and 96% in 27 seconds☆301Updated 2 months ago
- Getting started with diffusion☆666Updated last year
- ☆20Updated 5 months ago
- A template for starting reproducible Python machine-learning projects with hardware acceleration. Find an example at https://github.com/C…☆106Updated 3 months ago
- ☆14Updated last year
- The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training”☆970Updated last year
- Cramming the training of a (BERT-type) language model into limited compute.☆1,349Updated last year
- Implementation of Diffusion Transformer (DiT) in JAX☆291Updated last year
- Official Implementation of "ADOPT: Modified Adam Can Converge with Any β2 with the Optimal Rate"☆426Updated 9 months ago
- Fast & Simple repository for pre-training and fine-tuning T5-style models☆1,010Updated last year
- The repository for the code of the UltraFastBERT paper☆519Updated last year
- Minimalistic, extremely fast, and hackable researcher's toolbench for GPT models in 307 lines of code. Reaches <3.8 validation loss on wi…☆349Updated last year
- ☆214Updated 9 months ago
- Diffusion Reading Group at EleutherAI☆325Updated 2 years ago
- D-Adaptation for SGD, Adam and AdaGrad☆525Updated 8 months ago
- Puzzles for exploring transformers☆371Updated 2 years ago
- UNet diffusion model in pure CUDA☆647Updated last year
- The AdEMAMix Optimizer: Better, Faster, Older.☆186Updated last year
- A minimal PyTorch implementation of probabilistic diffusion models for 2D datasets.☆938Updated last year
- ☆281Updated last year
- The Prodigy optimizer and its variants for training neural networks.☆419Updated 8 months ago
- ☆200Updated 8 months ago
- A simple implimentation of Bayesian Flow Networks (BFN)☆240Updated last year
- Language Modeling with the H3 State Space Model☆518Updated last year