hal-314 / fastai-batch-size-finderLinks
Implementation of OpenAI paper with Simple Noise Scale on Fastai V2
☆19Updated 4 years ago
Alternatives and similar repositories for fastai-batch-size-finder
Users that are interested in fastai-batch-size-finder are comparing it to the libraries listed below
Sorting:
- A case study of efficient training of large language models using commodity hardware.☆68Updated 3 years ago
- HomebrewNLP in JAX flavour for maintable TPU-Training☆50Updated last year
- Automatically take good care of your preemptible TPUs☆36Updated 2 years ago
- Build fast gradio demos of fastai learners☆35Updated 3 years ago
- Train fastai models faster (and other useful tools)☆70Updated 2 months ago
- My explorations into editing the knowledge and memories of an attention network☆35Updated 2 years ago
- Utilities for PyTorch distributed☆24Updated 5 months ago
- Cyclemoid implementation for PyTorch☆90Updated 3 years ago
- Utilities for Training Very Large Models☆58Updated 10 months ago
- In-place debugger for the fastai library and pytorch☆28Updated 3 years ago
- ☆31Updated 2 months ago
- PyTorch interface for TrueGrad Optimizers☆42Updated 2 years ago
- DiCE: The Infinitely Differentiable Monte-Carlo Estimator☆31Updated 2 years ago
- A collection of optimizers, some arcane others well known, for Flax.☆29Updated 4 years ago
- Research repo for code that may or may not end up in fastai3☆50Updated 4 years ago
- Another attempt at a long-context / efficient transformer by me☆38Updated 3 years ago
- Re-implementation of 'Grokking: Generalization beyond overfitting on small algorithmic datasets'☆38Updated 3 years ago
- Contains my experiments with the `big_vision` repo to train ViTs on ImageNet-1k.☆22Updated 2 years ago
- Amos optimizer with JEstimator lib.☆82Updated last year
- A library for squeakily cleaning and filtering language datasets.☆47Updated 2 years ago
- PyTorch reimplementation of the paper "HyperMixer: An MLP-based Green AI Alternative to Transformers" [arXiv 2022].☆17Updated 3 years ago
- A place to store reusable transformer components of my own creation or found on the interwebs☆59Updated last week
- A sample pattern for running CI tests on Modal☆18Updated 4 months ago
- An open source implementation of CLIP.☆32Updated 2 years ago
- Load any clip model with a standardized interface☆22Updated this week
- A dashboard for exploring timm learning rate schedulers☆19Updated 9 months ago
- ☆15Updated 4 years ago
- A GPT, made only of MLPs, in Jax☆58Updated 4 years ago
- Latent Diffusion Language Models☆69Updated last year
- Fast, Modern, and Low Precision PyTorch Optimizers☆108Updated 3 weeks ago