aidangomez / welcomeLinks
Generate a cute welcome message for yourself each day
☆22Updated 2 years ago
Alternatives and similar repositories for welcome
Users that are interested in welcome are comparing it to the libraries listed below
Sorting:
- ☆53Updated last year
 - ☆62Updated 3 years ago
 - seqax = sequence modeling + JAX☆168Updated 3 months ago
 - Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways - in Jax (Equinox framework)☆188Updated 3 years ago
 - HomebrewNLP in JAX flavour for maintable TPU-Training☆51Updated last year
 - JAX implementation of the Llama 2 model☆216Updated last year
 - Train very large language models in Jax.☆209Updated 2 years ago
 - LoRA for arbitrary JAX models and functions☆141Updated last year
 - An interactive exploration of Transformer programming.☆269Updated last year
 - JAX Implementation of Black Forest Labs' Flux.1 family of models☆39Updated last month
 - Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*☆87Updated last year
 - ☆283Updated last year
 - Maximal Update Parametrization (μP) with Flax & Optax.☆16Updated last year
 - Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆132Updated last year
 - Resources from the EleutherAI Math Reading Group☆54Updated 8 months ago
 - Fast bare-bones BPE for modern tokenizer training☆168Updated 4 months ago
 - ☆310Updated last year
 - supporting pytorch FSDP for optimizers☆83Updated 10 months ago
 - JAX Synergistic Memory Inspector☆179Updated last year
 - ☆91Updated last year
 - ☆144Updated 2 years ago
 - Explorations into the recently proposed Taylor Series Linear Attention☆99Updated last year
 - A case study of efficient training of large language models using commodity hardware.☆68Updated 3 years ago
 - A comprehensive deep dive into the world of tokens☆226Updated last year
 - A Jax-based library for building transformers, includes implementations of GPT, Gemma, LlaMa, Mixtral, Whisper, SWin, ViT and more.☆295Updated last year
 - A simple library for scaling up JAX programs☆144Updated last year
 - some common Huggingface transformers in maximal update parametrization (µP)☆86Updated 3 years ago
 - Simple Transformer in Jax☆139Updated last year
 - git extension for {collaborative, communal, continual} model development☆215Updated 11 months ago
 - LayerNorm(SmallInit(Embedding)) in a Transformer to improve convergence☆58Updated 3 years ago