aidangomez / welcomeLinks
Generate a cute welcome message for yourself each day
☆22Updated 2 years ago
Alternatives and similar repositories for welcome
Users that are interested in welcome are comparing it to the libraries listed below
Sorting:
- ☆61Updated 3 years ago
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways - in Jax (Equinox framework)☆187Updated 3 years ago
- Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*☆85Updated last year
- HomebrewNLP in JAX flavour for maintable TPU-Training☆50Updated last year
- ☆53Updated last year
- Resources from the EleutherAI Math Reading Group☆53Updated 4 months ago
- An interactive exploration of Transformer programming.☆265Updated last year
- Train very large language models in Jax.☆204Updated last year
- seqax = sequence modeling + JAX☆165Updated last month
- JAX Implementation of Black Forest Labs' Flux.1 family of models☆34Updated 8 months ago
- JAX implementation of the Llama 2 model☆219Updated last year
- Automatic gradient descent☆208Updated 2 years ago
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆129Updated last year
- Functional local implementations of main model parallelism approaches☆95Updated 2 years ago
- Train vision models using JAX and 🤗 transformers☆98Updated 3 months ago
- LoRA for arbitrary JAX models and functions☆140Updated last year
- supporting pytorch FSDP for optimizers☆83Updated 7 months ago
- Train to 94% on CIFAR-10 in 4.4 seconds on a single A100☆12Updated last year
- ☆274Updated last year
- Fast bare-bones BPE for modern tokenizer training☆159Updated 3 weeks ago
- ☆80Updated last year
- Minimalistic, extremely fast, and hackable researcher's toolbench for GPT models in 307 lines of code. Reaches <3.8 validation loss on wi…☆348Updated 11 months ago
- Simple Transformer in Jax☆138Updated last year
- WIP☆93Updated 11 months ago
- A set of Python scripts that makes your experience on TPU better☆55Updated last year
- A dataset of alignment research and code to reproduce it☆77Updated 2 years ago
- See the issue board for the current status of active and prospective projects!☆65Updated 3 years ago
- git extension for {collaborative, communal, continual} model development☆214Updated 8 months ago
- A case study of efficient training of large language models using commodity hardware.☆68Updated 2 years ago
- ☆143Updated 2 years ago