irregular-rhomboid / EAI-Math-Reading-GroupLinks
Resources from the EleutherAI Math Reading Group
โ54Updated 9 months ago
Alternatives and similar repositories for EAI-Math-Reading-Group
Users that are interested in EAI-Math-Reading-Group are comparing it to the libraries listed below
Sorting:
- ๐ง Starter templates for doing interpretability researchโ74Updated 2 years ago
- โ166Updated 2 years ago
- An interactive exploration of Transformer programming.โ270Updated 2 years ago
- Notebooks accompanying Anthropic's "Toy Models of Superposition" paperโ130Updated 3 years ago
- A puzzle to learn about promptingโ135Updated 2 years ago
- git extension for {collaborative, communal, continual} model developmentโ216Updated last year
- The simplest, fastest repository for training/finetuning medium-sized GPTs.โ174Updated 5 months ago
- Train very large language models in Jax.โ210Updated 2 years ago
- โ285Updated last year
- unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"โ79Updated 3 years ago
- LoRA for arbitrary JAX models and functionsโ143Updated last year
- Functional local implementations of main model parallelism approachesโ95Updated 2 years ago
- Large scale 4D parallelism pre-training for ๐ค transformers in Mixture of Experts *(still work in progress)*โ87Updated last year
- Neural Networks and the Chomsky Hierarchyโ211Updated last year
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024)โ195Updated last year
- Understand and test language model architectures on synthetic tasks.โ243Updated 2 months ago
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT trainingโ132Updated last year
- Erasing concepts from neural representations with provable guaranteesโ239Updated 10 months ago
- A set of Python scripts that makes your experience on TPU betterโ54Updated 2 months ago
- โ62Updated 3 years ago
- โ91Updated last year
- HomebrewNLP in JAX flavour for maintable TPU-Trainingโ51Updated last year
- A MAD laboratory to improve AI architecture designs ๐งชโ135Updated 11 months ago
- Extract full next-token probabilities via language model APIsโ248Updated last year
- Code associated to papers on superposition (in ML interpretability)โ33Updated 3 years ago
- See the issue board for the current status of active and prospective projects!โ65Updated 3 years ago
- A library to create and manage configuration files, especially for machine learning projects.โ79Updated 3 years ago
- Emergent world representations: Exploring a sequence model trained on a synthetic taskโ191Updated 2 years ago
- A Jax-based library for building transformers, includes implementations of GPT, Gemma, LlaMa, Mixtral, Whisper, SWin, ViT and more.โ297Updated last year
- MinT: Minimal Transformer Library and Tutorialsโ259Updated 3 years ago