irregular-rhomboid / EAI-Math-Reading-Group
Resources from the EleutherAI Math Reading Group
☆52 Updated 2 months ago
Alternatives and similar repositories for EAI-Math-Reading-Group:
Users that are interested in EAI-Math-Reading-Group are comparing it to the libraries listed below
- A puzzle to learn about prompting ☆121 Updated last year
- 🧠 Starter templates for doing interpretability research ☆63 Updated last year
- ☆161 Updated last year
- 🧱 Modula software package ☆130 Updated 3 weeks ago
- Erasing concepts from neural representations with provable guarantees ☆214 Updated last month
- An interactive exploration of Transformer programming. ☆253 Updated last year
- Notebooks accompanying Anthropic's "Toy Models of Superposition" paper ☆99 Updated 2 years ago
- A MAD laboratory to improve AI architecture designs 🧪 ☆95 Updated 7 months ago
- ☆199 Updated 5 months ago
- Puzzles for exploring transformers ☆327 Updated last year
- A pure-functional implementation of a machine learning transformer model in Python/JAX ☆175 Updated 2 years ago
- ☆138 Updated 2 weeks ago
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training ☆113 Updated 8 months ago
- ☆394 Updated last month
- Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)* ☆80 Updated last year
- Sparse and discrete interpretability tool for neural networks ☆56 Updated 10 months ago
- A library to create and manage configuration files, especially for machine learning projects. ☆77 Updated 2 years ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs. ☆85 Updated 3 weeks ago
- Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition… ☆142 Updated this week
- Neural Networks and the Chomsky Hierarchy ☆189 Updated 8 months ago
- HomebrewNLP in JAX flavour for maintainable TPU-Training ☆46 Updated 10 months ago
- Functional local implementations of main model parallelism approaches ☆95 Updated last year
- git extension for {collaborative, communal, continual} model development ☆206 Updated last month
- LoRA for arbitrary JAX models and functions ☆134 Updated 9 months ago
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways - in Jax (Equinox framework) ☆184 Updated 2 years ago
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024) ☆181 Updated 6 months ago
- ☆74 Updated 5 months ago
- Understand and test language model architectures on synthetic tasks. ☆166 Updated 7 months ago