irregular-rhomboid / EAI-Math-Reading-GroupLinks
Resources from the EleutherAI Math Reading Group
☆53Updated 4 months ago
Alternatives and similar repositories for EAI-Math-Reading-Group
Users that are interested in EAI-Math-Reading-Group are comparing it to the libraries listed below
Sorting:
- git extension for {collaborative, communal, continual} model development☆214Updated 7 months ago
- A puzzle to learn about prompting☆130Updated 2 years ago
- unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"☆77Updated 3 years ago
- Neural Networks and the Chomsky Hierarchy☆206Updated last year
- 🧠 Starter templates for doing interpretability research☆72Updated last year
- Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*☆84Updated last year
- HomebrewNLP in JAX flavour for maintable TPU-Training☆50Updated last year
- ☆79Updated last year
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆141Updated 2 weeks ago
- Notebooks accompanying Anthropic's "Toy Models of Superposition" paper☆127Updated 2 years ago
- An interactive exploration of Transformer programming.☆265Updated last year
- Erasing concepts from neural representations with provable guarantees☆230Updated 5 months ago
- Inference code for LLaMA models in JAX☆118Updated last year
- Emergent world representations: Exploring a sequence model trained on a synthetic task☆182Updated 2 years ago
- 🧱 Modula software package☆202Updated 3 months ago
- Train very large language models in Jax.☆205Updated last year
- See the issue board for the current status of active and prospective projects!☆65Updated 3 years ago
- Keeping language models honest by directly eliciting knowledge encoded in their activations.☆207Updated this week
- Understand and test language model architectures on synthetic tasks.☆219Updated last month
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024)☆190Updated last year
- ☆273Updated 11 months ago
- ☆166Updated 2 years ago
- ☆53Updated last year
- LoRA for arbitrary JAX models and functions☆140Updated last year
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways - in Jax (Equinox framework)☆187Updated 3 years ago
- ☆61Updated 3 years ago
- nanoGPT-like codebase for LLM training☆99Updated last month
- Puzzles for exploring transformers☆354Updated 2 years ago
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆129Updated last year
- Utilities for the HuggingFace transformers library☆68Updated 2 years ago