irregular-rhomboid / EAI-Math-Reading-GroupLinks

Resources from the EleutherAI Math Reading Group

☆53

Alternatives and similar repositories for EAI-Math-Reading-Group

Users that are interested in EAI-Math-Reading-Group are comparing it to the libraries listed below

Sorting:

apartresearch / interpretability-starter
🧠 Starter templates for doing interpretability research
☆73Updated 2 years ago
srush / GPTWorld
A puzzle to learn about prompting
☆132Updated 2 years ago
Sea-Snell / grokking
unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"
☆77Updated 3 years ago
srush / raspy
An interactive exploration of Transformer programming.
☆268Updated last year
anthropics / toy-models-of-superposition
Notebooks accompanying Anthropic's "Toy Models of Superposition" paper
☆127Updated 2 years ago
HazyResearch / zoology
Understand and test language model architectures on synthetic tasks.
☆221Updated 2 weeks ago
r-three / git-theta
git extension for {collaborative, communal, continual} model development
☆217Updated 8 months ago
modula-systems / modula
🧱 Modula software package
☆210Updated this week
google-deepmind / neural_networks_chomsky_hierarchy
Neural Networks and the Chomsky Hierarchy
☆207Updated last year
mcleish7 / arithmetic
Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024)
☆190Updated last year
srush / Transformer-Puzzles
Puzzles for exploring transformers
☆355Updated 2 years ago
EleutherAI / concept-erasure
Erasing concepts from neural representations with provable guarantees
☆231Updated 6 months ago
srush / Autodiff-Puzzles
☆443Updated 9 months ago
likenneth / othello_world
Emergent world representations: Exploring a sequence model trained on a synthetic task
☆184Updated 2 years ago
cloneofsimo / min-fsdp
☆82Updated last year
google-deepmind / nanodo
☆274Updated last year
EleutherAI / nanoGPT-mup
The simplest, fastest repository for training/finetuning medium-sized GPTs.
☆149Updated last month
srush / do-we-need-attention
☆166Updated 2 years ago
Sea-Snell / JAXSeq
Train very large language models in Jax.
☆205Updated last year
cloneofsimo / min-max-gpt
Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training
☆130Updated last year
EleutherAI / elk
Keeping language models honest by directly eliciting knowledge encoded in their activations.
☆207Updated last week
mxbi / arckit
Tools for working with the Abstraction & Reasoning Corpus
☆196Updated 11 months ago
athms / mad-lab
A MAD laboratory to improve AI architecture designs 🧪
☆123Updated 7 months ago
nostalgebraist / transformer-utils
Utilities for the HuggingFace transformers library
☆70Updated 2 years ago
justinchiu / openlogprobs
Extract full next-token probabilities via language model APIs
☆247Updated last year
davisyoshida / lorax
LoRA for arbitrary JAX models and functions
☆140Updated last year
wattenberg / superposition
Code associated to papers on superposition (in ML interpretability)
☆29Updated 2 years ago
ayaka14732 / llama-2-jax
JAX implementation of the Llama 2 model
☆219Updated last year
timaeus-research / devinterp
Tools for studying developmental interpretability in neural networks.
☆100Updated last month
hundredblocks / large-model-parallelism
Functional local implementations of main model parallelism approaches
☆94Updated 2 years ago