srush / raspy

An interactive exploration of Transformer programming.

☆269

Alternatives and similar repositories for raspy

Users who are interested in raspy are comparing it to the libraries listed below.

tech-srl / RASP
An interpreter for RASP as described in the ICML 2021 paper "Thinking Like Transformers"
☆320 Updated last year
r-three / git-theta
git extension for {collaborative, communal, continual} model development
☆215 Updated 11 months ago
srush / Transformer-Puzzles
Puzzles for exploring transformers
☆371 Updated 2 years ago
google-deepmind / tracr
☆546 Updated last year
srush / Autodiff-Puzzles
☆456 Updated last year
srush / GPTWorld
A puzzle to learn about prompting
☆135 Updated 2 years ago
google-deepmind / nanodo
☆283 Updated last year
justinchiu / openlogprobs
Extract full next-token probabilities via language model APIs
☆247 Updated last year
marin-community / levanter
Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax
☆669 Updated this week
jxbz / agd
Automatic gradient descent
☆213 Updated 2 years ago
irregular-rhomboid / EAI-Math-Reading-Group
Resources from the EleutherAI Math Reading Group
☆54 Updated 7 months ago
srush / do-we-need-attention
☆166 Updated 2 years ago
modula-systems / modula
🧱 Modula software package
☆287 Updated 2 months ago
MatX-inc / seqax
seqax = sequence modeling + JAX
☆167 Updated 2 months ago
RobertRiachi / nanoPALM
☆144 Updated 2 years ago
xjdr-alt / simple_transformer
Simple Transformer in Jax
☆139 Updated last year
HazyResearch / zoology
Understand and test language model architectures on synthetic tasks.
☆233 Updated 3 weeks ago
yashbonde / rasp
Implementing RASP transformer programming language https://arxiv.org/pdf/2106.06981.pdf.
☆58 Updated 4 years ago
google-research / cascades
Python library which enables complex compositions of language models such as scratchpads, chain of thought, tool use, selection-inference…
☆213 Updated 4 months ago
hundredblocks / large-model-parallelism
Functional local implementations of main model parallelism approaches
☆96 Updated 2 years ago
KhoomeiK / complexity-scaling
gzip Predicts Data-dependent Scaling Laws
☆34 Updated last year
Sea-Snell / JAXSeq
Train very large language models in Jax.
☆209 Updated last year
google-deepmind / neural_networks_chomsky_hierarchy
Neural Networks and the Chomsky Hierarchy
☆210 Updated last year
awf / functional-transformer
A pure-functional implementation of a machine learning transformer model in Python/JAX
☆180 Updated 5 months ago
HazyResearch / H3
Language Modeling with the H3 State Space Model
☆518 Updated 2 years ago
tysam-code / hlb-gpt
Minimalistic, extremely fast, and hackable researcher's toolbench for GPT models in 307 lines of code. Reaches <3.8 validation loss on wi…
☆350 Updated last year
kingoflolz / swarm-jax
Swarm training framework using Haiku + JAX + Ray for layer parallel transformer language models on unreliable, heterogeneous nodes
☆242 Updated 2 years ago
HenryNdubuaku / nanodl
A Jax-based library for building transformers, includes implementations of GPT, Gemma, LlaMa, Mixtral, Whisper, SWin, ViT and more.
☆294 Updated last year
anthropics / toy-models-of-superposition
Notebooks accompanying Anthropic's "Toy Models of Superposition" paper
☆129 Updated 3 years ago
marin-community / haliax
Named Tensors for Legible Deep Learning in JAX
☆210 Updated this week