tech-srl / RASP-exps
Code for running the transformers in the ICML 2021 paper "Thinking Like Transformers"
☆16Updated 3 years ago
Alternatives and similar repositories for RASP-exps
Users that are interested in RASP-exps are comparing it to the libraries listed below
Sorting:
- Implementing RASP transformer programming language https://arxiv.org/pdf/2106.06981.pdf.☆53Updated 3 years ago
- Official repository for the paper "Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks"☆59Updated 3 years ago
- A programming language for formal/informal computation.☆41Updated last month
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways - in Jax (Equinox framework)☆188Updated 2 years ago
- Experiments for efforts to train a new and improved t5☆77Updated last year
- Official repository for the paper "A Modern Self-Referential Weight Matrix That Learns to Modify Itself" (ICML 2022 & NeurIPS 2021 Deep R…☆170Updated last year
- Swarm training framework using Haiku + JAX + Ray for layer parallel transformer language models on unreliable, heterogeneous nodes☆239Updated 2 years ago
- One stop shop for all things carp☆59Updated 2 years ago
- Meta-learning inductive biases in the form of useful conserved quantities.☆37Updated 2 years ago
- Mechanistic Interpretability for Transformer Models☆50Updated 2 years ago
- ☆60Updated 3 years ago
- ☆65Updated 3 years ago
- Probabilistic LLM evaluations. [CogSci2023; ACL2023]☆73Updated 9 months ago
- An interpreter for RASP as described in the ICML 2021 paper "Thinking Like Transformers"☆309Updated 8 months ago
- LoRA for arbitrary JAX models and functions☆136Updated last year
- slowly building a set of infinite riddle generators for data-hungry methods☆12Updated 2 years ago
- Latent Diffusion Language Models☆68Updated last year
- Python library which enables complex compositions of language models such as scratchpads, chain of thought, tool use, selection-inference…☆207Updated 4 months ago
- Automatic gradient descent☆207Updated last year
- An interactive exploration of Transformer programming.☆264Updated last year
- Language-annotated Abstraction and Reasoning Corpus☆86Updated last year
- HomebrewNLP in JAX flavour for maintable TPU-Training☆50Updated last year
- FastFeedForward Networks☆19Updated last year
- unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"☆78Updated 2 years ago
- An unofficial implementation of the Infini-gram model proposed by Liu et al. (2024)☆32Updated 11 months ago
- A dataset of alignment research and code to reproduce it☆77Updated last year
- ☆39Updated 3 years ago
- Enjoy puzzle-solving directly in your browser.☆25Updated last month
- ☆59Updated 3 years ago
- Abstraction and Reasoning Corpus☆14Updated 2 years ago