google-deepmind / tracr
☆531 Updated last year
Alternatives and similar repositories for tracr:
Users who are interested in tracr are comparing it to the libraries listed below:
- An interpreter for RASP as described in the ICML 2021 paper "Thinking Like Transformers" ☆308 Updated 7 months ago
- An interactive exploration of Transformer programming. ☆262 Updated last year
- Tools for understanding how transformer predictions are built layer-by-layer ☆488 Updated 10 months ago
- Neural Networks and the Chomsky Hierarchy ☆205 Updated last year
- ☆265 Updated last year
- Emergent world representations: Exploring a sequence model trained on a synthetic task ☆181 Updated last year
- Erasing concepts from neural representations with provable guarantees ☆227 Updated 3 months ago
- Keeping language models honest by directly eliciting knowledge encoded in their activations. ☆199 Updated this week
- Code release for "Git Re-Basin: Merging Models modulo Permutation Symmetries" ☆477 Updated 2 years ago
- Language Modeling with the H3 State Space Model ☆520 Updated last year
- Mechanistic Interpretability Visualizations using React ☆241 Updated 4 months ago
- Automatic gradient descent ☆207 Updated last year
- Puzzles for exploring transformers ☆343 Updated last year
- Python library which enables complex compositions of language models such as scratchpads, chain of thought, tool use, selection-inference… ☆207 Updated 3 months ago
- Used for adaptive human in the loop evaluation of language and embedding models. ☆309 Updated 2 years ago
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax ☆569 Updated this week
- ☆219 Updated 6 months ago
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024) ☆189 Updated 11 months ago
- Notebooks accompanying Anthropic's "Toy Models of Superposition" paper ☆120 Updated 2 years ago
- A puzzle to learn about prompting ☆127 Updated last year
- git extension for {collaborative, communal, continual} model development ☆211 Updated 5 months ago
- Resources for skilling up in AI alignment research engineering. Covers basics of deep learning, mechanistic interpretability, and RL. ☆210 Updated last year
- A library for bridging Python and HTML/Javascript (via Svelte) for creating interactive visualizations ☆187 Updated 3 years ago
- NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day ☆255 Updated last year
- ☆121 Updated last year
- Cramming the training of a (BERT-type) language model into limited compute. ☆1,331 Updated 10 months ago
- Extract full next-token probabilities via language model APIs ☆241 Updated last year
- Seminar on Large Language Models (COMP790-101 at UNC Chapel Hill, Fall 2022) ☆310 Updated 2 years ago
- ☆349 Updated last year
- ☆205 Updated last year