yizhe-ang / interactive-transformer
A visual interface for understanding and interpreting Transformers
☆77 Updated last year
Alternatives and similar repositories for interactive-transformer:
Users who are interested in interactive-transformer are comparing it to the repositories listed below
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus. ☆59 Updated 10 months ago
- Sparse autoencoders for Contra text embedding models ☆25 Updated 11 months ago
- Simple Transformer in Jax ☆136 Updated 9 months ago
- ☆48 Updated last year
- A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning ☆97 Updated this week
- Simplex Random Feature attention, in PyTorch ☆74 Updated last year
- ☆60 Updated last year
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere) ☆91 Updated 3 weeks ago
- ☆124 Updated this week
- ☆26 Updated 11 months ago
- Simple embedding -> text model trained on a small subset of Wikipedia sentences. ☆153 Updated last year
- Stream of my favorite papers and links ☆41 Updated last week
- ☆87 Updated last week
- A puzzle to learn about prompting ☆124 Updated last year
- look how they massacred my boy ☆63 Updated 5 months ago
- Drive a browser with Cohere ☆72 Updated last year
- compute, storage, and networking infra at home ☆65 Updated last year
- Compiling useful links, papers, benchmarks, ideas, etc. ☆41 Updated last week
- The history files when recording human interaction while solving ARC tasks ☆97 Updated this week
- KMD is a collection of conversational exchanges between patients and doctors on various medical topics. It aims to capture the intricaci… ☆24 Updated last year
- ☆97 Updated 5 months ago
- Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research). ☆190 Updated 3 months ago
- Just large language models. Hackable, with as little abstraction as possible. Done for my own purposes, feel free to rip. ☆44 Updated last year
- Extract full next-token probabilities via language model APIs ☆237 Updated last year
- An introduction to LLM Sampling ☆77 Updated 3 months ago
- A tree-based prefix cache library that allows rapid creation of looms: hierarchical branching pathways of LLM generations. ☆67 Updated last month
- Vivaria is METR's tool for running evaluations and conducting agent elicitation research. ☆85 Updated this week
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning? ☆52 Updated last week
- ☆67 Updated last month
- A repository for training nanogpt-based Chess playing language models. ☆23 Updated 11 months ago