Decoder-only transformer, built from scratch with PyTorch
☆33 · Oct 22, 2023 · Updated 2 years ago
Alternatives and similar repositories for transformer-from-scratch
Users who are interested in transformer-from-scratch are comparing it to the libraries listed below:
- Repository with sample code using Apollo's suggested engineering practices ☆15 · Dec 16, 2024 · Updated last year
- A quick way to get started with Transformer Lens ☆14 · Dec 13, 2023 · Updated 2 years ago
- ☆12 · Jul 12, 2024 · Updated last year
- ☆20 · Nov 15, 2024 · Updated last year
- A python sdk for LLM finetuning and inference on runpod infrastructure ☆20 · Feb 16, 2026 · Updated 3 weeks ago
- A benchmark for mechanistic discovery of circuits in Transformers ☆16 · Dec 15, 2024 · Updated last year
- ☆36 · Apr 30, 2024 · Updated last year
- Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from e… ☆28 · May 23, 2024 · Updated last year
- Code for our paper "Localizing Lying in Llama" ☆13 · Apr 24, 2025 · Updated 10 months ago
- Sparse Autoencoder for Mechanistic Interpretability ☆292 · Jul 20, 2024 · Updated last year
- ☆156 · Dec 30, 2025 · Updated 2 months ago
- (Model-written) LLM evals library ☆18 · Dec 13, 2024 · Updated last year
- Code for our paper "Decomposing The Dark Matter of Sparse Autoencoders" ☆23 · Feb 6, 2025 · Updated last year
- ☆399 · Aug 21, 2025 · Updated 6 months ago
- Mechanistic Interpretability Visualizations using React ☆328 · Dec 18, 2024 · Updated last year
- Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research). ☆248 · Feb 27, 2026 · Updated last week
- A blog on AI, personal development, and living a good life. ☆35 · Updated this week
- Efficiently computing & storing token n-grams from large corpora ☆27 · Oct 6, 2024 · Updated last year
- epsilon machines and transformers! ☆34 · Jul 9, 2025 · Updated 8 months ago
- Chrome extension that restores the Dim (dark blue) background theme on X/Twitter ☆36 · Feb 26, 2026 · Updated last week
- ☆13 · Oct 5, 2025 · Updated 5 months ago
- Attribution-based Parameter Decomposition ☆34 · Jun 11, 2025 · Updated 8 months ago
- The NDIF server, which performs deep inference and serves nnsight requests remotely ☆43 · Updated this week
- 🪝PISCES - Precise In-Parameter Suppression for Concept EraSure in Large Language Models ☆12 · May 30, 2025 · Updated 9 months ago
- Training code for Sparse Autoencoders on Embedding models ☆39 · Feb 27, 2025 · Updated last year
- ControlArena is a collection of settings, model organisms and protocols - for running control experiments. ☆160 · Feb 27, 2026 · Updated last week
- LiT (Zero-Shot Transfer with Locked-image text Tuning) image and text encoder models, working in the browser ☆11 · May 16, 2022 · Updated 3 years ago
- AlgZoo: uninterpreted models with fewer than 1,500 parameters ☆45 · Jan 19, 2026 · Updated last month
- ☆14 · Apr 29, 2025 · Updated 10 months ago
- ☆15 · Feb 12, 2026 · Updated 3 weeks ago
- A Gentle Introduction to RAG ☆15 · Oct 8, 2024 · Updated last year
- Project to Accompany my YouTube Video on this topic ☆11 · Sep 28, 2024 · Updated last year
- Mastodoner is a command line tool (and Python library) for archiving Mastodon, a decentralized micro-blogging social network. ☆13 · Oct 21, 2024 · Updated last year
- ☆11 · Feb 21, 2025 · Updated last year
- Run a raffle among the 🌟 stargazers 🌟 of a Github project! ☆11 · Mar 23, 2023 · Updated 2 years ago
- A framework for evaluating Machine Translation models. ☆12 · May 26, 2025 · Updated 9 months ago
- Sample notebooks for Juno ☆11 · Mar 1, 2025 · Updated last year
- Tools for understanding how transformer predictions are built layer-by-layer ☆570 · Aug 7, 2025 · Updated 7 months ago
- A python package for protein inference in Mass Spectrometric data analysis. ☆10 · Jun 6, 2022 · Updated 3 years ago