alan-cooney / transformer-from-scratch
Decoder-only transformer, built from scratch with PyTorch
☆32 · Oct 22, 2023 · Updated 2 years ago
Alternatives and similar repositories for transformer-from-scratch
Users who are interested in transformer-from-scratch are comparing it to the repositories listed below.
- Repository with sample code using Apollo's suggested engineering practices ☆15 · Dec 16, 2024 · Updated last year
- A python sdk for LLM finetuning and inference on runpod infrastructure ☆17 · Updated this week
- ☆20 · Nov 15, 2024 · Updated last year
- A benchmark for mechanistic discovery of circuits in Transformers ☆16 · Dec 15, 2024 · Updated last year
- ☆36 · Apr 30, 2024 · Updated last year
- Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from e… ☆28 · May 23, 2024 · Updated last year
- Code for our paper "Localizing Lying in Llama" ☆13 · Apr 24, 2025 · Updated 9 months ago
- Sparse Autoencoder for Mechanistic Interpretability ☆291 · Jul 20, 2024 · Updated last year
- Minimum Description Length probing for neural network representations ☆20 · Jan 28, 2025 · Updated last year
- ☆146 · Dec 30, 2025 · Updated last month
- (Model-written) LLM evals library ☆18 · Dec 13, 2024 · Updated last year
- Code for our paper "Decomposing The Dark Matter of Sparse Autoencoders" ☆24 · Feb 6, 2025 · Updated last year
- ☆20 · Feb 17, 2023 · Updated 2 years ago
- ☆394 · Aug 21, 2025 · Updated 5 months ago
- Mechanistic Interpretability Visualizations using React ☆320 · Dec 18, 2024 · Updated last year
- Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research). ☆240 · Dec 16, 2024 · Updated last year
- Efficiently computing & storing token n-grams from large corpora ☆26 · Oct 6, 2024 · Updated last year
- A blog on AI, personal development, and living a good life. ☆35 · Updated this week
- epsilon machines and transformers! ☆34 · Jul 9, 2025 · Updated 7 months ago
- A library for efficient patching and automatic circuit discovery. ☆88 · Dec 31, 2025 · Updated last month
- Attribution-based Parameter Decomposition ☆33 · Jun 11, 2025 · Updated 8 months ago
- The NDIF server, which performs deep inference and serves nnsight requests remotely ☆41 · Updated this week
- 🪝PISCES - Precise In-Parameter Suppression for Concept EraSure in Large Language Models ☆12 · May 30, 2025 · Updated 8 months ago
- Training code for Sparse Autoencoders on Embedding models ☆39 · Feb 27, 2025 · Updated 11 months ago
- ControlArena is a collection of settings, model organisms and protocols - for running control experiments. ☆153 · Updated this week
- ☆14 · Dec 6, 2025 · Updated 2 months ago
- Project to Accompany my YouTube Video on this topic ☆11 · Sep 28, 2024 · Updated last year
- Mastodoner is a command line tool (and Python library) for archiving Mastodon, a decentralized micro-blogging social network. ☆13 · Oct 21, 2024 · Updated last year
- Run a raffle among the 🌟 stargazers 🌟 of a Github project! ☆11 · Mar 23, 2023 · Updated 2 years ago
- Turn data into meaningful insights quickly using Promptbooks, our collaborative AI-powered notebooks for data analysis. ☆13 · Jan 21, 2025 · Updated last year
- Trains small LMs. Designed for training on SimpleStories ☆12 · Sep 15, 2025 · Updated 5 months ago
- A framework for evaluating Machine Translation models. ☆12 · May 26, 2025 · Updated 8 months ago
- Sample notebooks for Juno ☆11 · Mar 1, 2025 · Updated 11 months ago
- AlgZoo: uninterpreted models with fewer than 1,500 parameters ☆41 · Jan 19, 2026 · Updated 3 weeks ago
- Create your React Native App in days, not months • incl. Figma File ☆11 · Mar 14, 2025 · Updated 11 months ago
- Tools for understanding how transformer predictions are built layer-by-layer ☆567 · Aug 7, 2025 · Updated 6 months ago
- Guide to Installing Ragflow on Google Cloud Compute Engine ☆13 · Sep 12, 2024 · Updated last year
- Displays interlinear gloss in a more readable way with HTML. ☆10 · Apr 2, 2019 · Updated 6 years ago
- Our work on Reinforcement learning that we share with the rest of the world ☆13 · Jan 7, 2019 · Updated 7 years ago