yizhe-ang / interactive-transformerView external linksLinks
A visual interface for understanding and interpreting Transformers
☆77Oct 21, 2023Updated 2 years ago
Alternatives and similar repositories for interactive-transformer
Users that are interested in interactive-transformer are comparing it to the libraries listed below
Sorting:
- PyTorch and NNsight implementation of AtP* (Kramar et al 2024, DeepMind)☆20Jan 19, 2025Updated last year
- ☆40Jul 26, 2024Updated last year
- Efficiently computing & storing token n-grams from large corpora☆26Oct 6, 2024Updated last year
- A flexible and decentralized Ethereum blockchain explorer☆11Aug 14, 2022Updated 3 years ago
- See https://github.com/cuda-mode/triton-index/ instead!☆11May 8, 2024Updated last year
- ☆12Jan 4, 2024Updated 2 years ago
- Inspired by George Hotz☆12Oct 21, 2021Updated 4 years ago
- Explorations into the proposed SDFT, Self-Distillation Enables Continual Learning, from Shenfeld et al. of MIT☆29Feb 6, 2026Updated last week
- Train to 94% on CIFAR-10 in 4.4 seconds on a single A100☆12Dec 30, 2023Updated 2 years ago
- Interpretability analysis of language model outlier and attempts to distill the model☆13May 8, 2023Updated 2 years ago
- Source-to-Source Debuggable Derivatives in Pure Python☆15Jan 23, 2024Updated 2 years ago
- Minimum Description Length probing for neural network representations☆20Jan 28, 2025Updated last year
- URL downloader supporting checkpointing and continuous checksumming.☆19Nov 29, 2023Updated 2 years ago
- ☆23Jan 27, 2025Updated last year
- 👩💻 Code for the ACL paper "Detecting Edit Failures in LLMs: An Improved Specificity Benchmark"☆20Jan 19, 2024Updated 2 years ago
- Tools for understanding how transformer predictions are built layer-by-layer☆567Aug 7, 2025Updated 6 months ago
- An implementation of the Llama architecture, to instruct and delight☆21May 31, 2025Updated 8 months ago
- ☆51Jan 28, 2024Updated 2 years ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆20Oct 23, 2023Updated 2 years ago
- Mapping out the "memory" of neural nets with data attribution☆39Feb 3, 2026Updated last week
- Utilities for PyTorch distributed☆25Feb 27, 2025Updated 11 months ago
- Optimint is a gas-optimized ERC721 reference implementation that's based on OpenZeppelin Contracts, and requires only 1 SSTORE operation …☆28Jul 27, 2022Updated 3 years ago
- Minimum Bayes Risk Decoding for Hugging Face Transformers☆60Jun 3, 2024Updated last year
- A memory profiler for NVIDIA GPUs to explore memory inefficiencies in GPU-accelerated applications.☆27Oct 13, 2024Updated last year
- train with kittens!☆63Oct 25, 2024Updated last year
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods.☆32Jun 5, 2025Updated 8 months ago
- Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models)☆34Feb 27, 2025Updated 11 months ago
- Code repository for the public reproduction of the language modelling experiments on "MatFormer: Nested Transformer for Elastic Inference…☆31Nov 14, 2023Updated 2 years ago
- ☆60Mar 8, 2022Updated 3 years ago
- An alternative UI for squeeths☆28Nov 4, 2024Updated last year
- FlexAttention w/ FlashAttention3 Support☆27Oct 5, 2024Updated last year
- ☆34Sep 10, 2024Updated last year
- Royalty Management System for NFTs with EIP-2981 royalties☆33Apr 15, 2022Updated 3 years ago
- 🧠 Starter templates for doing interpretability research☆76Jul 16, 2023Updated 2 years ago
- Implementation of 2-simplicial attention proposed by Clift et al. (2019) and the recent attempt to make practical in Fast and Simplex, Ro…☆46Sep 2, 2025Updated 5 months ago
- ☆33Nov 4, 2024Updated last year
- [NeurIPS 2023] Sparse Modular Activation for Efficient Sequence Modeling☆40Dec 2, 2023Updated 2 years ago
- Writing FLUX in Triton☆41Sep 22, 2024Updated last year
- A python algorithm to change the pitch of the voice in real time☆13Dec 13, 2020Updated 5 years ago