yizhe-ang / interactive-transformerView external linksLinks
A visual interface for understanding and interpreting Transformers
☆77Oct 21, 2023Updated 2 years ago
Alternatives and similar repositories for interactive-transformer
Users that are interested in interactive-transformer are comparing it to the libraries listed below
Sorting:
- ☆40Jul 26, 2024Updated last year
- ☆21Mar 3, 2025Updated 11 months ago
- Efficiently computing & storing token n-grams from large corpora☆26Oct 6, 2024Updated last year
- A flexible and decentralized Ethereum blockchain explorer☆11Aug 14, 2022Updated 3 years ago
- See https://github.com/cuda-mode/triton-index/ instead!☆11May 8, 2024Updated last year
- Interpretability analysis of language model outlier and attempts to distill the model☆13May 8, 2023Updated 2 years ago
- Train to 94% on CIFAR-10 in 4.4 seconds on a single A100☆12Dec 30, 2023Updated 2 years ago
- Inspired by George Hotz☆12Oct 21, 2021Updated 4 years ago
- Explorations into the proposed SDFT, Self-Distillation Enables Continual Learning, from Shenfeld et al. of MIT☆29Feb 6, 2026Updated last week
- ☆16Oct 29, 2022Updated 3 years ago
- Source-to-Source Debuggable Derivatives in Pure Python☆15Jan 23, 2024Updated 2 years ago
- Interpretating the latent space representations of attention head outputs for LLMs☆36Aug 13, 2024Updated last year
- Open huff is a library for secure Huff smart contract development.☆21Nov 27, 2022Updated 3 years ago
- A library for bridging Python and HTML/Javascript (via Svelte) for creating interactive visualizations☆203Dec 22, 2021Updated 4 years ago
- 👩💻 Code for the ACL paper "Detecting Edit Failures in LLMs: An Improved Specificity Benchmark"☆20Jan 19, 2024Updated 2 years ago
- URL downloader supporting checkpointing and continuous checksumming.☆19Nov 29, 2023Updated 2 years ago
- ☆23Jan 27, 2025Updated last year
- Tools for understanding how transformer predictions are built layer-by-layer☆567Aug 7, 2025Updated 6 months ago
- An implementation of the Llama architecture, to instruct and delight☆21May 31, 2025Updated 8 months ago
- ☆51Jan 28, 2024Updated 2 years ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆20Oct 23, 2023Updated 2 years ago
- Mapping out the "memory" of neural nets with data attribution☆39Updated this week
- Utilities for PyTorch distributed☆25Feb 27, 2025Updated 11 months ago
- ☆23Jun 18, 2024Updated last year
- An experimental Solidity-based web framework☆24Jan 1, 2022Updated 4 years ago
- Train a neural chatbot to imitate your personality (or the personalities of your contacts) using your Facebook and Skype messaging histor…☆22Jul 21, 2019Updated 6 years ago
- Benchmarks of popular contract implementations in solidity☆105Jul 28, 2024Updated last year
- Minimum Bayes Risk Decoding for Hugging Face Transformers☆60Jun 3, 2024Updated last year
- A memory profiler for NVIDIA GPUs to explore memory inefficiencies in GPU-accelerated applications.☆27Oct 13, 2024Updated last year
- Experimental GPU language with meta-programming☆25Sep 6, 2024Updated last year
- train with kittens!☆63Oct 25, 2024Updated last year
- Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models)☆34Feb 27, 2025Updated 11 months ago
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python.☆25Aug 31, 2025Updated 5 months ago
- Code repository for the public reproduction of the language modelling experiments on "MatFormer: Nested Transformer for Elastic Inference…☆31Nov 14, 2023Updated 2 years ago
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods.☆32Jun 5, 2025Updated 8 months ago
- FlexAttention w/ FlashAttention3 Support☆27Oct 5, 2024Updated last year
- Foundry scripts to automate and keep track of deployments and proxy upgrades.☆77Jun 30, 2023Updated 2 years ago
- Find a optimized name for method.☆33Aug 27, 2022Updated 3 years ago
- ☆34Sep 10, 2024Updated last year