yizhe-ang / interactive-transformerLinks

A visual interface for understanding and interpreting Transformers

☆77

Alternatives and similar repositories for interactive-transformer

Users that are interested in interactive-transformer are comparing it to the libraries listed below

Sorting:

xjdr-alt / simple_transformer
Simple Transformer in Jax
☆138Updated last year
SpellcraftAI / turing
Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.
☆58Updated last year
MF-FOOM / wikivec2text
Simple embedding -> text model trained on a small subset of Wikipedia sentences.
☆156Updated 2 years ago
thesephist / spectre
Sparse autoencoders for Contra text embedding models
☆25Updated last year
srush / GPTWorld
A puzzle to learn about prompting
☆132Updated 2 years ago
ishan0102 / rsrch.space
Stream of my favorite papers and links
☆42Updated 4 months ago
notarussianteenager / srf-attention
Simplex Random Feature attention, in PyTorch
☆74Updated last year
teknium1 / transformers-gptq-quant
☆47Updated last year
google-deepmind / mishax
☆136Updated 4 months ago
neoneye / ARC-Interactive-History-Dataset
The history files when recording human interaction while solving ARC tasks
☆114Updated last week
tensoic / Cerule
Cerule - A Tiny Mighty Vision Model
☆66Updated 11 months ago
vithursant / nanoGPT_mlx
Port of Andrej Karpathy's nanoGPT to Apple MLX framework.
☆111Updated last year
PrimeIntellect-ai / pi-quant
SIMD quantization kernels
☆78Updated this week
LeonGuertler / UnstableBaselines
☆96Updated last week
recmo / cria
Tiny inference-only implementation of LLaMA
☆93Updated last year
JoeLi12345 / nGPT
an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)
☆103Updated 5 months ago
Nearcyan / papers.day
papers.day
☆91Updated last year
JD-P / RetroInstruct
Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.
☆32Updated 5 months ago
srush / raspy
An interactive exploration of Transformer programming.
☆267Updated last year
RobertRiachi / nanoPALM
☆143Updated 2 years ago
yacineMTB / just-large-models
Just large language models. Hackable, with as little abstraction as possible. Done for my own purposes, feel free to rip.
☆44Updated last year
teknium1 / RawTransform
A repository of prompts and Python scripts for intelligent transformation of raw text into diverse formats.
☆30Updated 2 years ago
JD-P / minihf
MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user…
☆177Updated 3 weeks ago
xjdr-alt / llmri
look how they massacred my boy
☆63Updated 9 months ago
casper-hansen / OpenCoconut
OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.
☆173Updated 6 months ago
joshuacnf / Ctrl-G
☆88Updated 7 months ago
naklecha / factorio-automation
i will automate factorio
☆108Updated last year
goodfire-ai / r1-interpretability
Open source interpretability artefacts for R1.
☆157Updated 3 months ago
AblateIt / finetune-study
Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.
☆82Updated last year
N8python / n8loom
A tree-based prefix cache library that allows rapid creation of looms: hierarchal branching pathways of LLM generations.
☆72Updated 5 months ago