AugustasMacijauskas / trailtoken
An application that visualises LLM tokenizers
☆10Updated 8 months ago
Alternatives and similar repositories for trailtoken
Users that are interested in trailtoken are comparing it to the libraries listed below
Sorting:
- A Python package to compute HONEST, a score to measure hurtful sentence completions in language models. Published at NAACL 2021.☆21Updated last month
- Erasing concepts from neural representations with provable guarantees☆228Updated 3 months ago
- Einsum with einops style variable names☆16Updated last year
- Seminar on Large Language Models (COMP790-101 at UNC Chapel Hill, Fall 2022)☆310Updated 2 years ago
- A collection of Italian benchmarks for LLM evaluation☆30Updated last month
- Benchmarks for the Evaluation of LLM Supervision☆32Updated last month
- Mechanistic Interpretability Visualizations using React☆245Updated 4 months ago
- ☆37Updated 2 years ago
- Code to reproduce data for Bias in Bios☆46Updated last year
- Annotated corpus + evaluation metrics for text anonymisation☆55Updated last year
- An interpreter for RASP as described in the ICML 2021 paper "Thinking Like Transformers"☆312Updated 8 months ago
- Repository collecting resources and best practices to improve experimental rigour in deep learning research.☆27Updated 2 years ago
- The Happy Faces Benchmark☆15Updated last year
- An interactive exploration of Transformer programming.☆264Updated last year
- Complete set of English dialect transformation rules and evaluation code☆15Updated 11 months ago
- Fairness toolkit for pytorch, scikit learn and autogluon☆32Updated 5 months ago
- Adversarial Natural Language Inference Benchmark☆393Updated 3 years ago
- Transformer Grammars: Augmenting Transformer Language Models with Syntactic Inductive Biases at Scale, TACL (2022)☆125Updated 6 months ago
- ☆62Updated last year
- ☆269Updated 10 months ago
- git extension for {collaborative, communal, continual} model development☆213Updated 6 months ago
- ☆62Updated 2 years ago
- Silly twitter torch implementations.☆46Updated 2 years ago
- ☆89Updated 2 years ago
- Tools for studying developmental interpretability in neural networks.☆89Updated 3 months ago
- Generate synthetic labeled data for extremely low-resource languages using bilingual lexicons.☆15Updated 7 months ago
- ☆81Updated 10 months ago
- Machine Learning for Alignment Bootcamp☆25Updated last year
- PAIR.withgoogle.com and friend's work on interpretability methods☆188Updated 2 weeks ago
- Mechanistic Interpretability for Transformer Models☆50Updated 2 years ago