pentagonalize / Transformer-Cookbook
☆12Updated 2 months ago
Alternatives and similar repositories for Transformer-Cookbook:
Users that are interested in Transformer-Cookbook are comparing it to the libraries listed below
- ☆11Updated 5 years ago
- Minimum Description Length Recurrent Neural Networks☆18Updated last year
- Minimum Description Length Recurrent Neural Networks (MDLRNNs) in PyTorch☆21Updated 2 weeks ago
- A library for bridging Python and HTML/Javascript (via Svelte) for creating interactive visualizations☆14Updated 11 months ago
- Simple-to-use scoring function for arbitrarily tokenized texts.☆39Updated last month
- Mechanistic Interpretability for Transformer Models☆50Updated 2 years ago
- ☆34Updated last year
- ☆89Updated last month
- ☆61Updated 2 years ago
- Collection of academic works in natural language processing, computational linguistics, and computational cognitive science that study th…☆18Updated last year
- ☆12Updated 2 weeks ago
- ☆19Updated 9 months ago
- Minimum Description Length probing for neural network representations☆19Updated 2 months ago
- How do transformer LMs encode relations?☆46Updated last year
- Silly twitter torch implementations.☆46Updated 2 years ago
- Official code for paper LIME: Learning Inductive Bias for Primitives of Mathematical Reasoning☆28Updated 3 years ago
- CausalGym: Benchmarking causal interpretability methods on linguistic tasks☆41Updated 4 months ago
- Bayesian Assessment of Hypotheses☆24Updated last year
- ☆19Updated last year
- ☆14Updated last year
- Utilities for the HuggingFace transformers library☆67Updated 2 years ago
- [NAACL 2022] GlobEnc: Quantifying Global Token Attribution by Incorporating the Whole Encoder Layer in Transformers☆21Updated last year
- open source interpretability platform 🧠☆45Updated this week
- Arrakis is a library to conduct, track and visualize mechanistic interpretability experiments.☆26Updated 3 weeks ago
- Highlight errors in a bib file: missing URLs, capitalization protection, etc☆27Updated 10 months ago
- A highly sophisticated sequence-to-sequence model for code generation☆40Updated 3 years ago
- Minimum Bayes Risk Decoding for Hugging Face Transformers☆57Updated 10 months ago
- The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We s…☆67Updated 2 years ago
- LTG-Bert☆31Updated last year
- ☆10Updated 2 years ago