inseq-team / inseq
Interpretability for sequence generation models 🐛 🔍
☆401Updated 3 months ago
Alternatives and similar repositories for inseq:
Users that are interested in inseq are comparing it to the libraries listed below
- Tools for understanding how transformer predictions are built layer-by-layer☆475Updated 8 months ago
- This repository collects all relevant resources about interpretability in LLMs☆321Updated 3 months ago
- Datasets collection and preprocessings framework for NLP extreme multitask learning☆175Updated last month
- What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets☆206Updated 3 months ago
- Mechanistic Interpretability Visualizations using React☆232Updated 2 months ago
- Stanford NLP Python library for understanding and improving PyTorch models via interventions☆698Updated this week
- A python package for benchmarking interpretability techniques on Transformers.☆213Updated 4 months ago
- Code for T-Few from "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning"☆443Updated last year
- Repository for research in the field of Responsible NLP at Meta.☆194Updated 2 months ago
- Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research).☆182Updated 2 months ago
- Erasing concepts from neural representations with provable guarantees☆222Updated 3 weeks ago
- Tools for checking ACL paper submissions☆667Updated 4 months ago
- ☆203Updated 4 months ago
- ☆190Updated 11 months ago
- ☆220Updated last week
- PAIR.withgoogle.com and friend's work on interpretability methods☆167Updated last week
- ☆109Updated 6 months ago
- How do transformer LMs encode relations?☆46Updated 11 months ago
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: …☆328Updated last year
- Training Sparse Autoencoders on Language Models☆619Updated this week
- Using sparse coding to find distributed representations used by neural networks.☆213Updated last year
- The official code of LM-Debugger, an interactive tool for inspection and intervention in transformer-based language models.☆178Updated 2 years ago
- ☆262Updated 11 months ago
- ACL 2022: An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models.☆131Updated 2 months ago
- ☆65Updated last year
- Mass-editing thousands of facts into a transformer memory (ICLR 2023)☆462Updated last year
- ☆210Updated 2 weeks ago
- Multilingual Large Language Models Evaluation Benchmark☆117Updated 6 months ago
- A library for finding knowledge neurons in pretrained transformer models.☆154Updated 3 years ago
- Representation Engineering: A Top-Down Approach to AI Transparency☆789Updated 6 months ago