inseq-team / inseq
Interpretability for sequence generation models 🐛 🔍
☆426Updated 2 months ago
Alternatives and similar repositories for inseq
Users that are interested in inseq are comparing it to the libraries listed below
- Tools for understanding how transformer predictions are built layer-by-layer☆503Updated last year
- Stanford NLP Python library for understanding and improving PyTorch models via interventions☆764Updated last month
- What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets☆221Updated 7 months ago
- Repository for research in the field of Responsible NLP at Meta.☆200Updated last month
- The official code of LM-Debugger, an interactive tool for inspection and intervention in transformer-based language models.☆177Updated 3 years ago
- Materials for EACL2024 tutorial: Transformer-specific Interpretability☆57Updated last year
- Code for T-Few from "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning"☆452Updated last year
- Utilities for the HuggingFace transformers library☆68Updated 2 years ago
- ☆288Updated this week
- Erasing concepts from neural representations with provable guarantees☆230Updated 5 months ago
- This repository collects all relevant resources about interpretability in LLMs☆362Updated 8 months ago
- Package to compute Mauve, a similarity score between neural text and human text. Install with `pip install mauve-text`.☆293Updated last year
- Calculate perplexity on a text with pre-trained language models. Support MLM (eg. DeBERTa), recurrent LM (eg. GPT3), and encoder-decoder …☆160Updated 3 weeks ago
- Datasets collection and preprocessings framework for NLP extreme multitask learning☆184Updated this week
- Organize your experiments into discrete steps that can be cached and reused throughout the lifetime of your research project.☆563Updated last year
- Aligning AI With Shared Human Values (ICLR 2021)☆289Updated 2 years ago
- PAIR.withgoogle.com and friend's work on interpretability methods☆194Updated 2 weeks ago
- A python package for benchmarking interpretability techniques on Transformers.☆213Updated 9 months ago
- ☆231Updated 9 months ago
- Mechanistic Interpretability Visualizations using React☆260Updated 6 months ago
- A framework for few-shot evaluation of autoregressive language models.☆105Updated 2 years ago
- Reproduce results and replicate training of T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)☆463Updated 2 years ago
- ☆78Updated 4 months ago
- ☆273Updated last year
- ACL 2022: An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models.☆139Updated 6 months ago
- The nnsight package enables interpreting and manipulating the internals of deep learned models.☆605Updated this week
- StereoSet: Measuring stereotypical bias in pretrained language models☆184Updated 2 years ago
- Seminar on Large Language Models (COMP790-101 at UNC Chapel Hill, Fall 2022)☆310Updated 2 years ago
- A library for finding knowledge neurons in pretrained transformer models.☆158Updated 3 years ago
- ☆237Updated 3 months ago