inseq-team / inseq
Interpretability for sequence generation models 🐛 🔍
☆419 Updated last month
Alternatives and similar repositories for inseq
Users who are interested in inseq are comparing it to the libraries listed below.
- Repository for research in the field of Responsible NLP at Meta. ☆200 Updated 2 weeks ago
- The official code of LM-Debugger, an interactive tool for inspection and intervention in transformer-based language models. ☆177 Updated 3 years ago
- Tools for understanding how transformer predictions are built layer-by-layer ☆493 Updated 11 months ago
- Materials for EACL2024 tutorial: Transformer-specific Interpretability ☆54 Updated last year
- What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets ☆221 Updated 6 months ago
- Dataset collection and preprocessing framework for NLP extreme multitask learning ☆182 Updated 4 months ago
- A Python package for benchmarking interpretability techniques on Transformers. ☆211 Updated 8 months ago
- PAIR.withgoogle.com and friends' work on interpretability methods ☆189 Updated last month
- Mechanistic Interpretability Visualizations using React ☆251 Updated 5 months ago
- Stanford NLP Python library for understanding and improving PyTorch models via interventions ☆745 Updated this week
- ☆222 Updated 8 months ago
- A Python library that encapsulates various methods for neuron interpretation and analysis in Deep NLP models. ☆102 Updated last year
- ☆209 Updated last year
- Calculate perplexity on a text with pre-trained language models. Supports MLM (e.g., DeBERTa), recurrent LM (e.g., GPT3), and encoder-decoder … ☆156 Updated 8 months ago
- Tools for checking ACL paper submissions ☆725 Updated 2 weeks ago
- ☆65 Updated last year
- MEND: Fast Model Editing at Scale ☆245 Updated last year
- BARTScore: Evaluating Generated Text as Text Generation ☆350 Updated 2 years ago
- This repository collects all relevant resources about interpretability in LLMs ☆353 Updated 7 months ago
- Erasing concepts from neural representations with provable guarantees ☆227 Updated 4 months ago
- Aligning AI With Shared Human Values (ICLR 2021) ☆287 Updated 2 years ago
- Seminar on Large Language Models (COMP790-101 at UNC Chapel Hill, Fall 2022) ☆310 Updated 2 years ago
- Utilities for the HuggingFace transformers library ☆68 Updated 2 years ago
- Model explainability that works seamlessly with 🤗 transformers. Explain your transformers model in just 2 lines of code. ☆1,346 Updated last year
- ACL 2022: An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models. ☆136 Updated 5 months ago
- Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research). ☆200 Updated 5 months ago
- ☆269 Updated last week
- Code for T-Few from "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning" ☆451 Updated last year
- Sparse Autoencoder for Mechanistic Interpretability ☆248 Updated 10 months ago
- NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations ☆786 Updated last year