inseq-team / inseq
Interpretability for sequence generation models π π
β377Updated last week
Related projects β
Alternatives and complementary repositories for inseq
- Tools for understanding how transformer predictions are built layer-by-layerβ430Updated 5 months ago
- Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research).β157Updated last month
- What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasetsβ190Updated this week
- This repository collects all relevant resources about interpretability in LLMsβ288Updated 2 weeks ago
- Training Sparse Autoencoders on Language Modelsβ469Updated this week
- Stanford NLP Python Library for Understanding and Improving PyTorch Models via Interventionsβ641Updated 2 weeks ago
- The official code of LM-Debugger, an interactive tool for inspection and intervention in transformer-based language models.β172Updated 2 years ago
- β170Updated 8 months ago
- Mechanistic Interpretability Visualizations using Reactβ198Updated 4 months ago
- Erasing concepts from neural representations with provable guaranteesβ209Updated last week
- A python package for benchmarking interpretability techniques on Transformers.β212Updated last month
- The nnsight package enables interpreting and manipulating the internals of deep learned models.β402Updated this week
- β188Updated last month
- Datasets collection and preprocessings framework for NLP extreme multitask learningβ149Updated 4 months ago
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: β¦β323Updated last year
- A framework for few-shot evaluation of autoregressive language models.β101Updated last year
- PyTorch + HuggingFace code for RetoMaton: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022), including anβ¦β269Updated 2 years ago
- Using sparse coding to find distributed representations used by neural networks.β184Updated last year
- Inference-Time Intervention: Eliciting Truthful Answers from a Language Modelβ465Updated last month
- Repository for research in the field of Responsible NLP at Meta.β186Updated this week
- β182Updated this week
- Multilingual Large Language Models Evaluation Benchmarkβ107Updated 3 months ago
- Sparse autoencodersβ342Updated last week
- A Survey on Data Selection for Language Modelsβ182Updated last month
- Organize your experiments into discrete steps that can be cached and reused throughout the lifetime of your research project.β534Updated 5 months ago
- Extract full next-token probabilities via language model APIsβ229Updated 8 months ago
- Code for T-Few from "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning"β431Updated last year
- Utilities for the HuggingFace transformers libraryβ61Updated last year
- List of papers on hallucination detection in LLMs.β678Updated 2 weeks ago
- [EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627β461Updated last month