inseq-team / inseqLinks
Interpretability for sequence generation models π π
β447Updated last month
Alternatives and similar repositories for inseq
Users that are interested in inseq are comparing it to the libraries listed below
Sorting:
- Tools for understanding how transformer predictions are built layer-by-layerβ549Updated 3 months ago
- Stanford NLP Python library for understanding and improving PyTorch models via interventionsβ834Updated last month
- What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasetsβ224Updated last year
- β393Updated last week
- Repository for research in the field of Responsible NLP at Meta.β202Updated 6 months ago
- Materials for EACL2024 tutorial: Transformer-specific Interpretabilityβ60Updated last year
- Utilities for the HuggingFace transformers libraryβ72Updated 2 years ago
- Code for T-Few from "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning"β456Updated 2 years ago
- The official code of LM-Debugger, an interactive tool for inspection and intervention in transformer-based language models.β180Updated 3 years ago
- Package to compute Mauve, a similarity score between neural text and human text. Install with `pip install mauve-text`.β301Updated last year
- This repository collects all relevant resources about interpretability in LLMsβ385Updated last year
- A Python library that encapsulates various methods for neuron interpretation and analysis in Deep NLP models.β105Updated 2 years ago
- Mechanistic Interpretability Visualizations using Reactβ302Updated 11 months ago
- PAIR.withgoogle.com and friend's work on interpretability methodsβ214Updated this week
- Datasets collection and preprocessings framework for NLP extreme multitask learningβ189Updated 4 months ago
- Erasing concepts from neural representations with provable guaranteesβ239Updated 10 months ago
- β255Updated last year
- Aligning AI With Shared Human Values (ICLR 2021)β304Updated 2 years ago
- A framework for few-shot evaluation of autoregressive language models.β105Updated 2 years ago
- Mistral: A strong, northwesterly wind: Framework for transparent and accessible large-scale language model training, built with Hugging Fβ¦β577Updated 2 years ago
- β237Updated last year
- β283Updated last year
- Seminar on Large Language Models (COMP790-101 at UNC Chapel Hill, Fall 2022)β313Updated 3 years ago
- Sparse probing paper full code.β65Updated last year
- How do transformer LMs encode relations?β57Updated last year
- The nnsight package enables interpreting and manipulating the internals of deep learned models.β707Updated this week
- Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research).β228Updated 11 months ago
- PyTorch + HuggingFace code for RetoMaton: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022), including anβ¦β282Updated 3 years ago
- β83Updated 9 months ago
- Locating and editing factual associations in GPT (NeurIPS 2022)β698Updated last year