nostalgebraist / transformer-utils
Utilities for the HuggingFace transformers library
☆67Updated 2 years ago
Alternatives and similar repositories for transformer-utils:
Users that are interested in transformer-utils are comparing it to the libraries listed below
- Mechanistic Interpretability for Transformer Models☆50Updated 2 years ago
- datasets from the paper "Towards Understanding Sycophancy in Language Models"☆73Updated last year
- ☆121Updated last year
- A library to create and manage configuration files, especially for machine learning projects.☆77Updated 3 years ago
- A library for efficient patching and automatic circuit discovery.☆62Updated last month
- A library for finding knowledge neurons in pretrained transformer models.☆155Updated 3 years ago
- ☆214Updated 6 months ago
- Sparse probing paper full code.☆55Updated last year
- ☆114Updated 7 months ago
- Steering vectors for transformer language models in Pytorch / Huggingface☆91Updated last month
- Mechanistic Interpretability Visualizations using React☆235Updated 3 months ago
- Erasing concepts from neural representations with provable guarantees☆226Updated 2 months ago
- Experiments with representation engineering☆11Updated last year
- ☆23Updated last month
- The accompanying code for "Transformer Feed-Forward Layers Are Key-Value Memories". Mor Geva, Roei Schuster, Jonathan Berant, and Omer Le…☆90Updated 3 years ago
- ☆66Updated 4 months ago
- How do transformer LMs encode relations?☆46Updated last year
- ☆89Updated last month
- Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models …☆165Updated last week
- ☆73Updated 11 months ago
- NeuroSurgeon is a package that enables researchers to uncover and manipulate subnetworks within models in Huggingface Transformers☆40Updated last month
- ☆38Updated last year
- ☆82Updated 7 months ago
- Keeping language models honest by directly eliciting knowledge encoded in their activations.☆197Updated last week
- ☆44Updated 4 months ago
- This repository contains the code used for the experiments in the paper "Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity…☆25Updated last year
- ☆26Updated 11 months ago
- ☆54Updated last year
- The evaluation pipeline for the 2024 BabyLM Challenge.☆29Updated 4 months ago
- A framework for few-shot evaluation of autoregressive language models.☆103Updated last year