vgel / repengLinks

A library for making RepE control vectors

☆615

Alternatives and similar repositories for repeng

Users that are interested in repeng are comparing it to the libraries listed below

Sorting:

EGjoni / DRUGS
Stop messing around with finicky sampling parameters and just use DRµGS!
☆349Updated last year
NousResearch / Open-Reasoning-Tasks
A comprehensive repository of reasoning tasks for LLMs (and beyond)
☆447Updated 9 months ago
justinchiu / openlogprobs
Extract full next-token probabilities via language model APIs
☆247Updated last year
valine / NeuralFlow
Visualize the intermediate output of Mistral 7B
☆366Updated 5 months ago
carlini / yet-another-applied-llm-benchmark
A benchmark to evaluate language models on questions I've previously asked them to solve.
☆1,022Updated 2 months ago
SkunkworksAI / hydra-moe
☆415Updated last year
Mihaiii / llm_steer
Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…
☆240Updated 4 months ago
vec2text / vec2text
utilities for decoding deep representations (like sentence embeddings) back to text
☆844Updated last month
EleutherAI / sparsify
Sparsify transformers with SAEs and transcoders
☆584Updated last week
rgreenblatt / arc_draw_more_samples_pub
Draw more samples
☆192Updated last year
JD-P / minihf
MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user…
☆177Updated last week
cognitivecomputations / laserRMT
This is our own implementation of 'Layer Selective Rank Reduction'
☆239Updated last year
PaulPauls / llama3_interpretability_sae
A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full…
☆618Updated 3 months ago
FastEval / FastEval
Fast & more realistic evaluation of chat language models. Includes leaderboard.
☆187Updated last year
mlabonne / llm-autoeval
Automatically evaluate your LLMs in Google Colab
☆649Updated last year
migtissera / Sensei
Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI
☆221Updated last year
center-for-humans-and-machines / transformer-heads
Toolkit for attaching, training, saving and loading of new heads for transformer models
☆282Updated 4 months ago
persimmon-ai-labs / adept-inference
Inference code for Persimmon-8B
☆415Updated last year
alasdairforsythe / tokenmonster
Ungreedy subword tokenizer and vocabulary trainer for Python, Go & Javascript
☆588Updated last year
Leeroo-AI / mergoo
A library for easily merging multiple LLM experts, and efficiently train the merged LLM.
☆485Updated 10 months ago
EleutherAI / concept-erasure
Erasing concepts from neural representations with provable guarantees
☆230Updated 5 months ago
sam-paech / antislop-sampler
☆307Updated 3 months ago
abacaj / fine-tune-mistral
Fine-tune mistral-7B on 3090s, a100s, h100s
☆714Updated last year
FailSpy / abliterator
Simple Python library/structure to ablate features in LLMs which are supported by TransformerLens
☆484Updated last year
jondurbin / bagel
A bagel, with everything.
☆322Updated last year
pbelcak / UltraFastBERT
The repository for the code of the UltraFastBERT paper
☆516Updated last year
callummcdougall / sae_vis
Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research).
☆206Updated 7 months ago
TransluceAI / observatory
A toolkit for describing model features and intervening on those features to steer behavior.
☆191Updated 8 months ago
aidanmclaughlin / AidanBench
Aidan Bench attempts to measure <big_model_smell> in LLMs.
☆306Updated 3 weeks ago
xjdr-alt / entropix-local
smol models are fun too
☆93Updated 8 months ago