VectorInstitute / kaleidoscope-sdk
A user toolkit for analyzing and interfacing with Large Language Models (LLMs)
☆24 · Updated 4 months ago
Related projects
Alternatives and complementary repositories for kaleidoscope-sdk
- LLM finetuning in resource-constrained environments. ☆41 · Updated 4 months ago
- ☆12 · Updated 8 months ago
- The nnsight package enables interpreting and manipulating the internals of deep learned models. ☆400 · Updated this week
- A user toolkit for analyzing and interfacing with Large Language Models (LLMs) ☆21 · Updated 2 months ago
- Efficient LLM inference on Slurm clusters using vLLM. ☆38 · Updated last week
- Mechanistic Interpretability Visualizations using React ☆196 · Updated 4 months ago
- This repository collects all relevant resources about interpretability in LLMs ☆283 · Updated last week
- Tools for understanding how transformer predictions are built layer-by-layer ☆429 · Updated 5 months ago
- Influence Functions with (Eigenvalue-corrected) Kronecker-Factored Approximate Curvature ☆100 · Updated 3 months ago
- Training Sparse Autoencoders on Language Models ☆453 · Updated this week
- ☆108 · Updated last year
- Using sparse coding to find distributed representations used by neural networks. ☆181 · Updated last year
- ☆187 · Updated last month
- A fast, effective data attribution method for neural networks in PyTorch ☆177 · Updated last month
- Interpretability for sequence generation models 🐛 🔍 ☆375 · Updated this week
- ☆141 · Updated 3 weeks ago
- ☆102 · Updated last month
- Sparse Autoencoder for Mechanistic Interpretability ☆188 · Updated 3 months ago
- Sparse probing paper full code. ☆50 · Updated 10 months ago
- ☆138 · Updated 4 months ago
- Conformal Language Modeling ☆22 · Updated 10 months ago
- Layer-Wise Relevance Propagation for Large Language Models and Vision Transformers [ICML 2024] ☆97 · Updated this week
- ☆168 · Updated 8 months ago
- NanoGPT-like codebase for LLM training ☆73 · Updated this week
- Repository for research in the field of Responsible NLP at Meta. ☆183 · Updated 2 months ago
- Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research). ☆158 · Updated last month
- DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models (ICLR 2024) ☆52 · Updated last month
- Stanford NLP Python Library for Understanding and Improving PyTorch Models via Interventions ☆633 · Updated last week
- ☆75 · Updated 9 months ago
- Code for T-Few from "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning" ☆430 · Updated last year