KoyenaPal / future-lens

Code and Data Repo for the CoNLL Paper -- Future Lens: Anticipating Subsequent Tokens from a Single Hidden State

☆20

Alternatives and similar repositories for future-lens

Users who are interested in future-lens are comparing it to the libraries listed below.

LoryPack / LLM-LieDetector
Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions"
☆71 Updated last year
ApolloResearch / e2e_sae
Sparse Autoencoder Training Library
☆55 Updated 7 months ago
UFO-101 / auto-circuit
A library for efficient patching and automatic circuit discovery.
☆80 Updated 4 months ago
Nix07 / finetuning
This repository contains the code used for the experiments in the paper "Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity…
☆28 Updated last month
EleutherAI / elk-generalization
Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from e…
☆28 Updated last year
neelsjain / BYOD
The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models"
☆107 Updated 2 years ago
XiangLi1999 / AutoBencher
☆32 Updated last year
ckkissane / crosscoder-model-diff-replication
Open source replication of Anthropic's Crosscoders for Model Diffing
☆62 Updated last year
saprmarks / geometry-of-truth
☆95 Updated last year
EleutherAI / semantic-memorization
☆44 Updated last year
ruiqi-zhong / D5
The GitHub repo for Goal Driven Discovery of Distributional Differences via Language Descriptions
☆71 Updated 2 years ago
allenai / bff
☆38 Updated last year
adamkarvonen / SAE_BoardGameEval
☆23 Updated 10 months ago
hadasah / btm
☆76 Updated last year
guy-dar / embedding-space
☆56 Updated 2 years ago
mlfoundations / scaling
Language models scale reliably with over-training and on downstream tasks
☆100 Updated last year
meg-tong / sycophancy-eval
datasets from the paper "Towards Understanding Sycophancy in Language Models"
☆97 Updated 2 years ago
mcleish7 / gemstone-scaling-laws
Gemstones: A Model Suite for Multi-Faceted Scaling Laws (NeurIPS 2025)
☆30 Updated 2 months ago
hughbzhang / o1_inference_scaling_laws
Replicating O1 inference-time scaling laws
☆90 Updated last year
abhishekpanigrahi1996 / transformer_in_transformer
☆45 Updated 2 years ago
CarperAI / autocrit
A repository for transformer critique learning and generation
☆89 Updated last year
casmlab / NPHardEval
Repository for NPHardEval, a quantified-dynamic benchmark of LLMs
☆61 Updated last year
hamishivi / EasyLM
Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…
☆76 Updated last year
evandez / REMEDI
Inspecting and Editing Knowledge Representations in Language Models
☆119 Updated 2 years ago
montemac / activation_additions
Algebraic value editing in pretrained language models
☆66 Updated 2 years ago
kaistAI / factual-knowledge-acquisition
☆23 Updated 2 weeks ago
TristanThrush / perplexity-correlations
Simple and scalable tools for data-driven pretraining data selection.
☆29 Updated 5 months ago
google / belief-localization
This repository includes code for the paper "Does Localization Inform Editing? Surprising Differences in Where Knowledge Is Stored vs. Ca…
☆60 Updated 2 years ago
zeyuyun1 / TransformerVis
☆43 Updated 4 years ago
r-three / phatgoose
Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"
☆91 Updated last year