acsresearch / interlabLinks

☆22

Alternatives and similar repositories for interlab

Users that are interested in interlab are comparing it to the libraries listed below

Sorting:

moirage / alignment-research-dataset
A dataset of alignment research and code to reproduce it
☆78Updated 2 years ago
METR / vivaria
Vivaria is METR's tool for running evaluations and conducting agent elicitation research.
☆121Updated 2 weeks ago
oughtinc / primer
Factored Cognition Primer: How to write compositional language model programs
☆50Updated 2 years ago
google-deepmind / dangerous-capability-evaluations
☆62Updated 2 months ago
timaeus-research / devinterp
Tools for studying developmental interpretability in neural networks.
☆114Updated 5 months ago
MinhxLe / subliminal-learning
☆104Updated 3 months ago
mukobi / welfare-diplomacy
General-Sum variant of the game Diplomacy for evaluating AIs.
☆32Updated last year
METR / task-standard
METR Task Standard
☆168Updated 9 months ago
aypan17 / machiavelli
☆142Updated 4 months ago
METR / public-tasks
☆108Updated last week
poking-agents / modular-public
☆32Updated 5 months ago
neoneye / arc-notes
My writings about ARC (Abstraction and Reasoning Corpus)
☆86Updated 3 weeks ago
safety-research / safety-tooling
Inference API for many LLMs and other useful tools for empirical research
☆80Updated last week
TomFrederik / unseal
Mechanistic Interpretability for Transformer Models
☆53Updated 3 years ago
LRudL / evalugator
(Model-written) LLM evals library
☆18Updated 11 months ago
UKGovernmentBEIS / control-arena
ControlArena is a collection of settings, model organisms and protocols - for running control experiments.
☆129Updated this week
victorvikram / ConceptARC
Materials for ConceptARC paper
☆108Updated last year
safety-research / safety-examples
☆19Updated 2 weeks ago
EleutherAI / elk
Keeping language models honest by directly eliciting knowledge encoded in their activations.
☆214Updated last week
UKGovernmentBEIS / hibayes
☆35Updated last month
oughtinc / patchwork
Command-line recursive question-answering with immutable contexts and explicit data store
☆26Updated 7 years ago
leap-laboratories / PIZZA
An attribution library for LLMs
☆46Updated last year
alan-turing-institute / prompto
An open source library for asynchronous querying of LLM endpoints
☆34Updated 4 months ago
adamimos / epsilon-transformers
epsilon machines and transformers!
☆33Updated 4 months ago
haizelabs / sphynx
Sphynx Hallucination Induction
☆53Updated 9 months ago
anthropics / evals
☆313Updated last year
KihoPark / LLM_Categorical_Hierarchical_Representations
☆111Updated 9 months ago
EffiSciencesResearch / ML4G
Machine Learning for Alignment Bootcamp
☆25Updated last year
redwoodresearch / mlab
Machine Learning for Alignment Bootcamp
☆81Updated 3 years ago
safety-research / false-facts
☆25Updated 4 months ago