google-deepmind / mishax

☆138

Alternatives and similar repositories for mishax

Users who are interested in mishax are comparing it to the libraries listed below.

goodfire-ai / r1-interpretability
Open source interpretability artefacts for R1.
☆157 Updated 4 months ago
mcleish7 / arithmetic
Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024)
☆191 Updated last year
EleutherAI / delphi
Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models …
☆206 Updated this week
EleutherAI / nanoGPT-mup
The simplest, fastest repository for training/finetuning medium-sized GPTs.
☆152 Updated last month
anthropics / toy-models-of-superposition
Notebooks accompanying Anthropic's "Toy Models of Superposition" paper
☆129 Updated 2 years ago
Aleph-Alpha-Research / scaling
Scaling is a distributed training library and installable dependency designed to scale up neural networks, with a dedicated module for tr…
☆64 Updated 9 months ago
JoshEngels / MultiDimensionalFeatures
Code for reproducing our paper "Not All Language Model Features Are Linear"
☆77 Updated 8 months ago
callummcdougall / sae_vis
Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research).
☆211 Updated 8 months ago
kanishkg / stream-of-search
Repository for the paper Stream of Search: Learning to Search in Language
☆150 Updated 6 months ago
jonhue / activeft
PyTorch library for Active Fine-Tuning
☆89 Updated 6 months ago
METR / RE-Bench
☆96 Updated 3 months ago
OSU-NLP-Group / GrokkedTransformer
Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'
☆228 Updated last month
jxmorris12 / cde
code for training & evaluating Contextual Document Embedding models
☆197 Updated 3 months ago
ConsequentAI / fneval
Functional Benchmarks and the Reasoning Gap
☆88 Updated 10 months ago
casper-hansen / OpenCoconut
OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.
☆173 Updated 7 months ago
ScalingIntelligence / Archon
Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.
☆177 Updated 5 months ago
epfml / llm-baselines
nanoGPT-like codebase for LLM training
☆102 Updated 3 months ago
joshuacnf / Ctrl-G
☆89 Updated 7 months ago
TransluceAI / observatory
A toolkit for describing model features and intervening on those features to steer behavior.
☆197 Updated 9 months ago
ApolloResearch / e2e_sae
Sparse Autoencoder Training Library
☆54 Updated 3 months ago
justinchiu / openlogprobs
Extract full next-token probabilities via language model APIs
☆247 Updated last year
jbloomAus / SAEDashboard
☆66 Updated last week
tilde-research / sieve
Applying SAEs for fine-grained control
☆23 Updated 8 months ago
ckkissane / crosscoder-model-diff-replication
Open source replication of Anthropic's Crosscoders for Model Diffing
☆58 Updated 9 months ago
Alex-Gurung / ReasoningNCP
Official repo for Learning to Reason for Long-Form Story Generation
☆68 Updated 4 months ago
allenai / fm-cheatsheet
Website for hosting the Open Foundation Models Cheat Sheet.
☆267 Updated 3 months ago
SalesforceAIResearch / LaTRO
☆120 Updated 6 months ago
OpenEvaByte / evabyte
EvaByte: Efficient Byte-level Language Models at Scale
☆107 Updated 4 months ago
amack315 / unsupervised-steering-vectors
☆32 Updated last year
KihoPark / LLM_Categorical_Hierarchical_Representations
☆105 Updated 6 months ago