apartresearch / Neuron2Graph
Tools for exploring Transformer neuron behaviour, including input pruning and diversification.
☆20 Updated 2 years ago
Alternatives and similar repositories for Neuron2Graph
Users that are interested in Neuron2Graph are comparing it to the libraries listed below
- Interpretable text embeddings by asking LLMs yes/no questions (NeurIPS 2024) ☆45 Updated 11 months ago
- Code for 'Emergent Analogical Reasoning in Large Language Models' ☆51 Updated last year
- ☆27 Updated 2 years ago
- Materials for ConceptARC paper ☆102 Updated 11 months ago
- Emergent world representations: Exploring a sequence model trained on a synthetic task ☆191 Updated 2 years ago
- Notebooks accompanying Anthropic's "Toy Models of Superposition" paper ☆129 Updated 3 years ago
- ☆137 Updated 2 months ago
- Experiments and code to generate the GINC small-scale in-context learning dataset from "An Explanation for In-context Learning as Implici… ☆108 Updated last year
- ☆211 Updated 2 years ago
- Extracting spatial and temporal world models from LLMs ☆257 Updated 2 years ago
- ☆120 Updated last year
- ☆69 Updated 3 years ago
- ☆109 Updated 8 months ago
- We develop benchmarks and analysis tools to evaluate the causal reasoning abilities of LLMs. ☆128 Updated last year
- datasets from the paper "Towards Understanding Sycophancy in Language Models" ☆94 Updated last year
- A virtual environment for developing and evaluating automated scientific discovery agents. ☆188 Updated 7 months ago
- The GitHub repo for Goal Driven Discovery of Distributional Differences via Language Descriptions ☆71 Updated 2 years ago
- Code Release for "Broken Neural Scaling Laws" (BNSL) paper ☆59 Updated last year
- Code for Language-Interfaced FineTuning for Non-Language Machine Learning Tasks. ☆130 Updated 11 months ago
- Code and Data Repo for the CoNLL Paper -- Future Lens: Anticipating Subsequent Tokens from a Single Hidden State ☆19 Updated last year
- Keeping language models honest by directly eliciting knowledge encoded in their activations. ☆208 Updated last week
- [NeurIPS 2023] Learning Transformer Programs ☆162 Updated last year
- ☆77 Updated last year
- ⚓️ Repository for the "Thought Anchors: Which LLM Reasoning Steps Matter?" paper. ☆86 Updated last month
- A repository for transformer critique learning and generation ☆88 Updated last year
- Python library which enables complex compositions of language models such as scratchpads, chain of thought, tool use, selection-inference… ☆213 Updated 4 months ago
- Extending Conformal Prediction to LLMs ☆68 Updated last year
- Dataset and evaluation suite enabling LLM instruction-following for scientific literature understanding. ☆42 Updated 7 months ago
- ☆69 Updated last year
- Inspecting and Editing Knowledge Representations in Language Models ☆117 Updated 2 years ago