RoyRin / arxiv_connections
Interactively identify related authors on arXiv
☆14 Updated 2 years ago
Alternatives and similar repositories for arxiv_connections
Users who are interested in arxiv_connections are comparing it to the libraries listed below.
- Tools for studying developmental interpretability in neural networks. ☆105 Updated 3 months ago
- Mechanistic Interpretability for Transformer Models ☆53 Updated 3 years ago
- Neural Networks and the Chomsky Hierarchy ☆209 Updated last year
- 🧠 Starter templates for doing interpretability research ☆75 Updated 2 years ago
- Erasing concepts from neural representations with provable guarantees ☆236 Updated 8 months ago
- The Happy Faces Benchmark ☆15 Updated 2 years ago
- small language models training made easy ☆13 Updated 10 months ago
- Utilities for the HuggingFace transformers library ☆72 Updated 2 years ago
- A library for bridging Python and HTML/Javascript (via Svelte) for creating interactive visualizations ☆199 Updated 3 years ago
- ☆128 Updated last year
- ☆278 Updated last year
- Parameter-Free Optimizers for Pytorch ☆130 Updated last year
- A collection of meta-learning algorithms in Jax ☆23 Updated 3 years ago
- Code for reproducing the results from the paper Avoiding Side Effects in Complex Environments ☆12 Updated 4 years ago
- Emergent world representations: Exploring a sequence model trained on a synthetic task ☆191 Updated 2 years ago
- Python library for argument and configuration management ☆55 Updated 2 years ago
- Psych-GA.2207 Categories and Concepts ☆17 Updated last year
- Mechanistic Interpretability Visualizations using React ☆291 Updated 9 months ago
- This repo is built to facilitate the training and analysis of autoregressive transformers on maze-solving tasks. ☆31 Updated last year
- NeuroSurgeon is a package that enables researchers to uncover and manipulate subnetworks within models in Huggingface Transformers ☆41 Updated 8 months ago
- Redwood Research's transformer interpretability tools ☆14 Updated 3 years ago
- Official repository for CMU Machine Learning Department's 10721: "Philosophical Foundations of Machine Intelligence". ☆262 Updated 2 years ago
- Attribution-based Parameter Decomposition ☆31 Updated 4 months ago
- JAX Synergistic Memory Inspector ☆179 Updated last year
- Named tensors with first-class dimensions for PyTorch ☆331 Updated 2 years ago
- unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets" ☆79 Updated 3 years ago
- Interpreting how transformers simulate agents performing RL tasks ☆88 Updated last year
- Stochastic Parameter Decomposition ☆49 Updated this week
- Einsum with einops style variable names ☆17 Updated last year
- Codebase for Mechanistic Mode Connectivity ☆14 Updated 2 years ago