EleutherAI / bergsonLinks
Mapping out the "memory" of neural nets with data attribution
☆29Updated this week
Alternatives and similar repositories for bergson
Users that are interested in bergson are comparing it to the libraries listed below
Sorting:
- Attribution-based Parameter Decomposition☆31Updated 4 months ago
- ☆142Updated last month
- ☆36Updated last year
- Engine for collecting, uploading, and downloading model activations☆24Updated 6 months ago
- Open source replication of Anthropic's Crosscoders for Model Diffing☆59Updated last year
- ☆29Updated last year
- Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models …☆219Updated last week
- Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research).☆222Updated 10 months ago
- Efficiently computing & storing token n-grams from large corpora☆26Updated last year
- Sparse Autoencoder Training Library☆55Updated 5 months ago
- ☆128Updated 2 years ago
- Applying SAEs for fine-grained control☆24Updated 10 months ago
- Experiments for efforts to train a new and improved t5☆75Updated last year
- ☆74Updated 2 weeks ago
- Extract full next-token probabilities via language model APIs☆247Updated last year
- Notebooks accompanying Anthropic's "Toy Models of Superposition" paper☆129Updated 3 years ago
- ☆37Updated 8 months ago
- ☆54Updated 11 months ago
- PyTorch and NNsight implementation of AtP* (Kramar et al 2024, DeepMind)☆20Updated 9 months ago
- nanoGPT-like codebase for LLM training☆109Updated 5 months ago
- Code for reproducing our paper "Not All Language Model Features Are Linear"☆82Updated 11 months ago
- gzip Predicts Data-dependent Scaling Laws☆34Updated last year
- NanoGPT-speedrunning for the poor T4 enjoyers☆72Updated 6 months ago
- Scaling is a distributed training library and installable dependency designed to scale up neural networks, with a dedicated module for tr…☆64Updated 3 weeks ago
- A toolkit implementing advanced methods to transfer models and model knowledge across tokenizers.☆47Updated 3 months ago
- Utilities for the HuggingFace transformers library☆72Updated 2 years ago
- The NDIF server, which performs deep inference and serves nnsight requests remotely☆35Updated this week
- Flexible library for merging large language models (LLMs) via evolutionary optimization (ACL 2025 Demo).☆90Updated 2 months ago
- code for training & evaluating Contextual Document Embedding models☆199Updated 5 months ago
- Code for Zero-Shot Tokenizer Transfer☆138Updated 9 months ago