EleutherAI / bergsonLinks
Mapping out the "memory" of neural nets with data attribution
☆37Updated this week
Alternatives and similar repositories for bergson
Users that are interested in bergson are comparing it to the libraries listed below
Sorting:
- A toolkit implementing advanced methods to transfer models and model knowledge across tokenizers.☆59Updated 6 months ago
- ☆150Updated 4 months ago
- Engine for collecting, uploading, and downloading model activations☆24Updated 9 months ago
- Attribution-based Parameter Decomposition☆33Updated 7 months ago
- Open source replication of Anthropic's Crosscoders for Model Diffing☆63Updated last year
- code for training & evaluating Contextual Document Embedding models☆202Updated 7 months ago
- Arrakis is a library to conduct, track and visualize mechanistic interpretability experiments.☆31Updated 8 months ago
- An introduction to LLM Sampling☆79Updated last year
- nanoGPT-like codebase for LLM training☆114Updated 2 months ago
- Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models …☆235Updated 2 weeks ago
- Storing long contexts in tiny caches with self-study☆229Updated last month
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆181Updated 6 months ago
- Code for Zero-Shot Tokenizer Transfer☆142Updated 11 months ago
- [NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs☆94Updated last year
- Flexible library for merging large language models (LLMs) via evolutionary optimization (ACL 2025 Demo).☆96Updated 5 months ago
- Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research).☆236Updated last year
- Efficiently computing & storing token n-grams from large corpora☆26Updated last year
- NanoGPT-speedrunning for the poor T4 enjoyers☆73Updated 8 months ago
- Applying SAEs for fine-grained control☆25Updated last year
- ☆90Updated 6 months ago
- ☆58Updated last year
- Scaling is a distributed training library and installable dependency designed to scale up neural networks, with a dedicated module for tr…☆66Updated last month
- [ICLR 2025] Monet: Mixture of Monosemantic Experts for Transformers☆74Updated 6 months ago
- Extract full next-token probabilities via language model APIs☆248Updated last year
- Open source interpretability artefacts for R1.☆165Updated 8 months ago
- Understand and test language model architectures on synthetic tasks.☆248Updated 3 months ago
- ☆83Updated 3 weeks ago
- PyTorch library for Active Fine-Tuning☆96Updated 3 months ago
- PageRank for LLMs☆51Updated 4 months ago
- ☆32Updated 11 months ago