anthropics / attribution-graphs-frontend
https://transformer-circuits.pub/2025/attribution-graphs/methods.html
☆91 · Mar 27, 2025 · Updated 10 months ago
Alternatives and similar repositories for attribution-graphs-frontend
Users interested in attribution-graphs-frontend are comparing it to the repositories listed below:
- ☆17 · Dec 10, 2025 · Updated 2 months ago
- Display and customize Markdown text in SwiftUI · ☆33 · Jan 28, 2025 · Updated last year
- Experiments with representation engineering · ☆13 · Feb 28, 2024 · Updated last year
- Fast, correct Python JSON library supporting dataclasses, datetimes, and numpy · ☆37 · Updated this week
- A library for bridging Python and HTML/Javascript (via Svelte) for creating interactive visualizations · ☆203 · Dec 22, 2021 · Updated 4 years ago
- Training Sparse Autoencoders on Language Models · ☆1,201 · Updated this week
- Improving Steering Vectors by Targeting Sparse Autoencoder Features · ☆27 · Nov 20, 2024 · Updated last year
- This is the official repository for the "Towards Vision-Language Mechanistic Interpretability: A Causal Tracing Tool for BLIP" paper acce… · ☆25 · Apr 18, 2024 · Updated last year
- Contains random samples referenced in the paper "Sleeper Agents: Training Robustly Deceptive LLMs that Persist Through Safety Training". · ☆127 · Mar 9, 2024 · Updated last year
- A library for efficient patching and automatic circuit discovery. · ☆88 · Dec 31, 2025 · Updated last month
- MiniMax-Provider-Verifier offers a rigorous, vendor-agnostic way to verify whether third-party deployments of the Minimax M2 model are co… · ☆23 · Jan 15, 2026 · Updated 3 weeks ago
- Auditing agents for fine-tuning safety · ☆18 · Oct 21, 2025 · Updated 3 months ago
- ☆102 · Feb 4, 2026 · Updated last week
- A dataset of alignment research and code to reproduce it · ☆78 · Jun 22, 2023 · Updated 2 years ago
- Project exploring 3D volumetric rendering of NEXRAD radar data. · ☆11 · Oct 23, 2023 · Updated 2 years ago
- Code for my NeurIPS 2024 ATTRIB paper titled "Attribution Patching Outperforms Automated Circuit Discovery" · ☆47 · May 31, 2024 · Updated last year
- Sparsify transformers with SAEs and transcoders · ☆692 · Updated this week
- ☆17 · Aug 5, 2025 · Updated 6 months ago
- Trains small LMs. Designed for training on SimpleStories · ☆12 · Sep 15, 2025 · Updated 4 months ago
- Official frontend web application for Moltbook - The Social Network for AI Agents. Built with Next.js 14, TypeScript, Tailwind CSS featur… · ☆25 · Feb 1, 2026 · Updated last week
- Linear Relational Embeddings (LREs) and Linear Relational Concepts (LRCs) for LLMs in PyTorch · ☆10 · Aug 7, 2024 · Updated last year
- Tusk Drift Demo - Node.js Service · ☆58 · Jan 20, 2026 · Updated 3 weeks ago
- DragMesh: Interactive 3D Generation Made Easy · ☆20 · Dec 28, 2025 · Updated last month
- ☆44 · Nov 17, 2024 · Updated last year
- ☆48 · Jan 21, 2024 · Updated 2 years ago
- Open source interpretability platform 🧠 · ☆704 · Updated this week
- Finetune script for SDXL adapted from waifu-diffusion trainer · ☆11 · Aug 21, 2023 · Updated 2 years ago
- Website · ☆12 · Updated this week
- The AI that helps you achieve your goals · ☆11 · Feb 4, 2024 · Updated 2 years ago
- MishformerLens intends to be a drop-in replacement for TransformerLens that AST patches HuggingFace Transformers rather than implementing… · ☆10 · Oct 7, 2024 · Updated last year
- See github.com/understanding-search/maze-transformer · ☆10 · Dec 8, 2023 · Updated 2 years ago
- Customizable charts made with TikZ and LaTeX3 · ☆14 · Feb 11, 2023 · Updated 3 years ago
- ☆18 · Jan 8, 2026 · Updated last month
- When Visualizing is the First Step to Reasoning: MIRA, a Benchmark for Visual Chain-of-Thought · ☆26 · Nov 6, 2025 · Updated 3 months ago
- Generate stunning liquid glass effects like iOS 26. Customize Apple-style blur, transparency, and glow – copy the CSS & HTML instantly. · ☆15 · Jun 20, 2025 · Updated 7 months ago
- Repo containing documentation and explanation for CSET's harm taxonomy of incidents from AIID. · ☆18 · Jun 21, 2024 · Updated last year
- ✒️ A gallery of experiments with Scalable Vector Graphics (SVG) and interactive visualizations. · ☆13 · Jan 6, 2023 · Updated 3 years ago
- Jump to better conclusions: SCAN both left and right · ☆11 · Jan 24, 2019 · Updated 7 years ago
- ☆11 · Oct 22, 2025 · Updated 3 months ago