phys-ai / concept_graphsLinks
☆14Updated last year
Alternatives and similar repositories for concept_graphs
Users that are interested in concept_graphs are comparing it to the libraries listed below
Sorting:
- ☆56Updated 11 months ago
- ☆112Updated 10 months ago
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".☆108Updated 2 years ago
- ☆58Updated last year
- ☆138Updated this week
- Code for my NeurIPS 2024 ATTRIB paper titled "Attribution Patching Outperforms Automated Circuit Discovery"☆44Updated last year
- Official repo for Detecting, Explaining, and Mitigating Memorization in Diffusion Models (ICLR 2024)☆77Updated last year
- Listing some diffusion papers in NLP domain I have read, text generation is main, table will continue to be updated.☆70Updated 9 months ago
- ViT Prisma is a mechanistic interpretability library for Vision and Video Transformers (ViTs).☆331Updated 5 months ago
- Reproduce ICLR2025 Energy-Based Diffusion Language Models for Text Generation☆51Updated 5 months ago
- Concept Learning Dynamics☆16Updated last year
- Official Code Repository for the paper "Continuous Diffusion Model for Language Modeling" (NeurIPS 2025).☆54Updated 3 months ago
- Remasking Discrete Diffusion Models with Inference-Time Scaling☆63Updated 9 months ago
- Modified to support crosscoder training.☆25Updated 2 months ago
- Sparse Autoencoders for Stable Diffusion XL models.☆79Updated last month
- PaCE: Parsimonious Concept Engineering for Large Language Models (NeurIPS 2024)☆41Updated last year
- Improving Steering Vectors by Targeting Sparse Autoencoder Features☆25Updated last year
- Intriguing Properties of Data Attribution on Diffusion Models (ICLR 2024)☆35Updated last year
- A general framework for inference-time scaling and steering of diffusion models with arbitrary rewards.☆202Updated 6 months ago
- Official Jax Implementation of MD4 Masked Diffusion Models☆149Updated 10 months ago
- ☆89Updated 9 months ago
- Tools for optimizing steering vectors in LLMs.☆15Updated 8 months ago
- Official PyTorch implementation and models for paper "Diffusion Beats Autoregressive in Data-Constrained Settings". We find diffusion mod…☆117Updated 2 months ago
- Function Vectors in Large Language Models (ICLR 2024)☆188Updated 8 months ago
- Sparse Autoencoder Training Library☆56Updated 7 months ago
- ☆373Updated 4 months ago
- ☆223Updated last year
- Influence Functions with (Eigenvalue-corrected) Kronecker-Factored Approximate Curvature☆175Updated 6 months ago
- Multi-Layer Sparse Autoencoders (ICLR 2025)☆27Updated 10 months ago
- [ICML 2025] Unlearning in Diffusion Models using Sparse Autoencoders☆49Updated 2 months ago