Official repo for the paper "Bilinear MLPs enable weight-based mechanistic interpretability".
☆30Aug 2, 2025Updated 8 months ago
Alternatives and similar repositories for bilinear-decomposition
Users that are interested in bilinear-decomposition are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for "A Principled Framework for Multi-View Contrastive Learning"☆20Jul 10, 2025Updated 9 months ago
- ☆33Feb 11, 2025Updated last year
- Code for "Tracing Knowledge in Language Models Back to the Training Data"☆39Dec 27, 2022Updated 3 years ago
- Paper list for the paper "Authorship Attribution in the Era of Large Language Models: Problems, Methodologies, and Challenges (SIGKDD Exp…☆19Apr 5, 2026Updated last week
- Official codebase for "Analyzing the Generalization and Reliability of Steering Vectors"☆20Dec 14, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Code for the experiments and websites of the paper "Same Task, Different Circuits"☆34Oct 21, 2025Updated 5 months ago
- ☆66Jan 13, 2022Updated 4 years ago
- ☆64Apr 25, 2020Updated 5 years ago
- ☆12Nov 15, 2022Updated 3 years ago
- The source of MNER-MI.☆19Dec 17, 2024Updated last year
- [ICASSP'24] Investigating Personalization Methods in Text to Music Generation☆46Mar 27, 2024Updated 2 years ago
- This repository represents a basic implementation of the paper "Riemannian Geometry of Deep Generative Models", along with the results on…☆12Oct 23, 2019Updated 6 years ago
- ☆15Oct 19, 2024Updated last year
- Discretized Integrated Gradients for Explaining Language Models (EMNLP 2021)☆27Mar 26, 2022Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Landing page for MIB: A Mechanistic Interpretability Benchmark☆24Aug 15, 2025Updated 8 months ago
- ☆16Apr 14, 2021Updated 5 years ago
- DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models (ICLR 2024)☆81Oct 3, 2024Updated last year
- Counterfactual Evaluation and Learning for Interactive Systems: Foundations, Implementations, and Recent Advances☆12Aug 14, 2022Updated 3 years ago
- ☆19Mar 25, 2025Updated last year
- Influence Analysis and Estimation - Survey, Papers, and Taxonomy☆87Feb 27, 2024Updated 2 years ago
- Explaining ML models using LLMs☆24Oct 21, 2024Updated last year
- Code for reproducing our paper "Not All Language Model Features Are Linear"☆84Nov 27, 2024Updated last year
- ☆29May 4, 2023Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Sparse Autoencoder Training Library☆55May 1, 2025Updated 11 months ago
- Docker compose for starting local OpenML instances☆11Jan 13, 2023Updated 3 years ago
- [NeurIPS25 Spotlight] Official Implementation for CBSA (Contract-and-Broadcast Self-Attention)☆36Apr 3, 2026Updated 2 weeks ago
- WeGeFT: Weight‑Generative Fine‑Tuning for Multi‑Faceted Efficient Adaptation of Large Models☆23Jul 10, 2025Updated 9 months ago
- ☆23Dec 31, 2020Updated 5 years ago
- ☆15Aug 3, 2021Updated 4 years ago
- Official Code for our paper: "Language Models Learn to Mislead Humans via RLHF""☆19Oct 11, 2024Updated last year
- YOLOv3 implemented in Julia with Knet deep learning framework.☆12Jan 15, 2021Updated 5 years ago
- ☆12Updated this week
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Papers about training data quality management for ML models.☆115Apr 1, 2026Updated 2 weeks ago
- Official code for "Algorithmic Capabilities of Random Transformers" (NeurIPS 2024)☆16Sep 28, 2024Updated last year
- Get the css from a domain and timestamp via the wayback machine☆18Aug 17, 2017Updated 8 years ago
- source code of (quasi-)Givens Orthogonal Fine Tuning integrated to peft lib☆17Mar 13, 2025Updated last year
- Interlink macvim & skim for an integrated LaTeX DE☆17Mar 14, 2016Updated 10 years ago
- A fun way to visualize influence in the game of Go.☆23Dec 13, 2018Updated 7 years ago
- ☆16Oct 4, 2024Updated last year