Official repo for the paper "Bilinear MLPs enable weight-based mechanistic interpretability".
☆28Aug 2, 2025Updated 7 months ago
Alternatives and similar repositories for bilinear-decomposition
Users that are interested in bilinear-decomposition are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Paper list for the paper "Authorship Attribution in the Era of Large Language Models: Problems, Methodologies, and Challenges (SIGKDD Exp…☆18Mar 17, 2026Updated last week
- code for EMNLP 2024 paper: Neuron-Level Knowledge Attribution in Large Language Models☆52Nov 17, 2024Updated last year
- [NeurIPS'24 Oral] Official repository for the paper "Scale Equivariant Graph Metanetworks"☆24Nov 24, 2024Updated last year
- Official codebase for "Analyzing the Generalization and Reliability of Steering Vectors"☆20Dec 14, 2024Updated last year
- ☆65Jan 13, 2022Updated 4 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- ☆13Feb 2, 2023Updated 3 years ago
- A summary of must-read papers for Neural Question Generation (NQG)☆14Nov 14, 2020Updated 5 years ago
- Landing page for MIB: A Mechanistic Interpretability Benchmark☆24Aug 15, 2025Updated 7 months ago
- ☆16Apr 14, 2021Updated 4 years ago
- DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models (ICLR 2024)☆81Oct 3, 2024Updated last year
- Counterfactual Evaluation and Learning for Interactive Systems: Foundations, Implementations, and Recent Advances☆12Aug 14, 2022Updated 3 years ago
- ☆19Mar 25, 2025Updated last year
- Influence Analysis and Estimation - Survey, Papers, and Taxonomy☆87Feb 27, 2024Updated 2 years ago
- Explaining ML models using LLMs☆24Oct 21, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code for reproducing our paper "Not All Language Model Features Are Linear"☆84Nov 27, 2024Updated last year
- Official PyTorch implementation of "A Unified Framework for U-Net Design and Analysis" (NeurIPS 2023).☆20Sep 11, 2023Updated 2 years ago
- ☆11Jun 20, 2023Updated 2 years ago
- Sparse Autoencoder Training Library☆55May 1, 2025Updated 10 months ago
- Docker compose for starting local OpenML instances☆11Jan 13, 2023Updated 3 years ago
- WeGeFT: Weight‑Generative Fine‑Tuning for Multi‑Faceted Efficient Adaptation of Large Models☆22Jul 10, 2025Updated 8 months ago
- Official Code for our paper: "Language Models Learn to Mislead Humans via RLHF""☆19Oct 11, 2024Updated last year
- ☆15Aug 3, 2021Updated 4 years ago
- A simple PyTorch implementation of influence functions.☆92Jun 17, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Get the css from a domain and timestamp via the wayback machine☆18Aug 17, 2017Updated 8 years ago
- High-performance tokenized language data-loader for Python C++ extension☆14Jul 22, 2024Updated last year
- ☆13Aug 14, 2022Updated 3 years ago
- source code of (quasi-)Givens Orthogonal Fine Tuning integrated to peft lib☆17Mar 13, 2025Updated last year
- Interlink macvim & skim for an integrated LaTeX DE☆17Mar 14, 2016Updated 10 years ago
- Project exploring 3D volumetric rendering of NEXRAD radar data.☆11Oct 23, 2023Updated 2 years ago
- [CVPR'25] AIM-Fair: Advancing Algorithmic Fairness via Selectively Fine-Tuning Biased Models with Contextual Synthetic Data☆17Mar 27, 2025Updated last year
- Unofficial implementation of paper : Exploring the Space of Key-Value-Query Models with Intention☆12May 24, 2023Updated 2 years ago
- [NAACL 2025] A Closer Look into Mixture-of-Experts in Large Language Models☆61Feb 7, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ✒️ A gallery of experiments with Scalable Vector Graphics (SVG) and interactive visualizations.☆13Jan 6, 2023Updated 3 years ago
- Hypercorn is an ASGI and WSGI Server based on Hyper libraries and inspired by Gunicorn.☆14Jan 12, 2026Updated 2 months ago
- Class of data structures that can be unfolded.☆22Jan 6, 2026Updated 2 months ago
- ☆15Feb 25, 2018Updated 8 years ago
- Code for paper "Search Methods for Sufficient, Socially-Aligned Feature Importance Explanations with In-Distribution Counterfactuals"☆18Oct 17, 2022Updated 3 years ago
- [ICLR 2025] Monet: Mixture of Monosemantic Experts for Transformers☆76Jun 23, 2025Updated 9 months ago
- [ICML 2025] Official repository for paper "OR-Bench: An Over-Refusal Benchmark for Large Language Models"☆25Mar 4, 2025Updated last year