dmis-lab / Monet
[ICLR 2025] Monet: Mixture of Monosemantic Experts for Transformers
☆68 Updated 4 months ago
Alternatives and similar repositories for Monet
Users who are interested in Monet are comparing it to the repositories listed below.
- [EMNLP Findings 2024 & ACL 2024 NLRSE Oral] Enhancing Mathematical Reasonin… ☆51 Updated last year
- Stanford NLP Python library for benchmarking the utility of LLM interpretability methods ☆89 Updated last week
- Open source replication of Anthropic's Crosscoders for Model Diffing ☆55 Updated 7 months ago
- ☆26 Updated last year
- Function Vectors in Large Language Models (ICLR 2024) ☆167 Updated last month
- This repository contains the code used for the experiments in the paper "Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity… ☆25 Updated last year
- ☆52 Updated last year
- Sparse Autoencoder Training Library ☆50 Updated last month
- [NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs ☆87 Updated 6 months ago
- Multi-Layer Sparse Autoencoders (ICLR 2025) ☆23 Updated 3 months ago
- ☆79 Updated 9 months ago
- Language models scale reliably with over-training and on downstream tasks ☆97 Updated last year
- ☆50 Updated 2 months ago
- Code for reproducing our paper "Not All Language Model Features Are Linear" ☆75 Updated 6 months ago
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model ☆43 Updated last year
- PyTorch implementation for "Compressed Context Memory For Online Language Model Interaction" (ICLR'24) ☆60 Updated last year
- Official repository of "LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging" ☆26 Updated 7 months ago
- ☆43 Updated 6 months ago
- Official implementation of "BERTs are Generative In-Context Learners" ☆28 Updated 2 months ago
- PaCE: Parsimonious Concept Engineering for Large Language Models (NeurIPS 2024) ☆35 Updated 6 months ago
- Universal Neurons in GPT2 Language Models ☆29 Updated last year
- ☆61 Updated 2 months ago
- ☆94 Updated last year
- ☆179 Updated last year
- [NeurIPS 2024 Spotlight] Code and data for the paper "Finding Transformer Circuits with Edge Pruning". ☆56 Updated 2 months ago
- [ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision ☆89 Updated 7 months ago
- PyTorch library for Active Fine-Tuning ☆77 Updated 3 months ago
- ☆97 Updated 11 months ago
- A library for efficient patching and automatic circuit discovery. ☆65 Updated last month
- Unofficial Implementation of Chain-of-Thought Reasoning Without Prompting ☆32 Updated last year