arnab-api / romba
Applies ROME and MEMIT on Mamba-S4 models
☆14 · Updated last year
Alternatives and similar repositories for romba
Users who are interested in romba are comparing it to the libraries listed below.
- ☆18 · Updated last month
- ☆13 · Updated 2 months ago
- ☆20 · Updated last year
- Confidence Regulation Neurons in Language Models (NeurIPS 2024) ☆11 · Updated 7 months ago
- Self-Supervised Alignment with Mutual Information ☆21 · Updated last year
- Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective" ☆32 · Updated last year
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts" ☆15 · Updated 5 months ago
- PyTorch codes for the paper "An Empirical Study of Multimodal Model Merging" ☆37 · Updated last year
- ☆19 · Updated 5 months ago
- PaCE: Parsimonious Concept Engineering for Large Language Models (NeurIPS 2024) ☆39 · Updated 9 months ago
- [ICLR 2025] Code&Data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization" ☆13 · Updated last year
- ☆27 · Updated last year
- Official implementation of Bootstrapping Language Models via DPO Implicit Rewards ☆44 · Updated 4 months ago
- Official Repo for FoodieQA paper (EMNLP 2024) ☆16 · Updated 2 months ago
- Lightweight Adapting for Black-Box Large Language Models ☆23 · Updated last year
- ☆26 · Updated 4 months ago
- Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features" ☆14 · Updated 5 months ago
- ☆16 · Updated last year
- ☆22 · Updated last year
- ☆20 · Updated 11 months ago
- Official implementation of Vector-ICL: In-context Learning with Continuous Vector Representations (ICLR 2025) ☆20 · Updated 3 months ago
- [EMNLP 2024] Official implementation of "Hierarchical Deconstruction of LLM Reasoning: A Graph-Based Framework for Analyzing Knowledge Ut… ☆22 · Updated 8 months ago
- [NeurIPS 2024 Spotlight] Code and data for the paper "Finding Transformer Circuits with Edge Pruning". ☆59 · Updated 2 weeks ago
- ☆15 · Updated last year
- ☆45 · Updated 3 months ago
- Exploration of automated dataset selection approaches at large scales. ☆47 · Updated 5 months ago
- ☆20 · Updated last year
- [ACL'25] Mosaic-IT: Cost-Free Compositional Data Synthesis for Instruction Tuning ☆19 · Updated 2 months ago
- [NAACL 2025] A Closer Look into Mixture-of-Experts in Large Language Models ☆54 · Updated 6 months ago
- Source code for EMNLP2022 paper "Finding Skill Neurons in Pre-trained Transformers via Prompt Tuning". ☆18 · Updated 2 years ago