☆52Jan 1, 2024Updated 2 years ago
Alternatives and similar repositories for Skill-Localization-by-grafting
Users that are interested in Skill-Localization-by-grafting are comparing it to the libraries listed below
Sorting:
- ☆37Apr 16, 2021Updated 4 years ago
- [JMLR] Gradual Domain Adaptation: Theory and Algorithms☆11Jan 14, 2025Updated last year
- Google Research☆46Oct 29, 2022Updated 3 years ago
- Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]☆52Dec 22, 2025Updated 2 months ago
- ☆14May 4, 2024Updated last year
- [NeurIPS 2023] Github repository for "Composing Parameter-Efficient Modules with Arithmetic Operations"☆61Nov 26, 2023Updated 2 years ago
- [ICML 2023] Parameter-Level Soft-Masking for Continual Learning☆19Jul 13, 2023Updated 2 years ago
- Code for "The Expressive Power of Low-Rank Adaptation".☆20Apr 19, 2024Updated last year
- "Predict, then Interpolate: A Simple Algorithm to Learn Stable Classifiers" ICML 2021☆18Jun 1, 2021Updated 4 years ago
- Code for: "Cutting Down on Prompts and Parameters: Simple Few-Shot Learning with Language Models"☆20Feb 2, 2022Updated 4 years ago
- [ICLR 2024] Unveiling the Pitfalls of Knowledge Editing for Large Language Models☆22Jun 13, 2024Updated last year
- This repo implements the CVPR23 paper Trainable Projected Gradient Method for Robust Fine-tuning☆24Nov 27, 2023Updated 2 years ago
- Framework code with wandb, checkpointing, logging, configs, experimental protocols. Useful for fine-tuning models or training from scratc…☆154Jan 14, 2023Updated 3 years ago
- [CHIL 2024] Interpretation of Intracardiac Electrograms Through Textual Representations☆12Sep 4, 2024Updated last year
- Restore safety in fine-tuned language models through task arithmetic☆32Mar 28, 2024Updated last year
- This repository contains the core implementation of our ICML 2025 paper: "Token Signature: Predicting Chain-of-Thought Gains with Token D…☆41Jul 18, 2025Updated 7 months ago
- Implementation of LPLR algorithm for matrix compression☆31Nov 21, 2023Updated 2 years ago
- [CCS 2021] TSS: Transformation-specific smoothing for robustness certification☆26Oct 3, 2023Updated 2 years ago
- Code for the ICML 2021 paper "Bridging Multi-Task Learning and Meta-Learning: Towards Efficient Training and Effective Adaptation", Haoxi…☆68Oct 18, 2021Updated 4 years ago
- The codebase for our ACL2023 paper: Did You Read the Instructions? Rethinking the Effectiveness of Task Definitions in Instruction Learni…☆30Jul 16, 2023Updated 2 years ago
- Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. ACM Computing Surveys, 2026.☆683Updated this week
- NILE : Natural Language Inference with Faithful Natural Language Explanations☆30Jun 12, 2023Updated 2 years ago
- Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers".☆80Mar 11, 2024Updated last year
- End-to-end integration of HuggingFace's models for sequence labeling.☆11Oct 4, 2020Updated 5 years ago
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"☆39Jan 12, 2024Updated 2 years ago
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year
- DELLA-Merging: Reducing Interference in Model Merging through Magnitude-Based Sampling☆36Jul 12, 2024Updated last year
- Preprint: Asymmetry in Low-Rank Adapters of Foundation Models☆38Feb 27, 2024Updated 2 years ago
- ☆41Nov 30, 2023Updated 2 years ago
- [NeurIPS 2025] Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains☆83Jul 29, 2025Updated 7 months ago
- [ICLR 2026] Thinking on the Fly: Test-Time Reasoning Enhancement via Latent Thought Policy Optimization☆18Feb 14, 2026Updated 3 weeks ago
- Towards Memorization-Free Diffusion Models (CVPR2024) Codebase☆11Jun 2, 2024Updated last year
- [ICDCS 2023] Evaluation and Optimization of Gradient Compression for Distributed Deep Learning☆10Apr 28, 2023Updated 2 years ago
- Consensus Based Distributed Stochastic Gradient Descent☆11Jun 24, 2018Updated 7 years ago
- JMLR Cover Letter Template☆10Dec 15, 2021Updated 4 years ago
- ☆11Dec 5, 2020Updated 5 years ago
- [WebConf 2020] Searching for polarization in signed graphs: a local spectral approach☆10Feb 3, 2024Updated 2 years ago
- Efficient misspecification uncertainties for linear regression☆16Feb 19, 2026Updated 2 weeks ago
- [NeurIPS 2024] How do Large Language Models Handle Multilingualism?☆51Nov 8, 2024Updated last year