hkust-nlp/PEM_composition

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/hkust-nlp/PEM_composition)

hkust-nlp / PEM_composition

[NeurIPS 2023] Github repository for "Composing Parameter-Efficient Modules with Arithmetic Operations"

☆61

Alternatives and similar repositories for PEM_composition

Users that are interested in PEM_composition are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

MANGA-UOFA / PTfer
View on GitHub
☆11Nov 13, 2024Updated last year
alon-albalak / online-data-mixing
View on GitHub
An implementation of online data mixing for the Pile dataset, based on the GPT-NeoX library.
☆14Jan 9, 2024Updated 2 years ago
bloomberg / dataless-model-merging
View on GitHub
Code release for Dataless Knowledge Fusion by Merging Weights of Language Models (https://openreview.net/forum?id=FCnohuR6AnM)
☆92Jul 25, 2023Updated 2 years ago
ZurichRain / HMCGR
View on GitHub
code for COLING paper "A Hybrid Model of Classification and Generation for Spatial Relation Extraction"
☆10Oct 20, 2022Updated 3 years ago
duguodong7 / pcb-merging
View on GitHub
[NeurIPS 2024] For paper Parameter Competition Balancing for Model Merging
☆48Oct 11, 2024Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
circle-hit / KBCIN
View on GitHub
Code for AAAI 2023 accepted paper titled "Knowledge-Bridged Causal Interaction Network for Causal Emotion Entailment"
☆14May 6, 2023Updated 3 years ago
abhishekpanigrahi1996 / Skill-Localization-by-grafting
View on GitHub
☆52Jan 1, 2024Updated 2 years ago
Raincleared-Song / ConPET
View on GitHub
Source code for a LoRA-based continual relation extraction method.
☆14Sep 25, 2023Updated 2 years ago
sustcsonglin / disco-pointer
View on GitHub
Official Implementation of ACL2023: Don't Parse, Choose Spans! Continuous and Discontinuous Constituency Parsing via Autoregressive Span …
☆14Aug 25, 2023Updated 2 years ago
EnnengYang / AdaMerging
View on GitHub
AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.
☆113Oct 28, 2024Updated last year
sail-sg / lorahub
View on GitHub
[COLM 2024] LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition
☆671Jul 22, 2024Updated 2 years ago
yule-BUAA / MergeLM
View on GitHub
Codebase for Merging Language Models (ICML 2024)
☆870May 5, 2024Updated 2 years ago
Hannibal046 / PlugLM
View on GitHub
[ACL2023] Source code for Decouple knowledge from paramters for plug-and-play language modeling
☆20Sep 18, 2023Updated 2 years ago
prateeky2806 / ComPEFT
View on GitHub
☆26Nov 23, 2023Updated 2 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
weigq / openview_quicklook
View on GitHub
☆36Mar 10, 2025Updated last year
F2-Song / ICDPO
View on GitHub
The official implementation of "ICDPO: Effectively Borrowing Alignment Capability of Others via In-context Direct Preference Optimization…
☆16Feb 15, 2024Updated 2 years ago
llyx97 / TAMT
View on GitHub
[NAACL 2022] "Learning to Win Lottery Tickets in BERT Transfer via Task-agnostic Mask Training", Yuanxin Liu, Fandong Meng, Zheng Lin, Pe…
☆15Oct 18, 2022Updated 3 years ago
jiaconghu / Model-LEGO
View on GitHub
Model LEGO: Creating Models Like Disassembling and Assembling Building Blocks
☆17Jan 15, 2025Updated last year
r-three / mats
View on GitHub
☆33Jul 8, 2024Updated 2 years ago
yhit98 / FITE
View on GitHub
☆16May 15, 2023Updated 3 years ago
mlfoundations / task_vectors
View on GitHub
Editing Models with Task Arithmetic
☆548Jan 11, 2024Updated 2 years ago
hkust-nlp / llm-compression-intelligence
View on GitHub
Official github repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024]
☆150Sep 20, 2024Updated last year
izhx / uni-rep
View on GitHub
Code for embedding and retrieval research.
☆16Oct 24, 2023Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
StyxXuan / LoraRetriever
View on GitHub
☆17Apr 29, 2025Updated last year
ars22 / scaling-LLM-math-synthetic-data
View on GitHub
Code and data used in the paper: "Training on Incorrect Synthetic Data via RL Scales LLM Math Reasoning Eight-Fold"
☆32Jun 16, 2024Updated 2 years ago
r-three / realistic_evaluation_of_model_merging_for_compositional_generalization
View on GitHub
☆13Feb 11, 2026Updated 5 months ago
ElisaNguyen / bayesian-tda
View on GitHub
Code for NeurIPS'23 paper "A Bayesian Approach To Analysing Training Data Attribution In Deep Learning"
☆17Jan 12, 2024Updated 2 years ago
milesaturpin / cot-unfaithfulness
View on GitHub
☆57Oct 23, 2023Updated 2 years ago
ndaheim / faithful-dialogue
View on GitHub
☆23Mar 31, 2023Updated 3 years ago
circle-hit / CauAIN
View on GitHub
Code for IJCAI 2022 accepted paper titled "CauAIN: Causal Aware Interaction Network for Emotion Recognition in Conversations"
☆24Jun 11, 2023Updated 3 years ago
declare-lab / della
View on GitHub
DELLA-Merging: Reducing Interference in Model Merging through Magnitude-Based Sampling
☆37Jul 12, 2024Updated 2 years ago
r-three / phatgoose
View on GitHub
Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"
☆93Feb 27, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
xlang-ai / batch-prompting
View on GitHub
[EMNLP 2023 Industry Track] A simple prompting approach that enables the LLMs to run inference in batches.
☆76Mar 8, 2024Updated 2 years ago
RUCAIBox / ComVint
View on GitHub
The official GitHub page for ''What Makes for Good Visual Instructions? Synthesizing Complex Visual Reasoning Instructions for Visual Ins…
☆19Nov 10, 2023Updated 2 years ago
fxmeng / mixtral_spliter
View on GitHub
Converting Mixtral-8x7B to Mixtral-[1~7]x7B
☆22Mar 4, 2024Updated 2 years ago
VITA-Group / instant_soup
View on GitHub
[ICML2023] Instant Soup Cheap Pruning Ensembles in A Single Pass Can Draw Lottery Tickets from Large Models. Ajay Jaiswal, Shiwei Liu, Ti…
☆11Nov 28, 2023Updated 2 years ago
KwanWaiChung / M4LE
View on GitHub
Code for M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models
☆23Jul 27, 2024Updated last year
zsLin177 / SRL-as-GP
View on GitHub
☆18Mar 10, 2023Updated 3 years ago
maszhongming / ParaKnowTransfer
View on GitHub
Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective"
☆33May 9, 2024Updated 2 years ago