r-three/realistic_evaluation_of_model_merging_for_compositional_generalization

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/r-three/realistic_evaluation_of_model_merging_for_compositional_generalization)

r-three / realistic_evaluation_of_model_merging_for_compositional_generalization

☆13

Alternatives and similar repositories for realistic_evaluation_of_model_merging_for_compositional_generalization

Users that are interested in realistic_evaluation_of_model_merging_for_compositional_generalization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

alexrame / diwa
View on GitHub
DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization
☆31Jan 31, 2023Updated 3 years ago
tanganke / opcm
View on GitHub
official code repo for paper "Merging Models on the Fly Without Retraining: A Sequential Approach to Scalable Continual Model Merging"
☆25Oct 11, 2025Updated 9 months ago
yule-BUAA / MergeLLM
View on GitHub
Codes for Merging Large Language Models
☆37Aug 7, 2024Updated last year
MilesCranmer / pysr_scaling_laws
View on GitHub
You should use PySR to find scaling laws. Here's an example.
☆34Sep 30, 2023Updated 2 years ago
migalabs / eth-light-crawler
View on GitHub
This tool uses Ethereum's peer-discovery protocol to measure the size of the Ethereum network (testnets included)
☆15Dec 15, 2022Updated 3 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
Model-GLUE / Model-GLUE
View on GitHub
☆18Aug 19, 2024Updated last year
mchiquier / llm-mutate
View on GitHub
☆15Oct 7, 2024Updated last year
apple / ml-reversal-blessing
View on GitHub
☆17Jul 31, 2025Updated 11 months ago
joeljang / FLM
View on GitHub
All-in-one repository for Fine-tuning & Pretraining (Large) Language Models
☆15Mar 8, 2023Updated 3 years ago
cs125-illinois / www-old
View on GitHub
Course website static sources.
☆11Dec 7, 2022Updated 3 years ago
sanketvmehta / lifelong-learning-pretraining-and-sam
View on GitHub
Code for the paper "Mehta, S. V., Patil, D., Chandar, S., & Strubell, E. (2023). An Empirical Investigation of the Role of Pre-training i…
☆18Mar 18, 2024Updated 2 years ago
prometheus-eval / scaling-evaluation-compute
View on GitHub
Repository for "Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators"
☆12Mar 25, 2025Updated last year
cxa-unique / Simplified-TinyBERT
View on GitHub
ECIR'21: Simplified TinyBERT: Knowledge Distillation for Document Retrieval
☆17Apr 25, 2021Updated 5 years ago
kimyuji / EvolvingQA_benchmark
View on GitHub
Code and Dataset release of "Carpe Diem: On the Evaluation of World Knowledge in Lifelong Language Models" (NAACL 2024)
☆10Oct 16, 2024Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
ethanlshen / HierNet
View on GitHub
Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representation…
☆23Nov 8, 2023Updated 2 years ago
fredzzhang / atlas
View on GitHub
[NeurIPS'24] Official PyTorch implementation for paper "Knowledge Composition using Task Vectors with Learned Anisotropic Scaling"
☆28Feb 24, 2025Updated last year
pluck-lang / Pluck.jl
View on GitHub
An expressive language for discrete probabilistic programming with lazy knowledge compilation
☆17Apr 3, 2026Updated 3 months ago
glchau / TOTEM_for_EEG_code
View on GitHub
Code for using TOTEM on EEG data
☆15Sep 24, 2025Updated 10 months ago
zhuole1025 / LLMs_as_Visual_Explainers
View on GitHub
Official Repository for "LLMs as Visual Explainers: Advancing Image Classification with Evolving Visual Descriptions"
☆15Apr 20, 2025Updated last year
alphadl / R1
View on GitHub
🚀enhanced GRPO with more verifiable rewards and real-time evaluators
☆37Jan 27, 2026Updated 5 months ago
infinitered / keras-model-zoo
View on GitHub
Ready to go, downloadable models for Keras
☆11Oct 28, 2019Updated 6 years ago
xzhxzhxzhxzhxzh / WebAgent
View on GitHub
☆12Jan 1, 2024Updated 2 years ago
circle-hit / Lens
View on GitHub
Code for our paper titled "Lens: Rethinking Multilingual Enhancement for Large Language Models"
☆12Oct 15, 2024Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
kochbj / Reduced_Reused_Recycled
View on GitHub
Github for "Reduced, Reused and Recycled" (NeurIPS 2021 Best Paper, D&B Track)
☆17Jan 8, 2022Updated 4 years ago
Digitous / LLM-SLERP-Merge
View on GitHub
Spherical Merge Pytorch/HF format Language Models with minimal feature loss.
☆153Sep 10, 2023Updated 2 years ago
mlfoundations / patching
View on GitHub
Patching open-vocabulary models by interpolating weights
☆91Sep 28, 2023Updated 2 years ago
eberharf / cfl
View on GitHub
☆19May 5, 2026Updated 2 months ago
jxmorris12 / gptzip
View on GitHub
Losslessly encode text natively with arithmetic coding and HuggingFace Transformers
☆81Oct 28, 2025Updated 8 months ago
cimm-kzn / RuDReC
View on GitHub
Russian Drug Reaction Corpus (RuDReC)
☆13Dec 29, 2020Updated 5 years ago
daniel-furman / polyglot-or-not
View on GitHub
Are foundation LMs multilingual knowledge bases? (EMNLP 2023)
☆18Dec 8, 2023Updated 2 years ago
Clear-3d / torch-ngp-semantic
View on GitHub
An implementation of torchngp + semantic-nerf
☆13Sep 10, 2023Updated 2 years ago
kyegomez / FastFF
View on GitHub
Zeta implementation of a reusable and plug in and play feedforward from the paper "Exponentially Faster Language Modeling"
☆16Nov 11, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
EnnengYang / RepresentationSurgery
View on GitHub
Representation Surgery for Multi-Task Model Merging. ICML, 2024.
☆49Oct 10, 2024Updated last year
tanganke / fusion_bench
View on GitHub
FusionBench: A Comprehensive Benchmark/Toolkit of Deep Model Fusion
☆235Jun 23, 2026Updated last month
wang-kee / LiNeS
View on GitHub
Official repository of "LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging"
☆31Nov 4, 2024Updated last year
YingqiLiu1999 / DFedPGP
View on GitHub
☆14Jan 3, 2025Updated last year
kavigupta / urbanstats
View on GitHub
A website for viewing statistics of various areas in the United States.
☆27Updated this week
bilal-chughtai / rep-theory-mech-interp
View on GitHub
☆31May 4, 2023Updated 3 years ago
kyegomez / MobileVLM
View on GitHub
Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …
☆15Mar 11, 2024Updated 2 years ago