tommasomncttn/mergenetic

Flexible library for merging large language models (LLMs) via evolutionary optimization (ACL 2025 Demo).

☆104

Alternatives and similar repositories for mergenetic

Users interested in mergenetic are comparing it to the repositories listed below.

OmnAI-Lab / spilled-energy
Energy-based Hallucination detection.
☆25 • Mar 3, 2026 • Updated 4 months ago
autonomousvision / mvdatasets
Standardized DataLoaders for 3D Computer Vision
☆26 • Mar 28, 2025 • Updated last year
erodola / bigram-nes
Tiny AI model embedded in NES ROMs to generate character names in-game.
☆33 • Apr 3, 2026 • Updated 3 months ago
aimagelab / TransFusion
Official codebase of "Update Your Transformer to the Latest Release: Re-Basin of Task Vectors" - ICML 2025
☆23 • Jul 30, 2025 • Updated 11 months ago
Flegyas / latentis
A Python package for analyzing and transforming neural latent spaces.
☆53 • Mar 4, 2026 • Updated 4 months ago
AntoAndGar / task_singular_vectors
Task Singular Vectors: Reducing Task Interference in Model Merging. Merge models avoiding task interference through separable models.
☆57 • Dec 15, 2025 • Updated 7 months ago
crisostomi / metric-few-shot-graph
Few-Shot Graph Classification via distance metric learning.
☆24 • Mar 20, 2024 • Updated 2 years ago
gsarti / it5
Materials for "IT5: Large-scale Text-to-text Pretraining for Italian Language Understanding and Generation" 🇮🇹
☆31 • Jun 17, 2024 • Updated 2 years ago
taidopurason / tokenizer-extension
☆15 • Dec 4, 2025 • Updated 7 months ago
grok-ai / py-template
Generic template to bootstrap your Python project.
☆22 • Updated this week
bminixhofer / tokenkit
A toolkit implementing advanced methods to transfer models and model knowledge across tokenizers.
☆70 • Jul 6, 2025 • Updated last year
erodola / DLAI-s2-2020
Teaching material for the course of Deep Learning and Applied AI, 2nd semester 2020, Sapienza University of Rome
☆34 • Aug 24, 2020 • Updated 5 years ago
michelemancusi / LQVAE-separation
☆45 • Feb 17, 2022 • Updated 4 years ago
zihuanqiu / MINGLE
The code repository for "MINGLE: Mixture of Null-Space Gated Low-Rank Experts for Test-Time Continual Model Merging"(NeurIPS25) in PyTorc…
☆15 • Jun 2, 2026 • Updated last month
embedl / embedl-models
⛔ DEPRECATED -- use flash-head instead (pip install flash-head)
☆29 • Apr 10, 2026 • Updated 3 months ago
thad0ctor / KrunchWrapper
☆18 • Jul 1, 2025 • Updated last year
eliahuhorwitz / ProbeX
Official PyTorch Implementation for the "Learning on Model Weights using Tree Experts" paper (CVPR 2025).
☆16 • Feb 11, 2026 • Updated 5 months ago
Model-GLUE / Model-GLUE
☆18 • Aug 19, 2024 • Updated last year
explosion / curated-tokenizers
Lightweight piece tokenization library
☆12 • Apr 15, 2024 • Updated 2 years ago
diningphil / continual_learning_for_graphs
☆13 • Feb 16, 2021 • Updated 5 years ago
allenai / olmix
☆41 • May 26, 2026 • Updated 2 months ago
hipe-eval / HIPE-2022-data
Data for the HIPE 2022 shared task.
☆23 • May 15, 2026 • Updated 2 months ago
EnnengYang / Awesome-Model-Merging-Methods-Theories-Applications
Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. ACM Computing Surveys, 2026.
☆771 • Updated this week
Aratako / Task-Vector-Merge-Optimzier
☆16 • Apr 11, 2024 • Updated 2 years ago
flowersteam / vivarium
Multi-agent simulator in Jax for research and teaching in AI & ALife
☆31 • Apr 11, 2026 • Updated 3 months ago
levtelyatnikov / radiomixer
radiomixer
☆14 • Feb 16, 2022 • Updated 4 years ago
prateeky2806 / ties-merging
☆217 • Feb 3, 2024 • Updated 2 years ago
MrZilinXiao / ProxyThinker
[ICLR 2026] Official Implementation of ProxyThinker: Test-Time Guidance through Small Visual Reasoners.
☆22 • Sep 24, 2025 • Updated 10 months ago
UniReps / UniReps-resources
☆27 • Dec 3, 2023 • Updated 2 years ago
3diglab / geomfum
Geometry processing and machine learning with functional maps.
☆69 • Jun 26, 2026 • Updated last month
lucmos / relreps
Relative representations can be leveraged to enable solving tasks regarding "latent communication": from zero-shot model stitching to lat…
☆65 • Apr 26, 2023 • Updated 3 years ago
tanganke / fusion_bench
FusionBench: A Comprehensive Benchmark/Toolkit of Deep Model Fusion
☆236 • Jun 23, 2026 • Updated last month
tanganke / opcm
official code repo for paper "Merging Models on the Fly Without Retraining: A Sequential Approach to Scalable Continual Model Merging"
☆25 • Oct 11, 2025 • Updated 9 months ago
kyegomez / Hedgehog
Implementation of the model "Hedgehog" from the paper: "The Hedgehog & the Porcupine: Expressive Linear Attentions with Softmax Mimicry"
☆16 • Mar 11, 2024 • Updated 2 years ago
DunZhang / Jasper-Token-Compression-Training
The training codes of Jasper-Token-Compression-600M
☆20 • Nov 19, 2025 • Updated 8 months ago
QuixiAI / spectrum
☆145 • Aug 20, 2025 • Updated 11 months ago
cimeister / tokenizer-intrinsic-evals
TokEval: intrinsic quality metrics for tokenizers across natural language, code, and math
☆46 • Jul 4, 2026 • Updated 3 weeks ago
allenai / infinigram-api
☆102 • Jul 16, 2026 • Updated last week
kensho-technologies / pathpiece
PathPiece tokenizer
☆14 • Nov 10, 2024 • Updated last year