Flexible library for merging large language models (LLMs) via evolutionary optimization (ACL 2025 Demo).
☆102Aug 8, 2025Updated 10 months ago
Alternatives and similar repositories for mergenetic
Users that are interested in mergenetic are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Tiny AI model embedded in NES ROMs to generate character names in-game.☆31Apr 3, 2026Updated 2 months ago
- Generic template to bootstrap your Python project.☆22Jun 8, 2026Updated last week
- official code repo for paper "Merging Models on the Fly Without Retraining: A Sequential Approach to Scalable Continual Model Merging"☆25Oct 11, 2025Updated 8 months ago
- FlexiTokens☆23Dec 27, 2025Updated 5 months ago
- Materials for "IT5: Large-scale Text-to-text Pretraining for Italian Language Understanding and Generation" 🇮🇹☆31Jun 17, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- radiomixer☆14Feb 16, 2022Updated 4 years ago
- Personal implementation of ASIF by Antonio Norelli☆26May 24, 2024Updated 2 years ago
- Relative representations can be leveraged to enable solving tasks regarding "latent communication": from zero-shot model stitching to lat…☆65Apr 26, 2023Updated 3 years ago
- Generic template to bootstrap your PyTorch project.☆651Oct 12, 2023Updated 2 years ago
- 🚀🤗 A collection of templates for Hugging Face Spaces☆34Oct 9, 2023Updated 2 years ago
- [ICLR 2025] Official code of "Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization"☆19Jun 1, 2024Updated 2 years ago
- ☆15Apr 14, 2025Updated last year
- Python Module implementing SRP☆12Jul 29, 2022Updated 3 years ago
- Code and Configs for Asynchronous RLHF: Faster and More Efficient RL for Language Models☆68Mar 5, 2026Updated 3 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- 🔢 Work with static vector models☆39Apr 21, 2025Updated last year
- [NeurIPS 2025] MergeBench: A Benchmark for Merging Domain-Specialized LLMs☆47Feb 11, 2026Updated 4 months ago
- MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation☆18Sep 2, 2024Updated last year
- Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissions☆20Mar 27, 2023Updated 3 years ago
- ☆146Aug 20, 2025Updated 9 months ago
- A quick implementation of diffusion language models.☆49Oct 11, 2025Updated 8 months ago
- ☆27Dec 15, 2025Updated 6 months ago
- Code for paper "Towards Efficient Pareto Set Approximation via Weight-Ensembling Mixture of Experts"☆11Sep 13, 2024Updated last year
- Code for paper "Merging Multi-Task Models via Weight-Ensembling Mixture of Experts"☆32Jun 7, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Bayesian optimization with conformal coverage guarantees☆29Oct 28, 2022Updated 3 years ago
- A toolkit implementing advanced methods to transfer models and model knowledge across tokenizers.☆69Jul 6, 2025Updated 11 months ago
- Repository for the paper "Bayesian Model Selection of Lithium-Ion Battery Models via Bayesian Quadrature"☆14Apr 3, 2024Updated 2 years ago
- Use `outlines` generators with Haystack.☆15Jun 8, 2026Updated last week
- GPU-accelerated Ant Colony Optimization (ACO)☆17Feb 28, 2025Updated last year
- A Discord Bot for distilling papers, GitHub repos, Blogposts, and much more using the power of LLMs and vector search.☆13May 3, 2023Updated 3 years ago
- Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features"☆17Mar 31, 2025Updated last year
- Package to align tokens from different tokenizations.☆16Mar 25, 2024Updated 2 years ago
- An Empirical Study On Contrastive Search And Contrastive Decoding For Open-ended Text Generation☆27Jun 7, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- FurNet: A Deep-Learning-Based Framework for Removing Furniture Objects in Room Image☆13Nov 22, 2022Updated 3 years ago
- A python package to run inference with HuggingFace language and vision-language checkpoints wrapping many convenient features.☆28Sep 14, 2024Updated last year
- Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. ACM Computing Surveys, 2026.☆754Updated this week
- Modified version of fairseq, including new implementations for criterions using reinforcement learning methods.☆11Aug 14, 2019Updated 6 years ago
- ☆17Jan 9, 2025Updated last year
- Official implementation of "Data Mixture Inference: What do BPE tokenizers reveal about their training data?"☆20May 15, 2025Updated last year
- FusionBench: A Comprehensive Benchmark/Toolkit of Deep Model Fusion☆231Jun 11, 2026Updated last week