Flexible library for merging large language models (LLMs) via evolutionary optimization (ACL 2025 Demo).
☆102Aug 8, 2025Updated 8 months ago
Alternatives and similar repositories for mergenetic
Users that are interested in mergenetic are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Teaching material for the course of Deep Learning and Applied AI, 2nd semester 2020, Sapienza University of Rome☆34Aug 24, 2020Updated 5 years ago
- FlexiTokens☆19Dec 27, 2025Updated 3 months ago
- ☆45Feb 17, 2022Updated 4 years ago
- Personal implementation of ASIF by Antonio Norelli☆26May 24, 2024Updated last year
- Relative representations can be leveraged to enable solving tasks regarding "latent communication": from zero-shot model stitching to lat…☆65Apr 26, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Generic template to bootstrap your PyTorch project.☆650Oct 12, 2023Updated 2 years ago
- ☆14Apr 14, 2025Updated last year
- ANE accelerated embedding models!☆20Dec 11, 2024Updated last year
- 🚀🤗 A collection of templates for Hugging Face Spaces☆35Oct 9, 2023Updated 2 years ago
- [ICLR 2025] Official code of "Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization"☆18Jun 1, 2024Updated last year
- ☆36Mar 26, 2022Updated 4 years ago
- Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]☆32Jan 23, 2025Updated last year
- Code and Configs for Asynchronous RLHF: Faster and More Efficient RL for Language Models☆68Mar 5, 2026Updated last month
- 🔢 Work with static vector models☆39Apr 21, 2025Updated 11 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [NeurIPS 2025] MergeBench: A Benchmark for Merging Domain-Specialized LLMs☆46Feb 11, 2026Updated 2 months ago
- ☆145Aug 20, 2025Updated 7 months ago
- ☆22Dec 15, 2025Updated 4 months ago
- Code for paper "Towards Efficient Pareto Set Approximation via Weight-Ensembling Mixture of Experts"☆11Sep 13, 2024Updated last year
- Code for paper "Merging Multi-Task Models via Weight-Ensembling Mixture of Experts"☆31Jun 7, 2024Updated last year
- Bayesian optimization with conformal coverage guarantees☆29Oct 28, 2022Updated 3 years ago
- Released code for ICDM 2016 Budgeted Batch Bayesian Optimization☆10Feb 11, 2019Updated 7 years ago
- Repository for the paper "Bayesian Model Selection of Lithium-Ion Battery Models via Bayesian Quadrature"☆14Apr 3, 2024Updated 2 years ago
- GPU-accelerated Ant Colony Optimization (ACO)☆17Feb 28, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Greed is Good: Exploration and Exploitation Trade-offs in Bayesian Optimisation☆10May 20, 2021Updated 4 years ago
- Use `outlines` generators with Haystack.☆15Updated this week
- A collection of papers related to knowledge fusion☆57Oct 11, 2024Updated last year
- ☆97Mar 11, 2026Updated last month
- ☆18Aug 19, 2024Updated last year
- Official implementation for the paper "Sample-Then-Optimize Batch Neural Thompson Sampling", published at NeurIPS 2022.☆10Oct 13, 2022Updated 3 years ago
- A Discord Bot for distilling papers, GitHub repos, Blogposts, and much more using the power of LLMs and vector search.☆13May 3, 2023Updated 2 years ago
- Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features"☆17Mar 31, 2025Updated last year
- An Empirical Study On Contrastive Search And Contrastive Decoding For Open-ended Text Generation☆27Jun 7, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- NLP Preprocessing Pipeline Wrappers☆11May 12, 2023Updated 2 years ago
- Official implementation of "Data Mixture Inference: What do BPE tokenizers reveal about their training data?"☆18May 15, 2025Updated 11 months ago
- FurNet: A Deep-Learning-Based Framework for Removing Furniture Objects in Room Image☆14Nov 22, 2022Updated 3 years ago
- ☆24Dec 11, 2024Updated last year
- Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. ACM Computing Surveys, 2026.☆713Updated this week
- Evaluate Transformers from the Hub 🔥☆14Apr 3, 2026Updated 2 weeks ago
- DImensionality REduction in JAX☆26Nov 21, 2025Updated 4 months ago