Flexible library for merging large language models (LLMs) via evolutionary optimization (ACL 2025 Demo).
☆102 Aug 8, 2025 Updated 9 months ago
Alternatives and similar repositories for mergenetic
Users who are interested in mergenetic are comparing it to the libraries listed below.
- Official codebase of "Update Your Transformer to the Latest Release: Re-Basin of Task Vectors" - ICML 2025 ☆23 Jul 30, 2025 Updated 9 months ago
- A PyTorch-based neural implicit geometry toolbox. ☆16 Jul 25, 2022 Updated 3 years ago
- FlexiTokens ☆23 Dec 27, 2025 Updated 5 months ago
- Materials for "IT5: Large-scale Text-to-text Pretraining for Italian Language Understanding and Generation" 🇮🇹 ☆31 Jun 17, 2024 Updated last year
- Personal implementation of ASIF by Antonio Norelli ☆26 May 24, 2024 Updated 2 years ago
- ANE accelerated embedding models! ☆19 Dec 11, 2024 Updated last year
- User-friendly viewer for Parquet files ☆12 May 8, 2026 Updated 3 weeks ago
- 🚀🤗 A collection of templates for Hugging Face Spaces ☆34 Oct 9, 2023 Updated 2 years ago
- Code for the papers: "Graph Representation Learning for Multi-Task Settings: a Meta-Learning Approach", "A Meta-Learning Approach for Gra… ☆18 Apr 26, 2022 Updated 4 years ago
- [ICLR 2025] Official code of "Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization" ☆19 Jun 1, 2024 Updated last year
- Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025] ☆32 Jan 23, 2025 Updated last year
- ☆15 Apr 14, 2025 Updated last year
- Python Module implementing SRP ☆12 Jul 29, 2022 Updated 3 years ago
- Code and Configs for Asynchronous RLHF: Faster and More Efficient RL for Language Models ☆68 Mar 5, 2026 Updated 2 months ago
- 🔢 Work with static vector models ☆39 Apr 21, 2025 Updated last year
- MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation ☆17 Sep 2, 2024 Updated last year
- ☆26 Dec 15, 2025 Updated 5 months ago
- Code for paper "Towards Efficient Pareto Set Approximation via Weight-Ensembling Mixture of Experts" ☆11 Sep 13, 2024 Updated last year
- A toolkit implementing advanced methods to transfer models and model knowledge across tokenizers. ☆67 Jul 6, 2025 Updated 10 months ago
- Bayesian optimization with conformal coverage guarantees ☆29 Oct 28, 2022 Updated 3 years ago
- Released code for ICDM 2016 Budgeted Batch Bayesian Optimization ☆10 Feb 11, 2019 Updated 7 years ago
- Gradient Descent optimizers for Julia ☆12 May 26, 2020 Updated 6 years ago
- Greed is Good: Exploration and Exploitation Trade-offs in Bayesian Optimisation ☆10 May 20, 2021 Updated 5 years ago
- Use `outlines` generators with Haystack. ☆15 May 18, 2026 Updated last week
- Prototyping Area HAT (Hardware Attached on Top) for Raspberry Pi ☆11 May 20, 2020 Updated 6 years ago
- GPU-accelerated Ant Colony Optimization (ACO) ☆17 Feb 28, 2025 Updated last year
- ☆14 Nov 2, 2022 Updated 3 years ago
- ☆18 Aug 19, 2024 Updated last year
- A Streamlit app to add structured tags to a dataset card ☆22 Jun 30, 2022 Updated 3 years ago
- An Empirical Study On Contrastive Search And Contrastive Decoding For Open-ended Text Generation ☆27 Jun 7, 2024 Updated last year
- Official implementation of "Data Mixture Inference: What do BPE tokenizers reveal about their training data?" ☆19 May 15, 2025 Updated last year
- NLP Preprocessing Pipeline Wrappers ☆11 May 12, 2023 Updated 3 years ago
- ☆24 Dec 11, 2024 Updated last year
- A Python package to run inference with HuggingFace language and vision-language checkpoints wrapping many convenient features. ☆28 Sep 14, 2024 Updated last year
- Evaluate Transformers from the Hub 🔥 ☆14 Apr 3, 2026 Updated last month
- DImensionality REduction in JAX ☆26 Nov 21, 2025 Updated 6 months ago
- code for BINOCULARS and Multi-Step BO ☆12 Dec 7, 2020 Updated 5 years ago
- ☆17 Jan 9, 2025 Updated last year
- A concise list of CLI coding tools similar to Claude Code ☆39 Apr 13, 2026 Updated last month