flowritecom / flow-mergeLinks
flow-merge is a powerful Python library that enables seamless merging of multiple transformer-based language models using the most popular merge methods such as model soups, SLERP, ties-MERGING or DARE.
☆20Updated 11 months ago
Alternatives and similar repositories for flow-merge
Users that are interested in flow-merge are comparing it to the libraries listed below
Sorting:
- An automated data pipeline scaling RL to pretraining levels☆73Updated 3 months ago
- ☆82Updated last year
- ☆120Updated last year
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆24Updated last year
- Unofficial Implementation of Evolutionary Model Merging☆41Updated last year
- ☆55Updated last year
- [TMLR 2026] When Attention Collapses: How Degenerate Layers in LLMs Enable Smaller, Stronger Models☆121Updated 11 months ago
- entropix style sampling + GUI☆27Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆61Updated last year
- Lightweight continuous batching OpenAI compatibility using HuggingFace Transformers include T5 and Whisper.☆29Updated 10 months ago
- ☆51Updated last year
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31Updated last year
- ☆95Updated last year
- ☆26Updated last year
- An unofficial pytorch implementation of 'Efficient Infinite Context Transformers with Infini-attention'☆54Updated last year
- Lightweight toolkit package to train and fine-tune 1.58bit Language models☆109Updated 8 months ago
- A repository for research on medium sized language models.☆77Updated last year
- Collection of autoregressive model implementation☆85Updated 3 weeks ago
- https://x.com/BlinkDL_AI/status/1884768989743882276☆28Updated 8 months ago
- Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812)☆35Updated 10 months ago
- ☆29Updated 2 months ago
- A scalable automated alignment method for large language models. Resources for "Aligning Large Language Models via Self-Steering Optimiza…☆20Updated last year
- Nexusflow function call, tool use, and agent benchmarks.☆30Updated last year
- ☆85Updated 2 months ago
- ☆137Updated last year
- Easy to use, High Performant Knowledge Distillation for LLMs☆97Updated 8 months ago
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.☆37Updated 3 months ago
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss.☆143Updated 2 years ago
- Library to facilitate pruning of LLMs based on context☆32Updated 2 years ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks (EMNLP'24)☆147Updated last year