arcee-ai / mergekitLinks

Tools for merging pretrained large language models.

☆6,378

Alternatives and similar repositories for mergekit

Users that are interested in mergekit are comparing it to the libraries listed below

Sorting:

axolotl-ai-cloud / axolotl
Go ahead and axolotl questions
☆10,634Updated last week
huggingface / alignment-handbook
Robust recipes to align language models with human and AI preferences
☆5,398Updated last month
EleutherAI / lm-evaluation-harness
A framework for few-shot evaluation of language models.
☆10,373Updated last week
meta-pytorch / torchtune
PyTorch native post-training library
☆5,547Updated this week
argilla-io / distilabel
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…
☆2,903Updated this week
allenai / open-instruct
AllenAI's post-training codebase
☆3,252Updated last week
AutoGPTQ / AutoGPTQ
An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
☆4,970Updated 6 months ago
casper-hansen / AutoAWQ
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:
☆2,260Updated 5 months ago
huggingface / datatrove
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
☆2,673Updated last week
bitsandbytes-foundation / bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
☆7,659Updated 3 weeks ago
mit-han-lab / llm-awq
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
☆3,318Updated 3 months ago
huggingface / trl
Train transformer language models with reinforcement learning.
☆15,934Updated this week
turboderp-org / exllamav2
A fast inference library for running LLMs locally on modern consumer-class GPUs
☆4,346Updated 2 months ago
huggingface / nanotron
Minimalistic large language model 3D-parallelism training
☆2,267Updated last month
stanfordnlp / pyreft
Stanford NLP Python library for Representation Finetuning (ReFT)
☆1,514Updated 8 months ago
meta-pytorch / gpt-fast
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
☆6,128Updated 2 months ago
gkamradt / LLMTest_NeedleInAHaystack
Doing simple retrieval from LLM models at various context lengths to measure accuracy
☆2,056Updated last year
S-LoRA / S-LoRA
S-LoRA: Serving Thousands of Concurrent LoRA Adapters
☆1,859Updated last year
EleutherAI / pythia
The hub for EleutherAI's work on interpretability and learning dynamics
☆2,639Updated 4 months ago
huggingface / lighteval
Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
☆2,021Updated this week
McGill-NLP / llm2vec
Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'
☆1,599Updated 9 months ago
XueFuzhao / OpenMoE
A family of open-sourced Mixture-of-Experts (MoE) Large Language Models
☆1,614Updated last year
huggingface / text-generation-inference
Large Language Model Text Generation Inference
☆10,580Updated last month
predibase / lorax
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
☆3,518Updated 5 months ago
openai / transformer-debugger
☆4,100Updated last year
noamgat / lm-format-enforcer
Enforce the output format (JSON Schema, Regex etc) of a language model
☆1,943Updated 2 months ago
jquesnelle / yarn
YaRN: Efficient Context Window Extension of Large Language Models
☆1,619Updated last year
jiaweizzhao / GaLore
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
☆1,610Updated 11 months ago
allenai / OLMo
Modeling, training, eval, and inference code for OLMo
☆6,044Updated last week
mistralai / mistral-finetune
☆3,031Updated last year