QuixiAI / extract-expertLinks

Extract a single expert from a Mixture Of Experts model using slerp interpolation.

☆17

Alternatives and similar repositories for extract-expert

Users that are interested in extract-expert are comparing it to the libraries listed below

Sorting:

QuixiAI / kraken
☆67Updated last year
QuixiAI / laserRMT
This is our own implementation of 'Layer Selective Rank Reduction'
☆240Updated last year
teknium1 / ShareGPT-Builder
☆116Updated 9 months ago
emrgnt-cmplxty / zero-shot-replication
☆74Updated 2 years ago
teknium1 / LLM-Benchmark-Logs
Just a bunch of benchmark logs for different LLMs
☆119Updated last year
thomasgauthier / LoRD
Low-Rank adapter extraction for fine-tuned transformers models
☆177Updated last year
QuixiAI / OpenChatML
☆161Updated last month
Gryphe / MergeMonster
An unsupervised model merging algorithm for Transformers-based language models.
☆107Updated last year
QuixiAI / grokadamw
☆135Updated last year
teknium1 / transformers-gptq-quant
☆46Updated last year
EduardTalianu / EntropixLab
entropix style sampling + GUI
☆27Updated 10 months ago
rafacelente / bllama
1.58-bit LLaMa model
☆82Updated last year
Mihaiii / backtrack_sampler
An easy-to-understand framework for LLM samplers that rewind and revise generated tokens
☆146Updated 7 months ago
thooton / muse
Let's create synthetic textbooks together :)
☆75Updated last year
serp-ai / Parameter-Efficient-MoE
Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks
☆31Updated last year
QuixiAI / spectrum
☆135Updated last month
Gryphe / BlockMerge_Gradient
Merge Transformers language models by use of gradient parameters.
☆208Updated last year
emrgnt-cmplxty / SmolTrainer
☆20Updated last year
migtissera / Sensei
Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI
☆222Updated last year
arcee-ai / DAM
☆54Updated 10 months ago
Digitous / ModelREVOLVER
Model REVOLVER, a human in the loop model mixing system.
☆33Updated 2 years ago
kerekovskik / autologic
autologic is a Python package that implements the SELF-DISCOVER framework proposed in the paper SELF-DISCOVER: Large Language Models Self…
☆60Updated last year
sdan / selfextend
an implementation of Self-Extend, to expand the context window via grouped attention
☆118Updated last year
VatsaDev / NanoPhi-alpha
GPT-2 small trained on phi-like data
☆67Updated last year
AblateIt / finetune-study
Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.
☆83Updated 2 years ago
JoshuaPurtell / SmallBench
Small, simple agent task environments for training and evaluation
☆18Updated 10 months ago
Xalp / ECHO
Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)
☆92Updated 8 months ago
xjdr-alt / llmri
look how they massacred my boy
☆64Updated 11 months ago
michaelfeil / embed
A stable, fast and easy-to-use inference library with a focus on a sync-to-async API
☆45Updated last year
SinatrasC / entropix-smollm
smolLM with Entropix sampler on pytorch
☆150Updated 10 months ago