cognitivecomputations / extract-expertLinks

Extract a single expert from a Mixture Of Experts model using slerp interpolation.

☆17

Alternatives and similar repositories for extract-expert

Users that are interested in extract-expert are comparing it to the libraries listed below

Sorting:

cognitivecomputations / kraken
☆66Updated last year
emrgnt-cmplxty / zero-shot-replication
☆73Updated last year
arcee-ai / DAM
☆51Updated 7 months ago
emrgnt-cmplxty / SmolTrainer
☆20Updated last year
EduardTalianu / EntropixLab
entropix style sampling + GUI
☆26Updated 7 months ago
serp-ai / Parameter-Efficient-MoE
Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks
☆31Updated last year
shirley-wu / cot_decoding
☆45Updated last year
zarakiquemparte / zaraki-tools
☆27Updated last year
kubernetes-bad / reward-composer
Lego for GRPO
☆28Updated 3 weeks ago
brendanhogan / picoDeepResearch
☆63Updated last month
Mihaiii / backtrack_sampler
An easy-to-understand framework for LLM samplers that rewind and revise generated tokens
☆140Updated 4 months ago
interstellarninja / function-calling-eval
A framework for evaluating function calls made by LLMs
☆37Updated 11 months ago
teknium1 / transformers-gptq-quant
☆47Updated last year
xjdr-alt / llmri
look how they massacred my boy
☆63Updated 8 months ago
Alex-Gurung / ReasoningNCP
Official repo for Learning to Reason for Long-Form Story Generation
☆63Updated 2 months ago
JoshuaPurtell / SmallBench
Small, simple agent task environments for training and evaluation
☆18Updated 7 months ago
teknium1 / ShareGPT-Builder
☆114Updated 6 months ago
cognitivecomputations / spectrum
☆124Updated 2 months ago
Xalp / ECHO
Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)
☆91Updated 5 months ago
cognitivecomputations / grokadamw
☆132Updated 10 months ago
thooton / muse
Let's create synthetic textbooks together :)
☆75Updated last year
Digitous / ModelREVOLVER
Model REVOLVER, a human in the loop model mixing system.
☆33Updated last year
cognitivecomputations / OpenChatML
☆157Updated 11 months ago
casper-hansen / OpenCoconut
OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.
☆173Updated 5 months ago
axolotl-ai-cloud / axolotl-cookbook
☆34Updated 3 months ago
bdambrosio / AllTheWorldAPlay
All the world is a play, we are but actors in it.
☆50Updated this week
teknium1 / LLM-Benchmark-Logs
Just a bunch of benchmark logs for different LLMs
☆119Updated 10 months ago
cognitivecomputations / dolphin-logger
☆96Updated last week
OpenEvaByte / evabyte
EvaByte: Efficient Byte-level Language Models at Scale
☆102Updated 2 months ago
louisbrulenaudet / ragoon
High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡
☆66Updated 7 months ago