cognitivecomputations / extract-expert
Extract a single expert from a Mixture Of Experts model using slerp interpolation.
☆17 Updated last year
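For readers unfamiliar with the term, slerp (spherical linear interpolation) blends two vectors along the arc between them rather than along a straight line. The sketch below is a minimal, generic illustration in plain Python. It is not taken from the extract-expert repository, and how that project applies slerp to expert weights is not described on this page. The function name `slerp` and the `eps` parameter are assumptions made for this example.

```python
import math


def slerp(v0, v1, t, eps=1e-8):
    """Spherically interpolate between two equal-length vectors.

    Generic illustration only; not the repository's implementation.
    t=0 returns v0, t=1 returns v1.
    """
    norm0 = math.sqrt(sum(x * x for x in v0))
    norm1 = math.sqrt(sum(x * x for x in v1))

    # Plain linear interpolation, used as a fallback below.
    lerp = [(1 - t) * a + t * b for a, b in zip(v0, v1)]

    # A zero-length vector has no direction, so fall back to lerp.
    if norm0 < eps or norm1 < eps:
        return lerp

    # Cosine of the angle between the vectors, clamped for acos.
    dot = sum(a * b for a, b in zip(v0, v1)) / (norm0 * norm1)
    dot = max(-1.0, min(1.0, dot))
    omega = math.acos(dot)
    sin_omega = math.sin(omega)

    # Nearly parallel (or anti-parallel) vectors: the slerp weights
    # are ill-conditioned, so fall back to lerp.
    if abs(sin_omega) < eps:
        return lerp

    w0 = math.sin((1 - t) * omega) / sin_omega
    w1 = math.sin(t * omega) / sin_omega
    return [w0 * a + w1 * b for a, b in zip(v0, v1)]
```

Calling `slerp([1.0, 0.0], [0.0, 1.0], 0.5)` yields a vector halfway along the quarter circle between the two inputs, with both components equal. In a model-merging setting the inputs would typically be flattened weight tensors, but that usage is an assumption here rather than something this listing confirms.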
Alternatives and similar repositories for extract-expert
Users who are interested in extract-expert are comparing it to the libraries listed below.
- ☆66 Updated last year
- entropix style sampling + GUI ☆26 Updated 7 months ago
- ☆19 Updated last year
- Small, simple agent task environments for training and evaluation ☆18 Updated 7 months ago
- ☆72 Updated last year
- ☆114 Updated 5 months ago
- Just a bunch of benchmark logs for different LLMs ☆119 Updated 10 months ago
- ☆49 Updated 6 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025) ☆90 Updated 4 months ago
- A framework for evaluating function calls made by LLMs ☆37 Updated 10 months ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks ☆31 Updated last year
- ☆53 Updated last year
- ☆157 Updated 10 months ago
- Let's create synthetic textbooks together :) ☆75 Updated last year
- ☆45 Updated last year
- Model REVOLVER, a human in the loop model mixing system. ☆32 Updated last year
- ☆48 Updated last year
- ☆121 Updated last month
- Train your own SOTA deductive reasoning model ☆92 Updated 2 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ ☆66 Updated 7 months ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens ☆139 Updated 3 months ago
- All the world is a play, we are but actors in it. ☆50 Updated this week
- This is our own implementation of 'Layer Selective Rank Reduction' ☆238 Updated last year
- autologic is a Python package that implements the SELF-DISCOVER framework proposed in the paper SELF-DISCOVER: Large Language Models Self… ☆57 Updated last year
- An unsupervised model merging algorithm for Transformers-based language models. ☆104 Updated last year
- Scrape and export data from the Open LLM Leaderboard. ☆45 Updated 5 months ago
- Data preparation code for CrystalCoder 7B LLM ☆44 Updated last year
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API ☆45 Updated 8 months ago
- ☆30 Updated 10 months ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte… ☆70 Updated 7 months ago