cognitivecomputations / extract-expert
Extract a single expert from a Mixture Of Experts model using slerp interpolation.
☆17Updated 10 months ago
Alternatives and similar repositories for extract-expert:
Users that are interested in extract-expert are comparing it to the libraries listed below
- Let's create synthetic textbooks together :)☆74Updated last year
- ☆66Updated 10 months ago
- Small, simple agent task environments for training and evaluation☆18Updated 5 months ago
- entropix style sampling + GUI☆25Updated 5 months ago
- ☆20Updated last year
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆90Updated 2 months ago
- ☆112Updated 3 months ago
- ☆53Updated 10 months ago
- ☆73Updated last year
- Just a bunch of benchmark logs for different LLMs☆119Updated 8 months ago
- ☆48Updated 5 months ago
- A framework for evaluating function calls made by LLMs☆37Updated 8 months ago
- ☆45Updated last year
- ☆48Updated last year
- EvaByte: Efficient Byte-level Language Models at Scale☆86Updated 3 weeks ago
- ☆113Updated last week
- Model REVOLVER, a human in the loop model mixing system.☆33Updated last year
- Evaluating LLMs with CommonGen-Lite☆89Updated last year
- Entropy Based Sampling and Parallel CoT Decoding☆17Updated 6 months ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31Updated 10 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆59Updated last year
- ☆17Updated 4 months ago
- Conduct in-depth research with AI-driven insights : DeepDive is a command-line tool that leverages web searches and AI models to generate…☆42Updated 7 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆67Updated 5 months ago
- A guidance compatibility layer for llama-cpp-python☆34Updated last year
- autologic is a Python package that implements the SELF-DISCOVER framework proposed in the paper SELF-DISCOVER: Large Language Models Self…☆57Updated last year
- Lego for GRPO☆26Updated 2 weeks ago
- Prompt Jinja2 templates for LLMs☆31Updated 3 months ago
- Mixing Language Models with Self-Verification and Meta-Verification☆103Updated 4 months ago
- look how they massacred my boy☆63Updated 6 months ago