SinatrasC / entropix-smollm
smolLM with Entropix sampler on pytorch
☆139 Updated 2 weeks ago
Related projects
Alternatives and complementary repositories for entropix-smollm
- ☆94 Updated last month
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens ☆113 Updated 3 weeks ago
- look how they massacred my boy ☆58 Updated last month
- MLX port for xjdr's entropix sampler (mimics jax implementation) ☆55 Updated 2 weeks ago
- smol models are fun too ☆77 Updated last week
- An introduction to LLM Sampling ☆64 Updated last week
- Modify Entropy Based Sampling to work with Mac Silicon via MLX ☆48 Updated 2 weeks ago
- ☆118 Updated 3 months ago
- Entropy Based Sampling and Parallel CoT Decoding ☆17 Updated last month
- Aidan Bench attempts to measure <big_model_smell> in LLMs. ☆96 Updated this week
- code for training & evaluating Contextual Document Embedding models ☆117 Updated this week
- ☆227 Updated last month
- ☆93 Updated last month
- A comprehensive repository of reasoning tasks for LLMs (and beyond) ☆282 Updated last month
- Simple Transformer in Jax ☆119 Updated 4 months ago
- ☆49 Updated 8 months ago
- ☆104 Updated 8 months ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes. ☆81 Updated last year
- Just a bunch of benchmark logs for different LLMs ☆114 Updated 3 months ago
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user… ☆151 Updated this week
- Fast parallel LLM inference for MLX ☆149 Updated 4 months ago
- an implementation of Self-Extend, to expand the context window via grouped attention ☆118 Updated 10 months ago
- entropix style sampling + GUI ☆25 Updated 3 weeks ago
- ☆74 Updated 3 weeks ago
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI ☆221 Updated 6 months ago
- ☆48 Updated last year
- This is our own implementation of 'Layer Selective Rank Reduction' ☆232 Updated 5 months ago
- ☆64 Updated 5 months ago
- ☆20 Updated 2 weeks ago
- Repo for "LoLCATs: On Low-Rank Linearizing of Large Language Models" ☆177 Updated last month