Mihaiii / backtrack_sampler
An easy-to-understand framework for LLM samplers that rewind and revise generated tokens
☆137Updated last month
Alternatives and similar repositories for backtrack_sampler:
Users that are interested in backtrack_sampler are comparing it to the libraries listed below
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆168Updated 2 months ago
- smolLM with Entropix sampler on pytorch☆150Updated 4 months ago
- an implementation of Self-Extend, to expand the context window via grouped attention☆118Updated last year
- ☆113Updated 5 months ago
- Low-Rank adapter extraction for fine-tuned transformers models☆171Updated 10 months ago
- ☆126Updated 7 months ago
- Testing LLM reasoning abilities with family relationship quizzes.☆62Updated last month
- look how they massacred my boy☆63Updated 5 months ago
- ☆111Updated 3 months ago
- This is our own implementation of 'Layer Selective Rank Reduction'☆233Updated 9 months ago
- Evaluating LLMs with CommonGen-Lite☆89Updated last year
- An introduction to LLM Sampling☆77Updated 3 months ago
- Easy to use, High Performant Knowledge Distillation for LLMs☆52Updated last week
- ☆65Updated 9 months ago
- Hallucinations (Confabulations) Document-Based Benchmark for RAG. Includes human-verified questions and answers.☆115Updated last week
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI☆224Updated 10 months ago
- entropix style sampling + GUI☆25Updated 4 months ago
- A pipeline for LLM knowledge distillation☆96Updated last month
- ☆110Updated 6 months ago
- Full finetuning of large language models without large memory requirements☆93Updated last year
- ☆53Updated 9 months ago
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.☆196Updated 8 months ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆82Updated last year
- An efficent implementation of the method proposed in "The Era of 1-bit LLMs"☆154Updated 5 months ago
- RWKV-7: Surpassing GPT☆81Updated 4 months ago
- Code for ExploreTom☆78Updated 3 months ago
- PyTorch implementation of models from the Zamba2 series.☆177Updated last month