Mihaiii / backtrack_sampler

An easy-to-understand framework for LLM samplers that rewind and revise generated tokens

☆140

Alternatives and similar repositories for backtrack_sampler

Users who are interested in backtrack_sampler are comparing it to the libraries listed below.

thomasgauthier / LoRD
Low-Rank adapter extraction for fine-tuned transformers models
☆175 Updated last year
QuixiAI / grokadamw
☆134 Updated 11 months ago
SinatrasC / entropix-smollm
smolLM with Entropix sampler on PyTorch
☆150 Updated 9 months ago
Alex-Gurung / ReasoningNCP
Official repo for Learning to Reason for Long-Form Story Generation
☆68 Updated 3 months ago
casper-hansen / OpenCoconut
OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.
☆173 Updated 6 months ago
QuixiAI / spectrum
☆128 Updated 3 months ago
minosvasilias / simple_grpo
Simple GRPO scripts and configurations.
☆59 Updated 5 months ago
sdan / selfextend
an implementation of Self-Extend, to expand the context window via grouped attention
☆119 Updated last year
EduardTalianu / EntropixLab
entropix style sampling + GUI
☆26 Updated 9 months ago
OpenPipe / deductive-reasoning
Train your own SOTA deductive reasoning model
☆101 Updated 4 months ago
agokrani / distillKitPlus
Easy-to-use, high-performance knowledge distillation for LLMs
☆88 Updated 2 months ago
Pleias / Quest-Best-Tokens
An introduction to LLM Sampling
☆79 Updated 7 months ago
s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆55 Updated 5 months ago
QuixiAI / kraken
☆66 Updated last year
OpenEvaByte / evabyte
EvaByte: Efficient Byte-level Language Models at Scale
☆103 Updated 3 months ago
tiiuae / onebitllms
Lightweight toolkit package to train and fine-tune 1.58bit Language models
☆81 Updated 2 months ago
writer / writing-in-the-margins
☆118 Updated 11 months ago
nyunAI / PruneGPT
☆51 Updated last year
arcee-ai / DAM
☆53 Updated 8 months ago
reka-ai / rekaquant
☆58 Updated 3 weeks ago
xjdr-alt / llmri
look how they massacred my boy
☆63 Updated 9 months ago
QuixiAI / laserRMT
This is our own implementation of 'Layer Selective Rank Reduction'
☆239 Updated last year
JoeLi12345 / nGPT
an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)
☆103 Updated 4 months ago
davanstrien / haiku-dpo
Using open source LLMs to build synthetic datasets for direct preference optimization
☆65 Updated last year
teknium1 / LLM-Benchmark-Logs
Just a bunch of benchmark logs for different LLMs
☆119 Updated last year
kubernetes-bad / reward-composer
Lego for GRPO
☆28 Updated 2 months ago
migtissera / Sensei
Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI
☆222 Updated last year
euclaise / supertrainer2000
☆49 Updated last year
LAION-AI / AIW
Alice in Wonderland code base for experiments and raw experiments data
☆131 Updated last month
JD-P / minihf
MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user…
☆177 Updated 2 weeks ago