xjdr-alt / llmri

look how they massacred my boy

☆63

Alternatives and similar repositories for llmri

Users who are interested in llmri are comparing it to the libraries listed below.

xjdr-alt / muzero_sketch
☆40 Updated last year
smolorg / smoltropix
MLX port for xjdr's entropix sampler (mimics jax implementation)
☆62 Updated last year
doomslide / hyperobject
Plotting (entropy, varentropy) for small LMs
☆99 Updated 7 months ago
SinatrasC / entropix-smollm
smolLM with Entropix sampler on pytorch
☆149 Updated last year
brendanhogan / picoDeepResearch
☆68 Updated 7 months ago
s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆59 Updated 2 months ago
samefarrar / entropix_mlx
Modify Entropy Based Sampling to work with Mac Silicon via MLX
☆49 Updated last year
JoeLi12345 / nGPT
an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)
☆108 Updated 9 months ago
JD-P / RetroInstruct
Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.
☆32 Updated 2 months ago
N8python / n8loom
A tree-based prefix cache library that allows rapid creation of looms: hierarchal branching pathways of LLM generations.
☆77 Updated 10 months ago
Pleias / Quest-Best-Tokens
An introduction to LLM Sampling
☆79 Updated last year
xjdr-alt / simple_transformer
Simple Transformer in Jax
☆141 Updated last year
SinatrasC / entropix
Entropy Based Sampling and Parallel CoT Decoding
☆17 Updated last year
haizelabs / j1-micro
j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.
☆99 Updated 5 months ago
OpenPipe / deductive-reasoning
Train your own SOTA deductive reasoning model
☆107 Updated 9 months ago
doomslide / autoloom
Approximating the joint distribution of language models via MCTS
☆22 Updated last year
doomslide / attention-graph
A graph visualization of attention
☆57 Updated 7 months ago
SpellcraftAI / turing
Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.
☆58 Updated last year
N8python / mlx-pretrain
A simple MLX implementation for pretraining LLMs on Apple Silicon.
☆84 Updated 4 months ago
rosmineb / unit_test_rl
Project code for training LLMs to write better unit tests + code
☆21 Updated 7 months ago
QuixiAI / grokadamw
☆136 Updated last year
kubernetes-bad / reward-composer
Lego for GRPO
☆30 Updated 6 months ago
teknium1 / transformers-gptq-quant
☆45 Updated 2 years ago
Mihaiii / backtrack_sampler
An easy-to-understand framework for LLM samplers that rewind and revise generated tokens
☆150 Updated 10 months ago
xjdr-alt / entropix-trainer
train entropix like a champ!
☆20 Updated last year
minosvasilias / simple_grpo
Simple GRPO scripts and configurations.
☆59 Updated 10 months ago
vgel / logitloom
explore token trajectory trees on instruct and base models
☆149 Updated 6 months ago
VatsaDev / NanoPoor
NanoGPT-speedrunning for the poor T4 enjoyers
☆73 Updated 8 months ago
jxmorris12 / embzip
lossily compress representation vectors using product quantization
☆59 Updated last month
casper-hansen / OpenCoconut
OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.
☆174 Updated 11 months ago