strangeloopcanon / LLMRank

PageRank for LLMs

☆43

Alternatives and similar repositories for LLMRank

Users who are interested in LLMRank are comparing it to the repositories listed below.

Pleias / Quest-Best-Tokens
An introduction to LLM Sampling
☆79 Updated 7 months ago
xjdr-alt / muzero_sketch
☆38 Updated 11 months ago
xjdr-alt / llmri
look how they massacred my boy
☆63 Updated 9 months ago
JD-P / RetroInstruct
Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.
☆32 Updated 4 months ago
euclaise / supertrainer2000
☆49 Updated last year
Mihaiii / backtrack_sampler
An easy-to-understand framework for LLM samplers that rewind and revise generated tokens
☆140 Updated 4 months ago
raphaelsty / LeNLP
NLP with Rust for Python 🦀🐍
☆63 Updated 2 months ago
SinatrasC / entropix-smollm
smolLM with Entropix sampler on pytorch
☆150 Updated 8 months ago
VatsaDev / NanoPoor
NanoGPT-speedrunning for the poor T4 enjoyers
☆68 Updated 2 months ago
ahstat / episodic-memory-benchmark
Synthetic data generation and benchmark implementation for "Episodic Memories Generation and Evaluation Benchmark for Large Language Mode…
☆48 Updated 3 months ago
Ziems / arbor
A framework for optimizing DSPy programs with RL
☆89 Updated this week
xjdr-alt / simple_transformer
Simple Transformer in Jax
☆138 Updated last year
enjalot / latent-sae
Training code for Sparse Autoencoders on Embedding models
☆38 Updated 4 months ago
AnswerDotAI / ModernBERT-Instruct-mini-cookbook
☆48 Updated 5 months ago
s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆55 Updated 5 months ago
teknium1 / transformers-gptq-quant
☆47 Updated last year
minosvasilias / simple_grpo
Simple GRPO scripts and configurations.
☆59 Updated 5 months ago
AnswerDotAI / fastkmeans
☆62 Updated last week
AblateIt / finetune-study
Comprehensive analysis of the difference in performance of QLoRA, LoRA, and full fine-tunes.
☆82 Updated last year
haizelabs / j1-micro
j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.
☆91 Updated last month
thomasnormal / fewshot
☆28 Updated 3 weeks ago
Alex-Gurung / ReasoningNCP
Official repo for Learning to Reason for Long-Form Story Generation
☆65 Updated 2 months ago
JoeLi12345 / nGPT
an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)
☆101 Updated 4 months ago
google-deepmind / mishax
☆134 Updated 3 months ago
tokenbender / avataRL
rl from zero pretrain, can it be done? we'll see.
☆63 Updated 3 weeks ago
Aleph-Alpha-Research / scaling
Scaling is a distributed training library and installable dependency designed to scale up neural networks, with a dedicated module for tr…
☆62 Updated 8 months ago
jerber / lang-jepa
☆116 Updated 6 months ago
nano-R1 / resources
Compiling useful links, papers, benchmarks, ideas, etc.
☆45 Updated 4 months ago
jxmorris12 / cde
code for training & evaluating Contextual Document Embedding models
☆194 Updated 2 months ago
doomslide / autoloom
Approximating the joint distribution of language models via MCTS
☆21 Updated 8 months ago