Pleias/Quest-Best-Tokens

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Pleias/Quest-Best-Tokens)

Pleias / Quest-Best-Tokens

An introduction to LLM Sampling

☆80

Alternatives and similar repositories for Quest-Best-Tokens

Users that are interested in Quest-Best-Tokens are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

SinatrasC / entropix-smollm
View on GitHub
smolLM with Entropix sampler on pytorch
☆148Oct 31, 2024Updated last year
SinatrasC / entropix
View on GitHub
Entropy Based Sampling and Parallel CoT Decoding
☆17Oct 9, 2024Updated last year
xjdr-alt / llmri
View on GitHub
look how they massacred my boy
☆63Oct 16, 2024Updated last year
enjalot / latent-sae
View on GitHub
Training code for Sparse Autoencoders on Embedding models
☆39Jul 11, 2026Updated 2 weeks ago
doomslide / attention-graph
View on GitHub
A graph visualization of attention
☆56May 20, 2025Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
showlab / GUI-Narrator
View on GitHub
Repository of GUI Action Narrator
☆13Apr 8, 2025Updated last year
strangeloopcanon / tevo
View on GitHub
TEVO: evolve LM motifs cheaply, then validate them in downstream train.py loops.
☆19Apr 18, 2026Updated 3 months ago
cloneofsimo / minSAE
View on GitHub
☆30Dec 2, 2024Updated last year
Mihaiii / backtrack_sampler
View on GitHub
An easy-to-understand framework for LLM samplers that rewind and revise generated tokens
☆151Jan 7, 2026Updated 6 months ago
opinionscience / BERTransfer
View on GitHub
A BERT-based application for reusable text classification at scale
☆37Jul 23, 2023Updated 3 years ago
Noumena-Network / NSA-Test
View on GitHub
NSA Triton Kernels written with GPT5 and Opus 4.1
☆70Aug 12, 2025Updated 11 months ago
Pleias / marginalia
View on GitHub
☆67Mar 4, 2024Updated 2 years ago
jxmorris12 / cde
View on GitHub
code for training & evaluating Contextual Document Embedding models
☆207May 14, 2025Updated last year
xjdr-alt / simple_transformer
View on GitHub
Simple Transformer in Jax
☆143Jun 22, 2024Updated 2 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
haizelabs / j1-micro
View on GitHub
j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.
☆105Jul 19, 2025Updated last year
cloneofsimo / efae
View on GitHub
☆24Jun 18, 2024Updated 2 years ago
huggingface / wikirace-llms
View on GitHub
☆27May 7, 2025Updated last year
YuchenJin / llm.c
View on GitHub
LLM training in simple, raw C/CUDA
☆15Dec 5, 2024Updated last year
PrimeIntellect-ai / lab-cookbook
View on GitHub
Lab Cookbook
☆38Updated this week
xjdr-alt / entropix
View on GitHub
Entropy Based Sampling and Parallel CoT Decoding
☆3,431Nov 13, 2024Updated last year
SapienzaNLP / bookcoref
View on GitHub
Repository of the ACL 2025 main conference paper "BOOKCOREF: Coreference Resolution at Book Scale"
☆19Sep 1, 2025Updated 10 months ago
U-C4N / Deepseek-CoT
View on GitHub
Deepseek-CoT
☆10Oct 6, 2024Updated last year
Noumena-Network / nmoe
View on GitHub
MoE training for Me and You and maybe other people
☆395Mar 15, 2026Updated 4 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
jxbz / entropix
View on GitHub
📰 Computing the information content of trained neural networks
☆24Oct 8, 2021Updated 4 years ago
AnswerDotAI / ModernBERT-Instruct-mini-cookbook
View on GitHub
☆53Feb 10, 2025Updated last year
WolframRavenwolf / MMLU-Pro
View on GitHub
MMLU-Pro eval results
☆15Aug 21, 2025Updated 11 months ago
ymym3412 / Hydra-MLflow-experiment-management
View on GitHub
Experiment management with Hydra and MLflow
☆13Nov 20, 2020Updated 5 years ago
stephantul / skeletoken
View on GitHub
Datamodels for hugging face tokenizers
☆109Updated this week
mkurman / grpo-llm-evaluator
View on GitHub
Fine-tunes a student LLM using teacher feedback for improved reasoning and answer quality. Implements GRPO with teacher-provided evaluati…
☆54May 7, 2025Updated last year
Pleias / Pleias-RAG-Library
View on GitHub
Python library to use Pleias-RAG models
☆72Jul 1, 2026Updated 3 weeks ago
JD-P / RetroInstruct
View on GitHub
Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.
☆34Oct 8, 2025Updated 9 months ago
vicksEmmanuel / latent-gemma
View on GitHub
☆27Jan 14, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
alexzhang13 / longcot-mini-rlm-results
View on GitHub
Storing the LongCoT-mini results for RLM(GPT-5.2)
☆20Apr 26, 2026Updated 3 months ago
xjdr-alt / muzero_sketch
View on GitHub
☆40Jul 26, 2024Updated 2 years ago
graphcore-research / pytorch-tensor-tracker
View on GitHub
Flexibly track outputs and grad-outputs of torch.nn.Module.
☆13Oct 6, 2023Updated 2 years ago
doomslide / hyperobject
View on GitHub
Plotting (entropy, varentropy) for small LMs
☆99May 20, 2025Updated last year
Essential-AI / eai-taxonomy
View on GitHub
☆59Aug 19, 2025Updated 11 months ago
Mayank-Jain-1 / StreetFighter
View on GitHub
☆16Feb 13, 2026Updated 5 months ago
pradeesi / Paho-MQTT-with-Python
View on GitHub
This is sample code for Paho MQTT server with Python 2.7
☆10Mar 29, 2016Updated 10 years ago