attentionmech / smolboxLinks

smolbox of recipies

☆28

Alternatives and similar repositories for smolbox

Users that are interested in smolbox are comparing it to the libraries listed below

Sorting:

JoeLi12345 / nGPT
an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)
☆107Updated 8 months ago
s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆58Updated last month
Mihaiii / backtrack_sampler
An easy-to-understand framework for LLM samplers that rewind and revise generated tokens
☆145Updated 9 months ago
casper-hansen / OpenCoconut
OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.
☆173Updated 10 months ago
VatsaDev / NanoPoor
NanoGPT-speedrunning for the poor T4 enjoyers
☆72Updated 7 months ago
SinatrasC / entropix-smollm
smolLM with Entropix sampler on pytorch
☆150Updated last year
joey00072 / ohara
Collection of autoregressive model implementation
☆86Updated 6 months ago
tensoic / Cerule
Cerule - A Tiny Mighty Vision Model
☆67Updated 2 weeks ago
sdan / selfextend
an implementation of Self-Extend, to expand the context window via grouped attention
☆119Updated last year
xjdr-alt / llmri
look how they massacred my boy
☆63Updated last year
Pleias / Quest-Best-Tokens
An introduction to LLM Sampling
☆79Updated 11 months ago
N8python / mlx-pretrain
A simple MLX implementation for pretraining LLMs on Apple Silicon.
☆84Updated 3 months ago
tiiuae / onebitllms
Lightweight toolkit package to train and fine-tune 1.58bit Language models
☆98Updated 6 months ago
minosvasilias / simple_grpo
Simple GRPO scripts and configurations.
☆59Updated 9 months ago
QuixiAI / grokadamw
☆136Updated last year
kubernetes-bad / reward-composer
Lego for GRPO
☆30Updated 5 months ago
euclaise / supertrainer2000
☆50Updated last year
RiddleHe / llm-interp
A collection of lightweight interpretability scripts to understand how LLMs think
☆66Updated this week
Locutusque / TPU-Alignment
Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free
☆231Updated last year
xjdr-alt / muzero_sketch
☆40Updated last year
geronimi73 / phi2-finetune
☆86Updated last year
teknium1 / LLM-Benchmark-Logs
Just a bunch of benchmark logs for different LLMs
☆119Updated last year
jxmorris12 / cde
code for training & evaluating Contextual Document Embedding models
☆200Updated 6 months ago
smolorg / smoltropix
MLX port for xjdr's entropix sampler (mimics jax implementation)
☆62Updated last year
OpenPipe / deductive-reasoning
Train your own SOTA deductive reasoning model
☆107Updated 8 months ago
AlexBodner / How_Much_VRAM
☆102Updated last year
nivibilla / build-nanogpt
Video+code lecture on building nanoGPT from scratch
☆68Updated last year
brendanhogan / picoDeepResearch
☆68Updated 6 months ago
NousResearch / StripedHyenaTrainer
☆62Updated last year
vicksEmmanuel / latent-gemma
☆26Updated 10 months ago