attentionmech / tensorlensLinks

aesthetic tensor visualiser

☆27

Alternatives and similar repositories for tensorlens

Users that are interested in tensorlens are comparing it to the libraries listed below

Sorting:

JoeLi12345 / nGPT
an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)
☆108Updated 9 months ago
RiddleHe / llm-interp
A collection of lightweight interpretability scripts to understand how LLMs think
☆68Updated last week
VatsaDev / NanoPoor
NanoGPT-speedrunning for the poor T4 enjoyers
☆73Updated 7 months ago
N8python / mlx-pretrain
A simple MLX implementation for pretraining LLMs on Apple Silicon.
☆84Updated 3 months ago
xjdr-alt / llmri
look how they massacred my boy
☆63Updated last year
okarthikb / state-space-models
☆28Updated last year
Pleias / Quest-Best-Tokens
An introduction to LLM Sampling
☆79Updated 11 months ago
joey00072 / ohara
Collection of autoregressive model implementation
☆85Updated 7 months ago
s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆59Updated last month
facebookresearch / llm-speedrunner
The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…
☆112Updated last month
QuixiAI / grokadamw
☆136Updated last year
joey00072 / Multi-Head-Latent-Attention-MLA-
working implimention of deepseek MLA
☆45Updated 11 months ago
joey00072 / Attention-as-graph
alternative way to calculating self attention
☆18Updated last year
kmohan321 / Research_Papers
☆46Updated 8 months ago
NousResearch / StripedHyenaTrainer
☆62Updated last year
tanaymeh / mamba-train
A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM
☆61Updated last year
SinatrasC / entropix
Entropy Based Sampling and Parallel CoT Decoding
☆17Updated last year
minosvasilias / simple_grpo
Simple GRPO scripts and configurations.
☆59Updated 10 months ago
tyler-romero / microR1
Simple repository for training small reasoning models
☆46Updated 10 months ago
Think-a-Tron / evolve
open source alpha evolve
☆67Updated 6 months ago
doomslide / hyperobject
Plotting (entropy, varentropy) for small LMs
☆99Updated 6 months ago
rosmineb / unit_test_rl
Project code for training LLMs to write better unit tests + code
☆21Updated 6 months ago
smolorg / smoltropix
MLX port for xjdr's entropix sampler (mimics jax implementation)
☆62Updated last year
xjdr-alt / muzero_sketch
☆40Updated last year
tiiuae / onebitllms
Lightweight toolkit package to train and fine-tune 1.58bit Language models
☆100Updated 6 months ago
usamec / lowmem_finetuning
Low memory full parameter finetuning of LLMs
☆54Updated 4 months ago
jfpuget / ARC-AGI-Challenge-2024
☆56Updated last year
HarleyCoops / smolThinker-.5B
A Qwen .5B reasoning model trained on OpenR1-Math-220k
☆14Updated last month
AtakanTekparmak / agento
Very minimal (and stateless) agent framework
☆44Updated 10 months ago
attentionmech / smolbox
smolbox of recipies
☆28Updated 7 months ago