stockeh / mlx-grokking

Grokking on modular arithmetic in less than 150 epochs in MLX

☆14

Alternatives and similar repositories for mlx-grokking

Users who are interested in mlx-grokking are comparing it to the repositories listed below.

okarthikb / state-space-models
☆27 Updated last year
PrimeIntellect-ai / pccl
PCCL (Prime Collective Communications Library) implements fault tolerant collective communications over IP
☆99 Updated 3 weeks ago
xjdr-alt / simple_transformer
Simple Transformer in Jax
☆138 Updated last year
main-horse / hnet
H-Net Dynamic Hierarchical Architecture
☆65 Updated 2 weeks ago
PrimeIntellect-ai / pi-quant
SIMD quantization kernels
☆78 Updated this week
JoeLi12345 / nGPT
an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)
☆103 Updated 5 months ago
xjdr-alt / mla_blog_translation
☆13 Updated last year
joey00072 / microjax
Jax like function transformation engine but micro, microjax
☆33 Updated 9 months ago
leloykun / modded-nanogpt
NanoGPT (124M) quality in 2.67B tokens
☆28 Updated last month
VatsaDev / NanoPoor
NanoGPT-speedrunning for the poor T4 enjoyers
☆69 Updated 3 months ago
evanatyourservice / llm-jax
Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.
☆19 Updated 2 weeks ago
PrimeIntellect-ai / smart-contracts
Solidity contracts for the decentralized Prime Network protocol
☆24 Updated last month
SpellcraftAI / turing
Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.
☆58 Updated last year
xjdr-alt / llmri
look how they massacred my boy
☆63 Updated 9 months ago
xjdr-alt / muzero_sketch
☆38 Updated last year
xjdr-alt / entropix-trainer
train entropix like a champ!
☆19 Updated 9 months ago
bloc97 / DeMo
DeMo: Decoupled Momentum Optimization
☆190 Updated 8 months ago
nano-R1 / resources
Compiling useful links, papers, benchmarks, ideas, etc.
☆45 Updated 4 months ago
SonicCodes / lucid-v1
realtime latent world model inference demo
☆47 Updated 8 months ago
tokenbender / avataRL
rl from zero pretrain, can it be done? we'll see.
☆66 Updated 2 weeks ago
goodfire-ai / sdxl-turbo-interpretability
☆42 Updated 2 months ago
tyler-romero / microR1
Simple repository for training small reasoning models
☆32 Updated 6 months ago
matttreed / diloco-sim
☆19 Updated 7 months ago
attentionmech / tensorlens
aesthetic tensor visualiser
☆24 Updated 3 months ago
N8python / n8loom
A tree-based prefix cache library that allows rapid creation of looms: hierarchical branching pathways of LLM generations.
☆72 Updated 5 months ago
PrimeIntellect-ai / prime-vllm
Modded vLLM to run pipeline parallelism over public networks
☆37 Updated 2 months ago
minosvasilias / simple_grpo
Simple GRPO scripts and configurations.
☆59 Updated 6 months ago
clement-bonnet / lpn
Latent Program Network (from the "Searching Latent Program Spaces" paper)
☆93 Updated 4 months ago
ethansmith2000 / TransformerExperiments
☆19 Updated 2 months ago
ethansmith2000 / fsdp_optimizers
supporting pytorch FSDP for optimizers
☆84 Updated 8 months ago