Noumena-Network / NSA-TestLinks

NSA Triton Kernels written with GPT5 and Opus 4.1

☆65

Alternatives and similar repositories for NSA-Test

Users that are interested in NSA-Test are comparing it to the libraries listed below

Sorting:

HazyResearch / cartridges
Storing long contexts in tiny caches with self-study
☆218Updated last month
xjdr-alt / llmri
look how they massacred my boy
☆63Updated last year
PrimeIntellect-ai / pi-quant
SIMD quantization kernels
☆92Updated 2 months ago
facebookresearch / llm-speedrunner
The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…
☆112Updated last month
tokenbender / avataRL
rl from zero pretrain, can it be done? yes.
☆281Updated 2 months ago
SinatrasC / entropix-smollm
smolLM with Entropix sampler on pytorch
☆149Updated last year
s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆59Updated last month
xjdr-alt / simple_transformer
Simple Transformer in Jax
☆139Updated last year
PrimeIntellect-ai / pccl
PCCL (Prime Collective Communications Library) implements fault tolerant collective communications over IP
☆138Updated 2 months ago
JoeLi12345 / nGPT
an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)
☆108Updated 8 months ago
xjdr-alt / entropix-trainer
train entropix like a champ!
☆20Updated last year
xjdr-alt / muzero_sketch
☆40Updated last year
Zyphra / tree_attention
Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters
☆130Updated last year
kubernetes-bad / reward-composer
Lego for GRPO
☆30Updated 6 months ago
Alex-Gurung / ReasoningNCP
Official repo for Learning to Reason for Long-Form Story Generation
☆72Updated 7 months ago
LeonGuertler / UnstableBaselines
☆106Updated last month
SinatrasC / entropix
Entropy Based Sampling and Parallel CoT Decoding
☆17Updated last year
cloneofsimo / ptx-tutorial-by-aislop
PTX-Tutorial Written Purely By AIs (Deep Research of Openai and Claude 3.7)
☆66Updated 8 months ago
HazyResearch / train-tk
train with kittens!
☆63Updated last year
Amplify-Partners / annotation-reading-list
A reading list of relevant papers and projects on foundation model annotation
☆28Updated 9 months ago
brendanhogan / picoDeepResearch
☆68Updated 6 months ago
doomslide / hyperobject
Plotting (entropy, varentropy) for small LMs
☆99Updated 6 months ago
nano-R1 / resources
Compiling useful links, papers, benchmarks, ideas, etc.
☆45Updated 8 months ago
VatsaDev / NanoPoor
NanoGPT-speedrunning for the poor T4 enjoyers
☆73Updated 7 months ago
magicproduct / hash-hop
Long context evaluation for large language models
☆224Updated 9 months ago
bloc97 / DeMo
DeMo: Decoupled Momentum Optimization
☆197Updated last year
haizelabs / j1-micro
j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.
☆99Updated 4 months ago
tyler-romero / microR1
Simple repository for training small reasoning models
☆46Updated 9 months ago
xjdr-alt / mla_blog_translation
☆13Updated last year
Pleias / Quest-Best-Tokens
An introduction to LLM Sampling
☆79Updated 11 months ago