HazyResearch / cartridgesLinks

Storing long contexts in tiny caches with self-study

☆218

Alternatives and similar repositories for cartridges

Users that are interested in cartridges are comparing it to the libraries listed below

Sorting:

microsoft / ArchScale
Simple & Scalable Pretraining for Neural Architecture Research
☆302Updated last month
casper-hansen / OpenCoconut
OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.
☆173Updated 10 months ago
tokenbender / avataRL
rl from zero pretrain, can it be done? yes.
☆281Updated 2 months ago
OpenEvaByte / evabyte
EvaByte: Efficient Byte-level Language Models at Scale
☆111Updated 7 months ago
haizelabs / j1-micro
j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.
☆99Updated 4 months ago
alexzhang13 / rlm
Super basic implementation (gist-like) of RLMs with REPL environments.
☆273Updated last month
PrimeIntellect-ai / prime-environments
Training-Ready RL Environments + Evals
☆182Updated this week
Alex-Gurung / ReasoningNCP
Official repo for Learning to Reason for Long-Form Story Generation
☆72Updated 7 months ago
ScalingIntelligence / Archon
Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.
☆189Updated 8 months ago
google-deepmind / mishax
☆143Updated 2 months ago
jxmorris12 / cde
code for training & evaluating Contextual Document Embedding models
☆201Updated 6 months ago
LeonGuertler / UnstableBaselines
☆106Updated last month
pyember / ember
☆234Updated 5 months ago
magicproduct / hash-hop
Long context evaluation for large language models
☆224Updated 9 months ago
character-ai / pipelining-sft
Simple and efficient DeepSeek V3 SFT using pipeline parallel and expert parallel, with both FP8 and BF16 trainings
☆98Updated 4 months ago
HazyResearch / lolcats
Repo for "LoLCATs: On Low-Rank Linearizing of Large Language Models"
☆249Updated 10 months ago
facebookresearch / llm-speedrunner
The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…
☆112Updated last month
Danau5tin / calculator_agent_rl
Training an LLM to use a calculator with multi-turn reinforcement learning, achieving a **62% absolute increase in evaluation accuracy**.
☆60Updated 6 months ago
OpenPipe / deductive-reasoning
Train your own SOTA deductive reasoning model
☆107Updated 8 months ago
PrimeIntellect-ai / genesys
☆136Updated 8 months ago
Zyphra / tree_attention
Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters
☆130Updated last year
JoeLi12345 / nGPT
an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)
☆108Updated 8 months ago
ServiceNow / PipelineRL
A scalable asynchronous reinforcement learning implementation with in-flight weight updates.
☆316Updated this week
Noumena-Network / NSA-Test
NSA Triton Kernels written with GPT5 and Opus 4.1
☆65Updated 3 months ago
s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆59Updated last month
VatsaDev / NanoPoor
NanoGPT-speedrunning for the poor T4 enjoyers
☆73Updated 7 months ago
ScalingIntelligence / tokasaurus
☆456Updated last week
brendanhogan / picoDeepResearch
☆68Updated 6 months ago
xjdr-alt / llmri
look how they massacred my boy
☆63Updated last year
ScalingIntelligence / codemonkeys
☆59Updated 10 months ago