dingo-actual / omLinks

An LLM architecture utilizing a recurrent structure and multi-layer memory

☆13

Alternatives and similar repositories for om

Users that are interested in om are comparing it to the libraries listed below

Sorting:

joey00072 / ohara
Collection of autoregressive model implementation
☆86Updated 3 months ago
SinatrasC / entropix-smollm
smolLM with Entropix sampler on pytorch
☆150Updated 8 months ago
QuixiAI / grokadamw
☆134Updated 11 months ago
VatsaDev / NanoPoor
NanoGPT-speedrunning for the poor T4 enjoyers
☆68Updated 3 months ago
JD-P / minihf
MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user…
☆177Updated last week
casper-hansen / OpenCoconut
OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.
☆173Updated 6 months ago
main-horse / hnet
H-Net Dynamic Hierarchical Architecture
☆55Updated this week
SalesforceAIResearch / LaTRO
☆117Updated 5 months ago
kyleliang919 / Online-Subspace-Descent
[NeurIPS 2024] Low rank memory efficient optimizer without SVD
☆30Updated 3 weeks ago
Alex-Gurung / ReasoningNCP
Official repo for Learning to Reason for Long-Form Story Generation
☆67Updated 3 months ago
Mihaiii / backtrack_sampler
An easy-to-understand framework for LLM samplers that rewind and revise generated tokens
☆140Updated 5 months ago
google-deepmind / mishax
☆134Updated 3 months ago
kubernetes-bad / reward-composer
Lego for GRPO
☆28Updated last month
SinatrasC / entropix
Entropy Based Sampling and Parallel CoT Decoding
☆17Updated 9 months ago
JoeLi12345 / nGPT
an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)
☆103Updated 4 months ago
CLAIRE-Labo / quantile-reward-policy-optimization
Official codebase for "Quantile Reward Policy Optimization: Alignment with Pointwise Regression and Exact Partition Functions" (Matrenok …
☆20Updated last week
JackCai1206 / arithmetic-self-improve
☆34Updated 5 months ago
smolorg / smoltropix
MLX port for xjdr's entropix sampler (mimics jax implementation)
☆64Updated 8 months ago
minosvasilias / simple_grpo
Simple GRPO scripts and configurations.
☆59Updated 5 months ago
xjdr-alt / llmri
look how they massacred my boy
☆63Updated 9 months ago
doomslide / hyperobject
Plotting (entropy, varentropy) for small LMs
☆97Updated 2 months ago
xjdr-alt / entropix-local
smol models are fun too
☆93Updated 8 months ago
kyleliang919 / Super_Muon
☆59Updated 4 months ago
QuixiAI / spectrum
☆128Updated 3 months ago
OpenPipe / deductive-reasoning
Train your own SOTA deductive reasoning model
☆100Updated 4 months ago
tokenbender / avataRL
rl from zero pretrain, can it be done? we'll see.
☆65Updated this week
SeunghyunSEO / optimized_hf_llama_class_for_training
☆48Updated 10 months ago
jerber / lang-jepa
☆117Updated 7 months ago
euclaise / supertrainer2000
☆49Updated last year
OpenEvaByte / evabyte
EvaByte: Efficient Byte-level Language Models at Scale
☆103Updated 3 months ago