jerber / lang-jepa

☆115

Alternatives and similar repositories for lang-jepa

Users who are interested in lang-jepa are comparing it to the libraries listed below.

casper-hansen / OpenCoconut
OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.
☆173 Updated 5 months ago
SinatrasC / entropix-smollm
smolLM with Entropix sampler on PyTorch
☆150 Updated 8 months ago
doomslide / hyperobject
Plotting (entropy, varentropy) for small LMs
☆97 Updated last month
goodfire-ai / r1-interpretability
Open source interpretability artefacts for R1.
☆153 Updated 2 months ago
OpenPipe / deductive-reasoning
Train your own SOTA deductive reasoning model
☆96 Updated 4 months ago
xjdr-alt / entropix-local
smol models are fun too
☆93 Updated 8 months ago
JoeLi12345 / nGPT
an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)
☆101 Updated 4 months ago
xjdr-alt / simple_transformer
Simple Transformer in JAX
☆138 Updated last year
xjdr-alt / llmri
look how they massacred my boy
☆63 Updated 8 months ago
ekinakyurek / marc
Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"
☆318 Updated 7 months ago
mcleish7 / arithmetic
Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al. (NeurIPS 2024)
☆190 Updated last year
SakanaAI / evo-memory
Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.
☆314 Updated 8 months ago
PrimeIntellect-ai / genesys
☆128 Updated 3 months ago
facebookresearch / ExploreToM
Code for ExploreToM
☆84 Updated 2 weeks ago
LeonGuertler / UnstableBaselines
☆82 Updated last week
OSU-NLP-Group / GrokkedTransformer
Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'
☆220 Updated 7 months ago
google-deepmind / mishax
☆134 Updated 3 months ago
SinatrasC / entropix
Entropy Based Sampling and Parallel CoT Decoding
☆17 Updated 9 months ago
rgreenblatt / arc_draw_more_samples_pub
Draw more samples
☆192 Updated last year
iliao2345 / CompressARC
☆164 Updated 3 months ago
LeonGuertler / TextArena
A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning
☆207 Updated this week
joshuacnf / Ctrl-G
☆86 Updated 6 months ago
jerber / arc_agi
☆55 Updated this week
clement-bonnet / lpn
Latent Program Network (from the "Searching Latent Program Spaces" paper)
☆91 Updated 4 months ago
bloc97 / DeMo
DeMo: Decoupled Momentum Optimization
☆189 Updated 7 months ago
arcprize / arc-agi-benchmarking
Testing baseline LLM performance across various models
☆278 Updated 3 weeks ago
brendanhogan / DeepSeekRL-Extended
Exploring Applications of GRPO
☆240 Updated this week
kanishkg / stream-of-search
Repository for the paper Stream of Search: Learning to Search in Language
☆149 Updated 5 months ago
OpenEvaByte / evabyte
EvaByte: Efficient Byte-level Language Models at Scale
☆103 Updated 2 months ago
tokenbender / avataRL
rl from zero pretrain, can it be done? we'll see.
☆65 Updated 2 weeks ago