brantondemoss / GrokkingComplexity

Code for

☆27

Alternatives and similar repositories for GrokkingComplexity

Users who are interested in GrokkingComplexity are comparing it to the repositories listed below.

tyler-romero / microR1
Simple repository for training small reasoning models
☆32 Updated 6 months ago
joey00072 / microjax
JAX-like function transformation engine, but micro (microjax)
☆33 Updated 9 months ago
facebookresearch / llm-speedrunner
The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…
☆94 Updated last week
lucidrains / grokfast-pytorch
Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"
☆101 Updated 7 months ago
epfml / DenseFormer
☆81 Updated last year
SHI-Labs / CompactNet
☆31 Updated last year
CLAIRE-Labo / EvoTune
Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.
☆104 Updated 3 weeks ago
dvruette / barrel-rec-pytorch
☆53 Updated last year
google-deepmind / spectral_ssm
☆33 Updated last year
LucasPrietoAl / grokking-at-the-edge-of-numerical-stability
☆100 Updated 2 weeks ago
minosvasilias / simple_grpo
Simple GRPO scripts and configurations.
☆59 Updated 6 months ago
ExtensityAI / benchmark
Evaluation of neuro-symbolic engines
☆38 Updated last year
vvvm23 / mamba-jax
Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX
☆85 Updated last year
RobertCsordas / moeut
☆83 Updated 11 months ago
iliao2345 / CompressARC
☆172 Updated 3 months ago
amirzandieh / HyperAttention
Triton Implementation of HyperAttention Algorithm
☆48 Updated last year
data-for-agents / insta
Official Repo for InSTA: Towards Internet-Scale Training For Agents
☆52 Updated 3 weeks ago
MadryLab / platinum-benchmarks
☆29 Updated 3 months ago
Aleph-Alpha-Research / trigrams
☆56 Updated 2 months ago
bloc97 / DeMo
DeMo: Decoupled Momentum Optimization
☆190 Updated 8 months ago
Alex-Gurung / ReasoningNCP
Official repo for Learning to Reason for Long-Form Story Generation
☆68 Updated 3 months ago
lucidrains / mind-evolution
Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from Deepmind
☆56 Updated 2 months ago
idiap / sigma-gpt
σ-GPT: A New Approach to Autoregressive Models
☆67 Updated 11 months ago
TRI-ML / linear_open_lm
A repository for research on medium sized language models.
☆78 Updated last year
Qualcomm-AI-research / codeit
☆27 Updated last year
shreyansh26 / Attention-Mask-Patterns
Using FlexAttention to compute attention with different masking patterns
☆44 Updated 10 months ago
jfpuget / ARC-AGI-Challenge-2024
☆56 Updated 8 months ago
mcleish7 / arithmetic
Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024)
☆190 Updated last year
likenneth / q_probe
Q-Probe: A Lightweight Approach to Reward Maximization for Language Models
☆41 Updated last year
kanishkg / stream-of-search
Repository for the paper Stream of Search: Learning to Search in Language
☆149 Updated 6 months ago