brantondemoss / GrokkingComplexityLinks
Code for
☆27Updated 6 months ago
Alternatives and similar repositories for GrokkingComplexity
Users that are interested in GrokkingComplexity are comparing it to the libraries listed below
Sorting:
- Official repository for the paper "Approximating Two-Layer Feedforward Networks for Efficient Transformers"☆38Updated 2 weeks ago
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [to appear at ICLR 2025]☆19Updated last month
- ☆32Updated last year
- ☆53Updated 8 months ago
- Simple repository for training small reasoning models☆33Updated 4 months ago
- Evaluation of neuro-symbolic engines☆35Updated 10 months ago
- ☆79Updated 10 months ago
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"☆101Updated 6 months ago
- Simple GRPO scripts and configurations.☆58Updated 4 months ago
- ☆31Updated last year
- ☆81Updated last year
- ☆20Updated last week
- This repo is based on https://github.com/jiaweizzhao/GaLore☆28Updated 9 months ago
- Triton Implementation of HyperAttention Algorithm☆48Updated last year
- Code accompanying the paper "A Language Model's Guide Through Latent Space". It contains functionality for training and using concept vec…☆20Updated last year
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆17Updated 3 months ago
- Esoteric Language Models☆77Updated last week
- Official repo of paper LM2☆41Updated 4 months ago
- ☆98Updated 5 months ago
- ☆38Updated 11 months ago
- Python package for generating datasets to evaluate reasoning and retrieval of large language models☆18Updated this week
- ☆27Updated 11 months ago
- ☆26Updated last year
- ☆52Updated last year
- Understanding the correlation between different LLM benchmarks☆29Updated last year
- ☆56Updated last month
- Official Code Release for "Training a Generally Curious Agent"☆25Updated last month
- ☆53Updated last year
- Using FlexAttention to compute attention with different masking patterns☆44Updated 9 months ago
- Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.☆103Updated 2 months ago