brantondemoss / GrokkingComplexity
Code for
☆27Updated 4 months ago
Alternatives and similar repositories for GrokkingComplexity:
Users that are interested in GrokkingComplexity are comparing it to the libraries listed below
- ☆31Updated last year
- Official Code Release for "Training a Generally Curious Agent"☆20Updated last month
- Official repository for the paper "Approximating Two-Layer Feedforward Networks for Efficient Transformers"☆37Updated last year
- Memory Mosaics are networks of associative memories working in concert to achieve a prediction task.☆41Updated 3 months ago
- A repository for research on medium sized language models.☆76Updated 11 months ago
- ☆53Updated last year
- Code accompanying the paper "A Language Model's Guide Through Latent Space". It contains functionality for training and using concept vec…☆19Updated last year
- LLM training in simple, raw C/CUDA☆14Updated 5 months ago
- ☆94Updated 3 months ago
- Jax like function transformation engine but micro, microjax☆31Updated 6 months ago
- Evaluation of neuro-symbolic engines☆35Updated 9 months ago
- ☆31Updated last year
- ☆81Updated last year
- ☆54Updated 8 months ago
- Implementation of Spectral State Space Models☆16Updated last year
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [to appear at ICLR 2025]☆19Updated last month
- This repo is based on https://github.com/jiaweizzhao/GaLore☆27Updated 7 months ago
- ☆59Updated last month
- Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.☆68Updated 2 weeks ago
- Official repo of paper LM2☆39Updated 2 months ago
- ☆52Updated 5 months ago
- Combining SOAP and MUON☆16Updated 2 months ago
- ☆53Updated 7 months ago
- ☆78Updated 8 months ago
- Collection of LLM completions for reasoning-gym task datasets☆19Updated last week
- ☆38Updated 9 months ago
- ☆25Updated last year
- Source code for the paper "Positional Attention: Out-of-Distribution Generalization and Expressivity for Neural Algorithmic Reasoning"☆14Updated 3 months ago
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"☆99Updated 4 months ago
- ☆52Updated 11 months ago