KhoomeiK / complexity-scaling
gzip Predicts Data-dependent Scaling Laws
☆32 Updated 5 months ago
Related projects
Alternatives and complementary repositories for complexity-scaling
- ☆22 Updated last year
- ☆27 Updated 4 months ago
- A MAD laboratory to improve AI architecture designs 🧪 ☆95 Updated 6 months ago
- ☆24 Updated 7 months ago
- ☆101 Updated 3 months ago
- ☆57 Updated 11 months ago
- Sparse and discrete interpretability tool for neural networks ☆55 Updated 9 months ago
- Code for minimum-entropy coupling. ☆30 Updated 4 months ago
- ☆36 Updated 3 months ago
- ☆109 Updated this week
- Understanding how features learned by neural networks evolve throughout training ☆31 Updated last month
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods. ☆29 Updated 3 weeks ago
- Genetics for Language Models ☆12 Updated 4 months ago
- LLM training in simple, raw C/CUDA ☆12 Updated last month
- ☆19 Updated 7 months ago
- ☆26 Updated last year
- ☆46 Updated last month
- Proof-of-concept of global switching between numpy/jax/pytorch in a library. ☆18 Updated 5 months ago
- Jax like function transformation engine but micro, microjax ☆26 Updated 3 weeks ago
- Functional Benchmarks and the Reasoning Gap ☆78 Updated last month
- Engineering the state of RNN language models (Mamba, RWKV, etc.) ☆32 Updated 5 months ago
- A place to store reusable transformer components of my own creation or found on the interwebs ☆44 Updated 2 weeks ago
- Evaluation of neuro-symbolic engines ☆33 Updated 3 months ago
- ☆50 Updated 6 months ago
- Arrakis is a library to conduct, track and visualize mechanistic interpretability experiments. ☆20 Updated 3 months ago
- Muon optimizer for neural networks: >30% extra sample efficiency, <3% wallclock overhead ☆121 Updated this week
- ☆77 Updated 7 months ago
- ☆28 Updated 5 months ago
- Experiments for efforts to train a new and improved t5 ☆76 Updated 7 months ago
- train with kittens! ☆49 Updated 3 weeks ago