KhoomeiK / complexity-scaling
gzip Predicts Data-dependent Scaling Laws
☆35 · Updated last year
Alternatives and similar repositories for complexity-scaling
Users who are interested in complexity-scaling compare it to the repositories listed below.
- ☆61 · Updated last year
- Code for minimum-entropy coupling. ☆32 · Updated last year
- ☆27 · Updated last year
- Understanding how features learned by neural networks evolve throughout training ☆36 · Updated 8 months ago
- Experiments for efforts to train a new and improved t5 ☆76 · Updated last year
- A MAD laboratory to improve AI architecture designs 🧪 ☆123 · Updated 6 months ago
- ☆81 · Updated last year
- Sparse and discrete interpretability tool for neural networks ☆63 · Updated last year
- ☆68 · Updated 11 months ago
- ☆134 · Updated 3 months ago
- Functional Benchmarks and the Reasoning Gap ☆88 · Updated 9 months ago
- Scaling is a distributed training library and installable dependency designed to scale up neural networks, with a dedicated module for tr… ☆62 · Updated 8 months ago
- train with kittens! ☆61 · Updated 8 months ago
- ☆53 · Updated last year
- ☆38 · Updated 11 months ago
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers. ☆17 · Updated 3 months ago
- ☆22 · Updated last year
- some common Huggingface transformers in maximal update parametrization (µP) ☆81 · Updated 3 years ago
- ☆35 · Updated last year
- Engineering the state of RNN language models (Mamba, RWKV, etc.) ☆32 · Updated last year
- ☆45 · Updated last year
- Efficient Dictionary Learning with Switch Sparse Autoencoders (SAEs) ☆25 · Updated 7 months ago
- Synthetic data generation and benchmark implementation for "Episodic Memories Generation and Evaluation Benchmark for Large Language Mode… ☆46 · Updated 3 months ago
- ☆56 · Updated 8 months ago
- ☆53 · Updated last year
- ☆101 · Updated 5 months ago
- Official Repository of Pretraining Without Attention (BiGS), BiGS is the first model to achieve BERT-level transfer learning on the GLUE … ☆113 · Updated last year
- The simplest, fastest repository for training/finetuning medium-sized GPTs. ☆147 · Updated 2 weeks ago
- Simple GRPO scripts and configurations. ☆59 · Updated 5 months ago
- ☆53 · Updated 9 months ago