KhoomeiK / complexity-scaling
gzip Predicts Data-dependent Scaling Laws
☆34 · May 28, 2024 · Updated last year
Alternatives and similar repositories for complexity-scaling
Users that are interested in complexity-scaling are comparing it to the libraries listed below
- High-performance tokenized language data-loader for Python C++ extension ☆14 · Jul 22, 2024 · Updated last year
- LayerNorm(SmallInit(Embedding)) in a Transformer to improve convergence ☆61 · Feb 21, 2022 · Updated 3 years ago
- A repo to do interpretability of pre-trained acoustic models ☆15 · Oct 15, 2023 · Updated 2 years ago
- ☆27 · Jul 9, 2024 · Updated last year
- ☆14 · Mar 31, 2024 · Updated last year
- ☆15 · Apr 2, 2025 · Updated 10 months ago
- ☆18 · Mar 18, 2024 · Updated last year
- A Chrome extension that helps you stay focused by blocking sites during work timers and letting you browse during break timers. Now also … ☆16 · Nov 22, 2018 · Updated 7 years ago
- ☆42 · Jun 19, 2024 · Updated last year
- Easily turn large sets of audio urls to an audio dataset. ☆21 · Dec 27, 2022 · Updated 3 years ago
- Flax Image Models - State-of-the-art pre-trained vision backbones for Flax. ☆23 · Jun 5, 2025 · Updated 8 months ago
- An implementation of the Llama architecture, to instruct and delight ☆21 · May 31, 2025 · Updated 8 months ago
- ☆25 · May 7, 2025 · Updated 9 months ago
- ☆54 · Oct 29, 2024 · Updated last year
- ☆53 · May 20, 2024 · Updated last year
- Simple Transformer in Jax ☆142 · Jun 22, 2024 · Updated last year
- ☆26 · Sep 15, 2022 · Updated 3 years ago
- A toolkit for scaling law research ⚖ ☆57 · Jan 27, 2025 · Updated last year
- An annotated implementation of the Hyena Hierarchy paper ☆34 · May 28, 2023 · Updated 2 years ago
- Why Do We Need Weight Decay in Modern Deep Learning? [NeurIPS 2024] ☆70 · Sep 25, 2024 · Updated last year
- ☆34 · Sep 10, 2024 · Updated last year
- A SapientML plugin of SapientMLGenerator ☆11 · Dec 23, 2025 · Updated last month
- Notebooks and other course materials for Emory QTM 340 (Fall 2022) ☆12 · Dec 13, 2022 · Updated 3 years ago
- JAX/Flax implementation of the Hyena Hierarchy ☆34 · Apr 27, 2023 · Updated 2 years ago
- A light tensor library in zig. ☆76 · Feb 9, 2025 · Updated last year
- Napari plugin for custom analysis and visualization of lattice lightsheet and Oblique Plane Microscopy data. The plugin is optimized for … ☆14 · Feb 4, 2026 · Updated last week
- Converts stable diffusion embeddings to loadable pngs ☆40 · Dec 6, 2022 · Updated 3 years ago
- Mathematical foundations of data analysis, Winter semester 22-23 ☆13 · Jan 31, 2023 · Updated 3 years ago
- A library for probing Stockfish's NNUEs. The code for reading parameters and forward propagation is taken from Stockfish ☆12 · Nov 18, 2025 · Updated 2 months ago
- User-friendly viewer for Parquet files ☆10 · Jan 10, 2026 · Updated last month
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models" ☆10 · Jul 19, 2024 · Updated last year
- A python algorithm to change the pitch of the voice in real time ☆13 · Dec 13, 2020 · Updated 5 years ago
- Adversarial Training and SFT for Bot Safety Models ☆40 · Apr 18, 2023 · Updated 2 years ago
- ☆40 · Jul 26, 2024 · Updated last year
- Website for the MIT/Harvard Computational Neuroscience Journal Club ☆11 · Apr 7, 2025 · Updated 10 months ago
- PSI-MOD ontology for modified and unmodified amino acid residues ☆14 · Jan 8, 2026 · Updated last month
- ☆11 · Nov 2, 2023 · Updated 2 years ago
- (READ ONLY MIRROR) The ProB Model Checker and Animator Plugin for Rodin ☆19 · Jan 24, 2026 · Updated 3 weeks ago
- SWIM protocol implementation for exchanging cluster membership status and metadata. ☆11 · Oct 9, 2023 · Updated 2 years ago