gzip Predicts Data-dependent Scaling Laws
☆35 · May 28, 2024 · Updated 2 years ago
Alternatives and similar repositories for complexity-scaling
Users who are interested in complexity-scaling are comparing it to the libraries listed below.
- ☆14 · Mar 31, 2024 · Updated 2 years ago
- ☆27 · Jul 9, 2024 · Updated last year
- ☆28 · May 7, 2025 · Updated last year
- ☆18 · Mar 18, 2024 · Updated 2 years ago
- 0-Shot Tokenizer Transplant ☆14 · May 16, 2025 · Updated last year
- A repo to do interpretability of pre-trained acoustic models ☆15 · Oct 15, 2023 · Updated 2 years ago
- Tensor-Slayer: Manipulate weights and tensors of LLMs to achieve performance upgrades and introduce a novel inferenceless mechanistic in… ☆28 · May 27, 2025 · Updated last year
- ☆54 · May 20, 2024 · Updated 2 years ago
- ☆45 · Jun 19, 2024 · Updated 2 years ago
- High-performance tokenized language data-loader for Python C++ extension ☆15 · Jul 22, 2024 · Updated last year
- LayerNorm(SmallInit(Embedding)) in a Transformer to improve convergence ☆61 · Feb 21, 2022 · Updated 4 years ago
- An implementation of the Llama architecture, to instruct and delight ☆21 · May 31, 2025 · Updated last year
- ☆16 · Apr 2, 2025 · Updated last year
- ☆53 · Oct 29, 2024 · Updated last year
- PyTorch implementation for MRL ☆23 · Feb 22, 2024 · Updated 2 years ago
- Tools for formatting large language model prompts. ☆13 · Dec 19, 2023 · Updated 2 years ago
- Simple Transformer in Jax ☆144 · Jun 22, 2024 · Updated last year
- playing with gpt4 ☆13 · Mar 17, 2023 · Updated 3 years ago
- A toolkit for scaling law research ⚖ ☆65 · Jan 27, 2025 · Updated last year
- Official implementation of Language Models as Compilers: Simulating the Execution Of Pseudocode Improves Algorithmic Reasoning in Languag… ☆23 · Apr 8, 2024 · Updated 2 years ago
- Towards Meta-Pruning via Optimal Transport, ICLR 2024 (Spotlight) ☆18 · Dec 5, 2024 · Updated last year
- Why Do We Need Weight Decay in Modern Deep Learning? [NeurIPS 2024] ☆73 · Sep 25, 2024 · Updated last year
- ☆19 · Nov 4, 2025 · Updated 7 months ago
- ☆26 · Sep 3, 2025 · Updated 9 months ago
- Contextual Position Encoding but with some custom CUDA Kernels https://arxiv.org/abs/2405.18719 ☆22 · Jun 5, 2024 · Updated 2 years ago
- ☆26 · Sep 15, 2022 · Updated 3 years ago
- A light tensor library in zig. ☆77 · Feb 9, 2025 · Updated last year
- Your favourite classical machine learning algos on the GPU/TPU ☆23 · Dec 14, 2025 · Updated 6 months ago
- A Chrome extension that helps you stay focused by blocking sites during work timers and letting you browse during break timers. Now also … ☆16 · Nov 22, 2018 · Updated 7 years ago
- Easily turn large sets of audio urls to an audio dataset. ☆21 · Dec 27, 2022 · Updated 3 years ago
- Notebooks and other course materials for Emory QTM 340 (Fall 2022) ☆12 · Dec 13, 2022 · Updated 3 years ago
- Vector search over tweets from the tweet archive using OpenAI embeddings and LanceDB ☆58 · Mar 25, 2024 · Updated 2 years ago
- Napari plugin for custom analysis and visualization of lattice lightsheet and Oblique Plane Microscopy data. The plugin is optimized for … ☆15 · Jun 12, 2026 · Updated last week
- An Offline Wikipedia Dump Reader in Javascript that probably only works on Chrome ☆19 · Dec 23, 2011 · Updated 14 years ago
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization" ☆93 · Feb 27, 2024 · Updated 2 years ago
- ☆30 · Oct 8, 2024 · Updated last year
- 🌏 Modular retrievers for zero-shot multilingual IR. ☆30 · Mar 6, 2024 · Updated 2 years ago
- [TMLR'25] Official implementation for "Large-Scale Targeted Cause Discovery via Learning from Simulated Data" ☆27 · Sep 30, 2025 · Updated 8 months ago
- Repo for the CZI Imaging Team's napari plugin Alfa Cohort collaboration ☆11 · May 14, 2021 · Updated 5 years ago