gzip Predicts Data-dependent Scaling Laws
☆35May 28, 2024Updated last year
Alternatives and similar repositories for complexity-scaling
Users that are interested in complexity-scaling are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆14Mar 31, 2024Updated 2 years ago
- ☆27Jul 9, 2024Updated last year
- ☆18Mar 18, 2024Updated 2 years ago
- 0-Shot Tokenizer Transplant☆14May 16, 2025Updated 11 months ago
- A repo to do interpretability of pre-trained acoustic models☆15Oct 15, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆54May 20, 2024Updated last year
- ☆45Jun 19, 2024Updated last year
- High-performance tokenized language data-loader for Python C++ extension☆14Jul 22, 2024Updated last year
- LayerNorm(SmallInit(Embedding)) in a Transformer to improve convergence☆61Feb 21, 2022Updated 4 years ago
- An implementation of the Llama architecture, to instruct and delight☆21May 31, 2025Updated 10 months ago
- PyTorch implementation for MRL☆23Feb 22, 2024Updated 2 years ago
- Tools for formatting large language model prompts.☆13Dec 19, 2023Updated 2 years ago
- Scripts for quantifying stuff from my life☆19Nov 1, 2015Updated 10 years ago
- Simple Transformer in Jax☆143Jun 22, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- playing with gpt4☆14Mar 17, 2023Updated 3 years ago
- A toolkit for scaling law research ⚖☆60Jan 27, 2025Updated last year
- Official implementation of Language Models as Compilers: Simulating the Execution Of Pseudocode Improves Algorithmic Reasoning in Languag…☆23Apr 8, 2024Updated 2 years ago
- Towards Meta-Pruning via Optimal Transport, ICLR 2024 (Spotlight)☆18Dec 5, 2024Updated last year
- Why Do We Need Weight Decay in Modern Deep Learning? [NeurIPS 2024]☆71Sep 25, 2024Updated last year
- sketch-rnn demo for seoul mediacity biennale 2018☆13Sep 4, 2018Updated 7 years ago
- ☆25Sep 3, 2025Updated 7 months ago
- ☆16Jul 20, 2023Updated 2 years ago
- Your favourite classical machine learning algos on the GPU/TPU☆22Dec 14, 2025Updated 4 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A Chrome extension that helps you stay focused by blocking sites during work timers and letting you browse during break timers. Now also …☆16Nov 22, 2018Updated 7 years ago
- Digital texts in Prakrit☆10Sep 14, 2025Updated 7 months ago
- Vector search over tweets from the tweet archive using OpenAI embeddings and LanceDB☆58Mar 25, 2024Updated 2 years ago
- Napari plugin for custom analysis and visualization of lattice lightsheet and Oblique Plane Microscopy data. The plugin is optimized for …☆14Updated this week
- An Offline Wikipedia Dump Reader in Javascript that probably only works on Chrome☆19Dec 23, 2011Updated 14 years ago
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"☆91Feb 27, 2024Updated 2 years ago
- ICML'19: How does Disagreement Help Generalization against Label Corruption?☆22Jun 30, 2019Updated 6 years ago
- ☆18Feb 20, 2024Updated 2 years ago
- 🌏 Modular retrievers for zero-shot multilingual IR.☆30Mar 6, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [TMLR'25] Official implementation for "Large-Scale Targeted Cause Discovery via Learning from Simulated Data"☆27Sep 30, 2025Updated 6 months ago
- Repo for the CZI Imaging Team's napari plugin Alfa Cohort collaboration☆11May 14, 2021Updated 4 years ago
- ☆15Oct 24, 2023Updated 2 years ago
- Website for the MIT/Harvard Computational Neuroscience Journal Club☆11Apr 7, 2025Updated last year
- Direct Preference Optimization for RWKV, aiming for RWKV-5 and 6.☆11Mar 1, 2024Updated 2 years ago
- a fast implementation of BM25☆10Sep 15, 2022Updated 3 years ago
- ☆18Jun 12, 2023Updated 2 years ago