gzip Predicts Data-dependent Scaling Laws
☆34May 28, 2024Updated last year
Alternatives and similar repositories for complexity-scaling
Users that are interested in complexity-scaling are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆14Mar 31, 2024Updated last year
- ☆27Jul 9, 2024Updated last year
- ☆25May 7, 2025Updated 10 months ago
- ☆18Mar 18, 2024Updated 2 years ago
- Tensor-Slayer : Manipulate weights and tensors of LLMs to achieve performance upgrades and introduce a novel inferenceless mechanistic in…☆28May 27, 2025Updated 10 months ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- ☆54May 20, 2024Updated last year
- Adversarial Training and SFT for Bot Safety Models☆40Apr 18, 2023Updated 2 years ago
- ☆44Jun 19, 2024Updated last year
- High-performance tokenized language data-loader for Python C++ extension☆14Jul 22, 2024Updated last year
- LayerNorm(SmallInit(Embedding)) in a Transformer to improve convergence☆61Feb 21, 2022Updated 4 years ago
- An implementation of the Llama architecture, to instruct and delight☆21May 31, 2025Updated 9 months ago
- #UAI2020 Codes for PAC-Bayesian Contrastive Unsupervised Representation Learning☆14May 23, 2022Updated 3 years ago
- Flax Image Models - State-of-the-art pre-trained vision backbones for Flax.☆23Jun 5, 2025Updated 9 months ago
- ☆15Apr 2, 2025Updated 11 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆53Oct 29, 2024Updated last year
- ☆13Apr 25, 2024Updated last year
- PyTorch implementation for MRL☆22Feb 22, 2024Updated 2 years ago
- Tools for formatting large language model prompts.☆13Dec 19, 2023Updated 2 years ago
- ☆25Sep 3, 2025Updated 6 months ago
- Scripts for quantifying stuff from my life☆19Nov 1, 2015Updated 10 years ago
- Simple Transformer in Jax☆143Jun 22, 2024Updated last year
- playing with gpt4☆14Mar 17, 2023Updated 3 years ago
- A toolkit for scaling law research ⚖☆59Jan 27, 2025Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Official implementation of Language Models as Compilers: Simulating the Execution Of Pseudocode Improves Algorithmic Reasoning in Languag…☆23Apr 8, 2024Updated last year
- Code for "Transformer-Based Deep Survival Analysis"☆12May 27, 2022Updated 3 years ago
- Towards Meta-Pruning via Optimal Transport, ICLR 2024 (Spotlight)☆18Dec 5, 2024Updated last year
- Why Do We Need Weight Decay in Modern Deep Learning? [NeurIPS 2024]☆71Sep 25, 2024Updated last year
- ☆19Nov 4, 2025Updated 4 months ago
- sketch-rnn demo for seoul mediacity biennale 2018☆13Sep 4, 2018Updated 7 years ago
- ☆16Jul 20, 2023Updated 2 years ago
- Bastien One is being developed as autonomous A.I. bot with the capacity to complete complex tasks - either by itself or by creating addit…☆16Mar 9, 2025Updated last year
- Easily turn large sets of audio urls to an audio dataset.☆21Dec 27, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Notebooks and other course materials for Emory QTM 340 (Fall 2022)☆12Dec 13, 2022Updated 3 years ago
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"☆91Feb 27, 2024Updated 2 years ago
- ICML'19: How does Disagreement Help Generalization against Label Corruption?☆22Jun 30, 2019Updated 6 years ago
- ☆18Feb 20, 2024Updated 2 years ago
- Code for ACL 2023 paper titled "Lifting the Curse of Capacity Gap in Distilling Language Models"☆30Jul 14, 2023Updated 2 years ago
- 🌏 Modular retrievers for zero-shot multilingual IR.☆30Mar 6, 2024Updated 2 years ago
- Repo for the CZI Imaging Team's napari plugin Alfa Cohort collaboration☆11May 14, 2021Updated 4 years ago