An open-source implementation of Scaling Laws for Neural Language Models using nanoGPT
☆52Dec 8, 2023Updated 2 years ago
Alternatives and similar repositories for scaling_laws
Users that are interested in scaling_laws are comparing it to the libraries listed below.
- A toolkit for scaling law research ⚖☆63Jan 27, 2025Updated last year
- Score LLM pretraining data with classifiers☆55Nov 2, 2023Updated 2 years ago
- [NeurIPS'24 Spotlight] Observational Scaling Laws☆61Oct 2, 2024Updated last year
- Code for the paper "Optimal Off-Policy Evaluation from Multiple Logging Policies"☆15Jul 17, 2021Updated 4 years ago
- To be a next-generation DL-based phenotype prediction from genome mutations.☆19May 17, 2021Updated 5 years ago
- A collection of optimizers, some arcane others well known, for Flax.☆29Aug 6, 2021Updated 4 years ago
- [ICML 24 NGSM workshop] Associative Recurrent Memory Transformer implementation and scripts for training and evaluation☆65Mar 12, 2026Updated 2 months ago
- Simplistic Pytorch Implementation of the Dreamer-RL☆20May 7, 2025Updated last year
- ☆17Oct 22, 2020Updated 5 years ago
- An algorithm that intelligently executes a crypto order over time via Coinbase☆13Oct 26, 2021Updated 4 years ago
- Experiments on the impact of depth in transformers and SSMs.☆40Oct 23, 2025Updated 6 months ago
- ☆15Apr 23, 2026Updated 3 weeks ago
- Analytical solutions for logistic regression and single-layer softmax☆12Jul 29, 2021Updated 4 years ago
- Subscribe to image messages published by Loomo and process them☆10Oct 22, 2017Updated 8 years ago
- Filter RSS Feed with GPT-4☆16May 22, 2023Updated 2 years ago
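- A+つくば is an anonymous study-support social network exclusively for University of Tsukuba students. It is aimed at students who know few people in their lectures and addresses the problem of being unable to submit coursework efficiently and at sufficient quality (that is, being unable to get an A+).☆11Nov 23, 2025Updated 5 months ago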
- Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining☆12Mar 23, 2021Updated 5 years ago
- This is an interactive mock-up of the SpaceX Dragon 2 spacecraft's user interface. It contains 5 panels and multiple amusing features. A …☆11Jul 28, 2022Updated 3 years ago
- ☆11Dec 6, 2022Updated 3 years ago
- ☆12Nov 28, 2018Updated 7 years ago
- ☆34Sep 10, 2024Updated last year
- Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models☆48Oct 31, 2023Updated 2 years ago
- ☆11Apr 23, 2023Updated 3 years ago
- A script that displays Japanese translations of new arXiv paper abstracts, plus extras, in Notion every day☆12Jan 22, 2023Updated 3 years ago
- Inverse Scaling in Test-Time Compute☆25Dec 3, 2025Updated 5 months ago
- My usercode area☆15Dec 21, 2016Updated 9 years ago
- Language models scale reliably with over-training and on downstream tasks☆101Apr 2, 2024Updated 2 years ago
- ☆11Sep 29, 2021Updated 4 years ago
- Official PyTorch implementation of The Linear Attention Resurrection in Vision Transformer☆15Sep 7, 2024Updated last year
- ☆13Jul 4, 2020Updated 5 years ago
- ☆123May 13, 2026Updated last week
- Materials and exercises for SICP☆15Feb 13, 2017Updated 9 years ago
- Continual Memorization of Factoids in Large Language Models☆12Nov 20, 2024Updated last year
- Pile Deduplication Code☆18May 15, 2023Updated 3 years ago
- ICML 2025 Spotlight, PCEvolve: Private Contrastive Evolution for Synthetic Dataset Generation via Few-Shot Private Data and Generative AP…☆14Jun 27, 2025Updated 10 months ago
- ☆12Jun 19, 2022Updated 3 years ago
- ☆80Oct 3, 2023Updated 2 years ago
- Full Marks | Auditing CS61B Data Structures, Spring 2021☆13Jul 31, 2023Updated 2 years ago
- ☆12Dec 13, 2023Updated 2 years ago