An open-source implementation of Scaling Laws for Neural Language Models using nanoGPT
☆53Dec 8, 2023Updated 2 years ago
Alternatives and similar repositories for scaling_laws
Users that are interested in scaling_laws are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The project proposal template for OpenBioML community projects.☆18Feb 9, 2023Updated 3 years ago
- Score LLM pretraining data with classifiers☆55Nov 2, 2023Updated 2 years ago
- [NeurIPS'24 Spotlight] Observational Scaling Laws☆61Oct 2, 2024Updated last year
- Official repository of Graph RAG-Tool Fusion and ToolLinkOS dataset.☆23Feb 13, 2025Updated last year
- An implementation of the paper Brain2Qwerty that translates brain EEG data into text for reading people's brains. There was no code so we…☆23Feb 9, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Hierarchical Image Representation☆10Dec 9, 2023Updated 2 years ago
- [ICML 24 NGSM workshop] Associative Recurrent Memory Transformer implementation and scripts for training and evaluation☆64Mar 12, 2026Updated last month
- an attempt to beat the shit out of gnulib☆11Sep 19, 2011Updated 14 years ago
- Simplistic Pytorch Implementation of the Dreamer-RL☆20May 7, 2025Updated 11 months ago
- An Affordable LLM Pre-training Benchmark via Accurate Loss Prediction across Scales☆16Jun 6, 2024Updated last year
- Material for the course of "Mathematics of Transformer"☆22Aug 3, 2025Updated 8 months ago
- Cross platform C++ threading library☆12Dec 11, 2015Updated 10 years ago
- [Preprint] On the Effectiveness of Mitigating Data Poisoning Attacks with Gradient Shaping☆10Feb 27, 2020Updated 6 years ago
- The sorce code for the realtime video descriptors: HOG, HOF, MBH and HMG☆10Feb 6, 2017Updated 9 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- 逻辑回归和单层softmax的解析解☆12Jul 29, 2021Updated 4 years ago
- A+つくばは大 学の課題を効率よく十分な品質で提出することができない (A+が取れない!!)問題を解決したい 同じ講義に知り合いが少ない筑波大生向けの筑波大生専用の匿名学習支援SNSです。☆11Nov 23, 2025Updated 5 months ago
- Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining☆12Mar 23, 2021Updated 5 years ago
- A GPU miner for the Zcash cryptocurrency.☆15Oct 20, 2016Updated 9 years ago
- 1⃣ Example of using the django-celery module.☆15Sep 22, 2016Updated 9 years ago
- ☆11Dec 6, 2022Updated 3 years ago
- An efficient hierarchical Graph-based RAG☆38Nov 27, 2025Updated 5 months ago
- Implementation of paper "Transferring Robustness for Graph Neural Network Against Poisoning Attacks".☆20Feb 26, 2020Updated 6 years ago
- ☆17May 13, 2025Updated 11 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Edward中文文档☆16Apr 11, 2017Updated 9 years ago
- DoubleAI’s hyperoptimised version of cuGraph☆51Mar 3, 2026Updated last month
- ☆34Sep 10, 2024Updated last year
- It's the C++ Package Manager Manager☆16Feb 15, 2021Updated 5 years ago
- Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models☆48Oct 31, 2023Updated 2 years ago
- ☆15Mar 2, 2011Updated 15 years ago
- FormulaOne: A dataset of algorithmic problems based on MSO formulas.☆25Mar 1, 2026Updated last month
- LLM training in simple, raw C/CUDA☆15Dec 5, 2024Updated last year
- Python tools for VMD☆10Jun 3, 2018Updated 7 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Inverse Scaling in Test-Time Compute☆25Dec 3, 2025Updated 4 months ago
- Notionに毎日新しいarXiv論文のアブスト ラクト日本語訳 + αを表示するスクリプト☆13Jan 22, 2023Updated 3 years ago
- Deluge plugin to stop torrents after seeding for a certain amount of time.☆12Jun 24, 2019Updated 6 years ago
- Language models scale reliably with over-training and on downstream tasks☆101Apr 2, 2024Updated 2 years ago
- Official PyTorch implementation of The Linear Attention Resurrection in Vision Transformer☆16Sep 7, 2024Updated last year
- ☆18Sep 29, 2020Updated 5 years ago
- ☆13Jul 4, 2020Updated 5 years ago