shehper/scaling_laws

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/shehper/scaling_laws)

shehper / scaling_laws

An open-source implementation of Scaling Laws for Neural Language Models using nanoGPT

☆55

Alternatives and similar repositories for scaling_laws

Users that are interested in scaling_laws are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

kyo-takano / chinchilla
View on GitHub
A toolkit for scaling law research ⚖
☆68Jan 27, 2025Updated last year
OpenBioML / project-proposal-template
View on GitHub
The project proposal template for OpenBioML community projects.
☆18Feb 9, 2023Updated 3 years ago
VikParuchuri / classified
View on GitHub
Score LLM pretraining data with classifiers
☆54Nov 2, 2023Updated 2 years ago
rudinger / defeasible-nli
View on GitHub
Defeasible Natural Language Inference
☆14Dec 4, 2020Updated 5 years ago
ryoungj / ObsScaling
View on GitHub
[NeurIPS'24 Spotlight] Observational Scaling Laws
☆60Oct 2, 2024Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
CausalML / MultipleLoggers
View on GitHub
Code for the paper "Optimal Off-Policy Evaluation from Multiple Logging Policies"
☆15Jul 17, 2021Updated 5 years ago
jajcayn / pyclits
View on GitHub
Python Climate Time Series package
☆11Nov 15, 2022Updated 3 years ago
brianbader / nyc_taxi
View on GitHub
Analysis of NYC taxi trip data
☆11Dec 5, 2016Updated 9 years ago
Shai128 / mqr
View on GitHub
Multiple-Output Quantile Regression
☆16Oct 9, 2021Updated 4 years ago
CorrelAid / correlaid-tidytuesday
View on GitHub
Repository for collecting analyses and results for tidytuesday from CorrelAid members
☆10Apr 11, 2023Updated 3 years ago
lucaslingle / mu_transformer
View on GitHub
Official implementation of 'A Large-Scale Exploration of mu-Transfer' (CoRR 2024)
☆31Jun 5, 2025Updated last year
alonsosilvaallende / COVID-19
View on GitHub
France compared to Italy
☆10Jan 9, 2022Updated 4 years ago
nestordemeure / flaxOptimizers
View on GitHub
A collection of optimizers, some arcane others well known, for Flax.
☆29Aug 6, 2021Updated 4 years ago
Heidelberg-NLP / MHKA
View on GitHub
The corresponding code from our paper "Social Commonsense Reasoning with Multi-Head Knowledge Attention (EMNLP 2020)". Do not hesitate to…
☆11Jun 12, 2022Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
chamorajg / pl-dreamer
View on GitHub
Simplistic Pytorch Implementation of the Dreamer-RL
☆20May 7, 2025Updated last year
mszell / taxonomybikenw
View on GitHub
Taxonomy of urban bicycle network approaches
☆14Nov 26, 2025Updated 7 months ago
meyerscetbon / LinearSinkhorn
View on GitHub
☆17Oct 22, 2020Updated 5 years ago
embedded-sec / uRAI
View on GitHub
Securing Embedded Systems with Return Address Integrity
☆16Aug 19, 2024Updated last year
GameDisplayer / DRL4DG
View on GitHub
Deep Reinforcement Learning for Dialogue Generation using SEQ2SEQ model
☆11Feb 23, 2021Updated 5 years ago
technologiestiftung / bike-sharing
View on GitHub
☆14Dec 10, 2019Updated 6 years ago
MarvinChung / HW5-TextStyleTransfer
View on GitHub
☆15Mar 17, 2021Updated 5 years ago
discus0434 / evaluate-images-to-feed-diffusion
View on GitHub
Small notebook to preprocess and evaluate images.
☆14Nov 11, 2022Updated 3 years ago
hieudx149 / X-RetroMAE
View on GitHub
Code Roberta version of RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-Encoder
☆10Mar 16, 2023Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
half-blue / A_plus_Tsukuba
View on GitHub
A+つくばは大学の課題を効率よく十分な品質で提出することができない (A+が取れない!!)問題を解決したい同じ講義に知り合いが少ない筑波大生向けの筑波大生専用の匿名学習支援SNSです。
☆11Nov 23, 2025Updated 7 months ago
ASK-Berkeley / graph-free-transformer
View on GitHub
☆16Feb 9, 2026Updated 5 months ago
coldenate / zotero-remnote-connector
View on GitHub
A Citation Manager and Zotero Integration for RemNote! Cite research all within your knowledge base!
☆29Jan 22, 2026Updated 5 months ago
GeeeekExplorer / kkbot
View on GitHub
A Feishu/Lark AI agent bot
☆15Feb 27, 2026Updated 4 months ago
vishnukanduri / Customer-Analytics-in-Python
View on GitHub
I use various Data Science and machine learning techniques to analyze customer data using STP framework. I preprocessed the data, perform…
☆11Apr 26, 2020Updated 6 years ago
mat701 / BiCM
View on GitHub
Python package for the computation of the Bipartite Configuration Model.
☆21May 23, 2025Updated last year
cloneofsimo / zeroshampoo
View on GitHub
☆33Sep 10, 2024Updated last year
kmbmjn / search_conference_name_of_paper
View on GitHub
☆11Jun 4, 2021Updated 5 years ago
mitmedialab / Basic
View on GitHub
Agent Based Simulation platform for CityScope
☆16Aug 30, 2019Updated 6 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
HazyResearch / skill-it
View on GitHub
Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models
☆48Oct 31, 2023Updated 2 years ago
pleonova / data-diary
View on GitHub
My life dashboard - automatically track and visualize your data. Using common tracker APIs to create a minute by minute representation of…
☆20Feb 25, 2021Updated 5 years ago
hartmutlentz / TemporalNetworkAccessibility
View on GitHub
Provides classes for Adjacency Matrix Sequences and Temporal Network Edgelists.
☆20Sep 27, 2024Updated last year
YuchenJin / llm.c
View on GitHub
LLM training in simple, raw C/CUDA
☆15Dec 5, 2024Updated last year
demattia / usercode
View on GitHub
My usercode area
☆15Dec 21, 2016Updated 9 years ago
sail-sg / PatchAIL
View on GitHub
Implementation of PatchAIL in the ICLR 2023 paper <Visual Imitation with Patch Rewards>
☆14Feb 15, 2023Updated 3 years ago
google-deepmind / asyncdiloco
View on GitHub
☆51Jan 18, 2024Updated 2 years ago