mcleish7/gemstone-scaling-laws

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/mcleish7/gemstone-scaling-laws)

mcleish7 / gemstone-scaling-laws

Gemstones: A Model Suite for Multi-Faceted Scaling Laws (NeurIPS 2025)

☆35

Alternatives and similar repositories for gemstone-scaling-laws

Users that are interested in gemstone-scaling-laws are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

montehoover / DynaGuard
View on GitHub
Code for "DynaGuard: A Dynamic Guardrail Model With User-Defined Policies."
☆23Nov 3, 2025Updated 8 months ago
facebookresearch / scalable-curvature
View on GitHub
Code for Dayal Kalra's research internship on scalable curvature measures for neural networks.
☆29Feb 3, 2026Updated 5 months ago
morse-benchmark / morse-500
View on GitHub
☆31May 21, 2026Updated 2 months ago
ahans30 / goldfish-loss
View on GitHub
[NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs
☆98Nov 17, 2024Updated last year
hamidkazemi22 / CLIPInversion
View on GitHub
What do we learn from inverting CLIP models?
☆58Mar 6, 2024Updated 2 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
JonasGeiping / dataaugs
View on GitHub
☆18Oct 12, 2022Updated 3 years ago
mcleish7 / retrofitting-recurrence
View on GitHub
Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence
☆68Nov 11, 2025Updated 8 months ago
mcleish7 / arithmetic
View on GitHub
Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024)
☆200May 28, 2024Updated 2 years ago
hsouri / bob-detection
View on GitHub
☆12Oct 20, 2023Updated 2 years ago
neelsjain / baseline-defenses
View on GitHub
Official Code for "Baseline Defenses for Adversarial Attacks Against Aligned Language Models"
☆34Oct 26, 2023Updated 2 years ago
facebookresearch / rl-injector
View on GitHub
Official release of code for the paper RL is a hammer and LLMs are nails A simple RL approach to stronger prompt injection attacks
☆53May 6, 2026Updated 2 months ago
hsouri / bob-classification
View on GitHub
☆11Oct 20, 2023Updated 2 years ago
vasusingla / simple-data-attribution
View on GitHub
A simple and efficient baseline for data attribution
☆11Nov 10, 2023Updated 2 years ago
neelsjain / BYOD
View on GitHub
The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models"
☆108Sep 23, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
azshue / AutoPoison
View on GitHub
The official repository of the paper "On the Exploitability of Instruction Tuning".
☆70Feb 5, 2024Updated 2 years ago
dayal-kalra / low-memory-adam
View on GitHub
☆14Mar 2, 2025Updated last year
JonasGeiping / carving
View on GitHub
Package to optimize Adversarial Attacks against (Large) Language Models with Varied Objectives
☆71Feb 22, 2024Updated 2 years ago
facebookresearch / zero
View on GitHub
PyTorch Implementation of Zero-Shot Vision Encoder Grafting via LLM Surrogates [ICCV'25]
☆54Jul 10, 2025Updated last year
AminJun / ImageNet1KBoundingBoxes
View on GitHub
Pytorch ImageNet1k Loader with Bounding Boxes.
☆13Jan 23, 2022Updated 4 years ago
model-similarity / lm-similarity
View on GitHub
☆21Feb 10, 2025Updated last year
flash-bon / flash-bon
View on GitHub
(ECCV 2026): Official code for Flash-BoN: Instant Drafts for Inference-Time Scaling in Diffusion Models
☆18Jul 9, 2026Updated 2 weeks ago
hpcgroup / loki
View on GitHub
Algorithms for approximate attention in LLMs
☆22Apr 14, 2025Updated last year
JonasGeiping / fullbatchtraining
View on GitHub
Training vision models with full-batch gradient descent and regularization
☆40Feb 14, 2023Updated 3 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
formll / resolving-scaling-law-discrepancies
View on GitHub
☆19Nov 4, 2025Updated 8 months ago
juzhengz / logit-fusion
View on GitHub
Learning from Mixed Rollouts: Logit Fusion as a Bridge Between Imitation and Exploration
☆17Feb 24, 2026Updated 5 months ago
RenkunNi / MetaContrastive
View on GitHub
The official code for the publication: "The Close Relationship Between Contrastive Learning and Meta-Learning".
☆18Sep 19, 2022Updated 3 years ago
axonn-ai / axonn
View on GitHub
Parallel framework for training and fine-tuning deep neural networks
☆74Apr 28, 2026Updated 3 months ago
somepago / DCR
View on GitHub
Official Pytorch repo of CVPR'23 and NeurIPS'23 papers on understanding replication in diffusion models.
☆113Nov 22, 2023Updated 2 years ago
Ping-C / optimizer
View on GitHub
This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explaine…
☆39Mar 2, 2023Updated 3 years ago
j-alex-hanson / gaussian-splatting-pup
View on GitHub
☆148Nov 22, 2025Updated 8 months ago
aks2203 / easy-to-hard-data
View on GitHub
Pytorch Datasets for Easy-To-Hard
☆30Jan 9, 2025Updated last year
LeonLixyz / LCLM
View on GitHub
latent context language models
☆72Jun 9, 2026Updated last month
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
hsouri / Battle-of-the-Backbones
View on GitHub
☆212Nov 2, 2023Updated 2 years ago
lhfowl / robbing_the_fed
View on GitHub
☆26Dec 14, 2021Updated 4 years ago
tuallen / speede3dgs
View on GitHub
☆109Jun 8, 2026Updated last month
YuxinWenRick / hard-prompts-made-easy
View on GitHub
☆648Aug 4, 2023Updated 2 years ago
katiekang1998 / reasoning_generalization
View on GitHub
☆33Jan 7, 2025Updated last year
facebookresearch / sphere-encoder
View on GitHub
PyTorch Implementation of Image Generation with a Sphere Encoder
☆44May 20, 2026Updated 2 months ago
chenllliang / MMEvalPro
View on GitHub
[NAACL 2025] Source code for MMEvalPro, a more trustworthy and efficient benchmark for evaluating LMMs
☆25Sep 26, 2024Updated last year