ethancaballero/broken_neural_scaling_laws

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ethancaballero/broken_neural_scaling_laws)

ethancaballero / broken_neural_scaling_laws

Code Release for "Broken Neural Scaling Laws" (BNSL) paper

☆59

Alternatives and similar repositories for broken_neural_scaling_laws

Users that are interested in broken_neural_scaling_laws are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Yuanhy1997 / HyPe
View on GitHub
HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation [ACL 2023]
☆14Jul 11, 2023Updated 3 years ago
monasticacademy / logical-induction
View on GitHub
Code to support the guide to logical induction for software engineers
☆11Jul 12, 2026Updated 2 weeks ago
ahujak / IB-IRM
View on GitHub
☆32Oct 13, 2021Updated 4 years ago
allenai / easy-to-hard-generalization
View on GitHub
Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"
☆48Jan 17, 2024Updated 2 years ago
Linear95 / DSP
View on GitHub
Domain-specific preference (DSP) data and customized RM fine-tuning.
☆25Mar 7, 2024Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
locuslab / scaling_laws_data_filtering
View on GitHub
☆64Apr 9, 2024Updated 2 years ago
TjuJianyu / RFC
View on GitHub
☆12Jul 17, 2023Updated 3 years ago
ethansmith2000 / fsdp_optimizers
View on GitHub
supporting pytorch FSDP for optimizers
☆84Dec 8, 2024Updated last year
yaroslavvb / imperative
View on GitHub
imperative programming in TensorFlow
☆18Dec 12, 2016Updated 9 years ago
Yuanhy1997 / Auto-Diagnosis-by-RL-and-Classification
View on GitHub
Efficient Symptom Inquiring and Diagnosis via Adaptive Alignment of Reinforcement Learning and Classification [AI in Medicine Journal]
☆14May 20, 2022Updated 4 years ago
mlfoundations / scaling
View on GitHub
Language models scale reliably with over-training and on downstream tasks
☆102Apr 2, 2024Updated 2 years ago
samuela / git-re-basin
View on GitHub
Code release for "Git Re-Basin: Merging Models modulo Permutation Symmetries"
☆515Mar 7, 2023Updated 3 years ago
piotr-teterwak / erm_plusplus
View on GitHub
☆17Jun 20, 2024Updated 2 years ago
EleutherAI / pile_dedupe
View on GitHub
Pile Deduplication Code
☆18May 15, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
noemaresearch / pinboard
View on GitHub
Pin files for contextual, codebase-level AI assistance.
☆16Jul 11, 2024Updated 2 years ago
ekinakyurek / google-research
View on GitHub
Google Research
☆47Oct 29, 2022Updated 3 years ago
EleutherAI / mdl
View on GitHub
Minimum Description Length probing for neural network representations
☆20Jan 28, 2025Updated last year
bethgelab / InDomainGeneralizationBenchmark
View on GitHub
☆35Aug 30, 2021Updated 4 years ago
Joshua-Ren / Neural_Iterated_Learning
View on GitHub
Pytorch implementation of the paper 'Compositional language emerge in a neural iterated learning' (ICLR 2020).
☆16Oct 14, 2021Updated 4 years ago
vsahil / MIMETIC-2
View on GitHub
Official Code for MIMETIC^2
☆13Nov 19, 2024Updated last year
euclaise / supertrainer2000
View on GitHub
☆50Mar 14, 2024Updated 2 years ago
sg-wbi / belb
View on GitHub
Biomedical Entity Linking Benchmark
☆14Dec 10, 2024Updated last year
chmathys-teaching-f22 / methods-2-course
View on GitHub
Methods 2: The General Linear Model
☆15May 5, 2022Updated 4 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
Timonzimm / poincare-embedding
View on GitHub
Poincaré Embeddings for Learning Hierarchical Representations (https://arxiv.org/abs/1705.08039) in PyTorch
☆15Dec 20, 2017Updated 8 years ago
princeton-nlp / ELIZA-Transformer
View on GitHub
[NAACL 2025] Representing Rule-based Chatbots with Transformers
☆23Feb 9, 2025Updated last year
bneyshabur / over-parametrization
View on GitHub
Computing various norms/measures on over-parametrized neural networks
☆50Nov 26, 2018Updated 7 years ago
tml-tuebingen / chatgpt-algorithm-exam
View on GitHub
ChatGPT Participates in a Computer Science Exam (2023)
☆31Mar 21, 2023Updated 3 years ago
MilesCranmer / pysr_scaling_laws
View on GitHub
You should use PySR to find scaling laws. Here's an example.
☆34Sep 30, 2023Updated 2 years ago
Yuanhy1997 / GenBioEL
View on GitHub
Generative Biomedical Entity Linking via Knowledge Base-Guided Pre-training and Synonyms-Aware Fine-tuning [NAACL 2022]
☆19Jan 27, 2023Updated 3 years ago
hadasah / btm
View on GitHub
☆79Apr 29, 2024Updated 2 years ago
SivanDoveh / DAC
View on GitHub
Repository for the paper: dense and aligned captions (dac) promote compositional reasoning in vl models
☆28Nov 29, 2023Updated 2 years ago
ermongroup / lagvae
View on GitHub
Lagrangian VAE
☆28Jul 27, 2018Updated 8 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
neelsjain / BYOD
View on GitHub
The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models"
☆108Sep 23, 2023Updated 2 years ago
gibipara92 / learning-explanations-hard-to-vary
View on GitHub
Code to implement the AND-mask and geometric mean to do gradient based optimization, from the paper "Learning explanations that are hard …
☆41Nov 12, 2020Updated 5 years ago
manantomar / video-occupancy-models
View on GitHub
☆13Jul 16, 2024Updated 2 years ago
jenni-ai / T2FW
View on GitHub
Fine-Tuning Pre-trained Transformers into Decaying Fast Weights
☆20Oct 9, 2022Updated 3 years ago
3B-Group / ConvRe
View on GitHub
🤖ConvRe🤯: An Investigation of LLMs’ Inefficacy in Understanding Converse Relations (EMNLP 2023)
☆24Oct 10, 2023Updated 2 years ago
LuoXiaoHeics / Continual-Tune
View on GitHub
☆10Feb 6, 2025Updated last year
KaiserWhoLearns / Effective-Attention-Interpretability
View on GitHub
Effective Attention Sheds Light On Interpretability - Findings of ACL2021
☆11May 16, 2021Updated 5 years ago