allenai/signal-and-noise

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/allenai/signal-and-noise)

allenai / signal-and-noise

Measuring the Signal to Noise Ratio in Language Model Evaluation

☆31

Alternatives and similar repositories for signal-and-noise

Users that are interested in signal-and-noise are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

allenai / fluid-benchmarking
View on GitHub
Fluid Language Model Benchmarking
☆29Sep 16, 2025Updated 10 months ago
Farseer-Scaling-Law / Farseer
View on GitHub
☆21Jun 12, 2025Updated last year
rioyokotalab / swallow-code-math
View on GitHub
Ongoing research project for code&math LLMs
☆32Jul 4, 2025Updated last year
damek / specgd
View on GitHub
Code to generate figures of paper "When do spectral gradient updates help in deep learning?"
☆16Dec 3, 2025Updated 7 months ago
google-deepmind / dmc_vision_benchmark
View on GitHub
☆34Jun 21, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
FLAIROx / cultural-accumulation
View on GitHub
☆16Jul 16, 2024Updated 2 years ago
drbh / yamoe
View on GitHub
🔀 yet another mixture of experts
☆23Jun 5, 2026Updated last month
jsikyoon / OCRL
View on GitHub
Object-Centric-Representation Library (OCRL): This repo is to explore OCR on various downstream tasks from supervised learning tasks to R…
☆12Feb 23, 2024Updated 2 years ago
aliang8 / varibad_jax
View on GitHub
☆10Jun 27, 2024Updated 2 years ago
Unakar / Spectral-Sphere-Optimizer
View on GitHub
Spectral Sphere Optimizer
☆131Mar 23, 2026Updated 4 months ago
OliverSieberling / dynamic-conv1d
View on GitHub
Triton kernels for dynamic causal short convolutions.
☆24Jun 4, 2026Updated last month
belindal / state-tracking
View on GitHub
Code and data for paper "(How) do Language Models Track State?"
☆26Mar 31, 2025Updated last year
sanderland / script_tok
View on GitHub
Code for the paper "BPE stays on SCRIPT", "Which Pieces Does Unigram Tokenization Really Need?" and MinGram
☆18Updated this week
facebookresearch / dmae_st
View on GitHub
Directed masked autoencoders
☆14Mar 25, 2026Updated 4 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Michael-Beukman / RobocupGym
View on GitHub
Reinforcement Learning inside a 3D soccer simulation
☆37Sep 15, 2024Updated last year
allenai / DataDecide
View on GitHub
☆44Aug 20, 2025Updated 11 months ago
HydraQYH / hp_rms_norm
View on GitHub
High performance RMSNorm Implement by using SM Core Storage(Registers and Shared Memory)
☆30Jan 22, 2026Updated 6 months ago
allenai / olmes
View on GitHub
Reproducible, flexible LLM evaluations
☆390Mar 24, 2026Updated 4 months ago
zzp1012 / SAM-in-Late-Phase
View on GitHub
[ICLR 2025 Spotlight] Code release for "Sharpness-Aware Minimization Efficiently Selects Flatter Minima Late In Training"
☆19Feb 20, 2025Updated last year
huanranchen / LLMLandscape
View on GitHub
The loss landscape of Large Language Models resemble basin!
☆41Jul 8, 2025Updated last year
facebookresearch / PhysicsLM4
View on GitHub
Physics of Language Models: Part 4.2, Canon Layers at Scale where Synthetic Pretraining Resonates in Reality
☆356May 20, 2026Updated 2 months ago
mcleish7 / retrofitting-recurrence
View on GitHub
Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence
☆68Nov 11, 2025Updated 8 months ago
dunnolab / harmony
View on GitHub
[ICML 2026 GenBio Workshop] Official Implementation for "Harmonic Torsional Diffusion for Protein-Ligand Flexible Docking"
☆15Jun 30, 2026Updated 3 weeks ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
formll / resolving-scaling-law-discrepancies
View on GitHub
☆19Nov 4, 2025Updated 8 months ago
stefan-it / ukrainian-electra
View on GitHub
Ukrainian ELECTRA model
☆12Mar 11, 2023Updated 3 years ago
ttumiel / minRLHF
View on GitHub
Minimal RLHF implementation built on top of minGPT.
☆32Jul 4, 2024Updated 2 years ago
aHapBean / xHC
View on GitHub
[Tech Report] Expanded Hyper-Connections
☆55Jul 21, 2026Updated last week
FFTYYY / RaanA
View on GitHub
Implementation of "RaanA: A Fast, Flexible, and Data-Efficient Post-Training Quantization Algorithm"
☆17Apr 11, 2025Updated last year
richardodliu / OpenCodeEval
View on GitHub
☆52Mar 9, 2026Updated 4 months ago
LAION-AI / Conditional-Pretraining-of-Large-Language-Models
View on GitHub
☆37May 7, 2023Updated 3 years ago
zhuzilin / flash-attention-with-sink
View on GitHub
☆37Aug 7, 2025Updated 11 months ago
davidbrandfonbrener / color-filter-olmo
View on GitHub
☆13Dec 12, 2025Updated 7 months ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
sustcsonglin / linear-attention-and-beyond-slides
View on GitHub
☆119Feb 25, 2025Updated last year
Yifei-Zuo / Parallax
View on GitHub
Official repository for Parallax (Parameterized Local Linear Attention)
☆68Jul 7, 2026Updated 3 weeks ago
uiuctml / MergeBench
View on GitHub
[NeurIPS 2025] MergeBench: A Benchmark for Merging Domain-Specialized LLMs
☆47Feb 11, 2026Updated 5 months ago
RyanNavillus / reward-surfaces
View on GitHub
☆19Apr 22, 2024Updated 2 years ago
JinjieNi / dlms-are-super-data-learners
View on GitHub
The official github repo for "Diffusion Language Models are Super Data Learners".
☆227Nov 6, 2025Updated 8 months ago
Rhoban / footstepnet_envs
View on GitHub
☆19Jul 4, 2025Updated last year
Howuhh / streaming-drl-jax
View on GitHub
streaming deep reinforcement learning but 4x faster with jax!
☆19Jan 4, 2026Updated 6 months ago