stanford-cs336/assignment3-scaling

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/stanford-cs336/assignment3-scaling)

stanford-cs336 / assignment3-scaling

☆46

Alternatives and similar repositories for assignment3-scaling

Users that are interested in assignment3-scaling are comparing it to the libraries listed below

Sorting:

stanford-cs336 / assignment4-data
View on GitHub
☆41Jul 21, 2025Updated 7 months ago
stanford-cs336 / assignment5-alignment
View on GitHub
☆114Jul 21, 2025Updated 7 months ago
stanford-cs336 / assignment2-systems
View on GitHub
Student version of Assignment 2 for Stanford CS336 - Language Modeling From Scratch
☆179Jul 25, 2025Updated 7 months ago
keyonvafa / inductive-bias-probes
View on GitHub
☆29Nov 30, 2025Updated 3 months ago
aadityasingh / icl-dynamics
View on GitHub
☆25Feb 20, 2026Updated 2 weeks ago
AnonymousNIPS2019 / DeepnetHessian
View on GitHub
The Full Spectrum of Deepnet Hessians at Scale: Dynamics with SGD Training and Sample Size
☆19May 19, 2019Updated 6 years ago
stanford-cs336 / assignment1-basics
View on GitHub
Student version of Assignment 1 for Stanford CS336 - Language Modeling From Scratch
☆1,295Aug 29, 2025Updated 6 months ago
allenbai01 / transformers-as-statisticians
View on GitHub
☆35Jul 5, 2023Updated 2 years ago
OpenMined / syft-flwr
View on GitHub
Easy Setup, File-based, Offline Capable Federated Learning and Computations
☆22Feb 11, 2026Updated 3 weeks ago
merlinquantum / merlin
View on GitHub
Photonic Quantum Machine Learning Framework
☆19Feb 18, 2026Updated 2 weeks ago
AdobeDocs / cc-libraries-api-samples
View on GitHub
☆14Dec 20, 2021Updated 4 years ago
yifanc96 / NonLinPDEs-GPsolver
View on GitHub
Code for the paper: Solving and Learning Nonlinear PDEs with Gaussian Processes
☆40Jul 17, 2025Updated 7 months ago
sri9s / tinystories-language-models
View on GitHub
Exploring the minimal architecture required for coherent English language generation.
☆12Mar 5, 2025Updated last year
cda-tum / mmft-iso-designer
View on GitHub
The MMFT ISO Designer is a tool that validates and generates microfluidic chip designs conforming to the ISO 22916 standard.
☆15Feb 5, 2026Updated last month
Ikemura-kei / RM2021-Tutorial
View on GitHub
This is a repository for RM2021 Software tutorial
☆11Nov 4, 2020Updated 5 years ago
statwangz / MATH-4432-Statistical-Machine-Learning
View on GitHub
Tutorials for MATH 4432 Statistical Machine Learning, HKUST, Fall 2022
☆11Sep 17, 2024Updated last year
peytontolbert / simple-moe
View on GitHub
Simple MoE - Day 17 of 365 Days of Repos
☆17Jan 17, 2025Updated last year
kvignesh1420 / cot-icl-lab
View on GitHub
[ACL 2025] Official implementation of the "CoT-ICL Lab" framework
☆11Oct 10, 2025Updated 4 months ago
NJUDeepEngine / CAEF
View on GitHub
Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"
☆11Oct 11, 2024Updated last year
andygeiss / cloud-native-utils
View on GitHub
A collection of high-performance, modular utilities for enhancing testing, transactional consistency, efficiency, security and stability …
☆28Jan 26, 2026Updated last month
triggerdotdev / json-schema-fns
View on GitHub
Modern utility library and typescript typings for building JSON Schema documents
☆14Nov 28, 2025Updated 3 months ago
GuoTianYu2000 / Active-Dormant-Attention
View on GitHub
codes and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs"
☆10Dec 30, 2024Updated last year
esbenkc / DarkGPT
View on GitHub
Dark Patterns in Chatbot Design
☆17Jun 15, 2024Updated last year
google-business-communications / nodejs-rcsbusinessmessaging
View on GitHub
RCS Business Messaging upgrades SMS with branding, rich media, interactivity, and analytics. With RCS, businesses can bring branded, inte…
☆13Feb 14, 2026Updated 3 weeks ago
google-business-communications / bm-nodejs-appointment-bot
View on GitHub
This sample demonstrates how to create an appointment setting agent for the Business Messages platform using Dialogflow and the Node.js S…
☆17Feb 5, 2026Updated last month
josharsh / mcp-server-boilerplate
View on GitHub
Boilerplate using one of the 'better' ways to build MCP Servers. Written using FastMCP
☆18Apr 20, 2025Updated 10 months ago
TsinghuaC3I / FS-GEN
View on GitHub
Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative Decoding.
☆13Nov 19, 2024Updated last year
ArcadeAI / social-media-agent
View on GitHub
📲 An agent for sourcing, curating, and scheduling social media posts with human-in-the-loop.
☆12Apr 18, 2025Updated 10 months ago
wiio12 / POETRY
View on GitHub
Code for the paper: Proving Theorems Recursively
☆12May 23, 2024Updated last year
liusida / ComfyUI-Notebook
View on GitHub
A Jupyter-style custom node for executing Python code and plotting within ComfyUI workflows.
☆35Dec 16, 2025Updated 2 months ago
idoatad / TensorLens
View on GitHub
☆43Jan 27, 2026Updated last month
pollen-robotics / Reachy2-UnityDigitalTwin
View on GitHub
Reachy2 Unity package to mirror a real or fake robot's state
☆18Jul 18, 2025Updated 7 months ago
kumo-ai / kumo-rfm-mcp
View on GitHub
🔬 MCP server to query KumoRFM in your agentic flows
☆29Updated this week
jonathanscholtes / Azure-AI-RAG-Architecture-React-FastAPI-and-Cosmos-DB-Vector-Store
View on GitHub
This project demonstrates deploying a secure, scalable Generative AI (GenAI) solution on Azure using a Retrieval-Augmented Generation (RA…
☆17Feb 27, 2025Updated last year
traceloop / openllmetry-ruby
View on GitHub
Sister project to OpenLLMetry, but in Ruby. Open-source observability for your LLM application, based on OpenTelemetry
☆14Feb 9, 2026Updated 3 weeks ago
lmarena / search-arena
View on GitHub
⚔️ [ICLR 2026] Official code of "Search Arena: Analyzing Search-Augmented LLMs".
☆52Feb 23, 2026Updated last week
SonyResearch / SVG_baseline
View on GitHub
to release the source code for reproducing the results reported in our paper: https://arxiv.org/abs/2409.17550
☆14Nov 15, 2024Updated last year
a-r-r-o-w / productionizing-diffusion
View on GitHub
Optimizing diffusion for production-ready speeds
☆37Jan 10, 2026Updated last month
ws-jiang / awesome-sharpeness-aware-minimization
View on GitHub
☆11Jun 20, 2023Updated 2 years ago