stanford-cs336 / assignment3-scaling
☆44Jul 21, 2025Updated 6 months ago
Alternatives and similar repositories for assignment3-scaling
Users that are interested in assignment3-scaling are comparing it to the libraries listed below
- ☆41Jul 21, 2025Updated 6 months ago
- Student version of Assignment 2 for Stanford CS336 - Language Modeling From Scratch☆165Jul 25, 2025Updated 6 months ago
- The Full Spectrum of Deepnet Hessians at Scale: Dynamics with SGD Training and Sample Size☆19May 19, 2019Updated 6 years ago
- Benchmarking Optimizers for LLM Pretraining☆50Dec 30, 2025Updated last month
- A toolkit that provides a range of model diffing techniques including a UI to visualize them interactively.☆57Updated this week
- ☆56Sep 17, 2025Updated 4 months ago
- ☆35Jul 5, 2023Updated 2 years ago
- Easy Setup, File-based, Offline Capable Federated Learning and Computations☆21Updated this week
- OpenAI 2025 ICPC Submissions☆58Sep 17, 2025Updated 4 months ago
- 🔥🔥🔥 AI security automation platform. Build visual workflows, deploy autonomous agents, and automate threat detection and response. 80+…☆27Updated this week
- Code and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs"☆10Dec 30, 2024Updated last year
- ☆11Apr 17, 2023Updated 2 years ago
- ☆39Jan 27, 2026Updated 2 weeks ago
- A project designed to build and render a full Minecraft crafting tree.☆10Aug 10, 2021Updated 4 years ago
- ☆26Updated this week
- ☆10Mar 8, 2025Updated 11 months ago
- OpenROAD Agent. This repository contains the model and supports training and testing it with the EDA Corpus dataset.☆21Jul 24, 2025Updated 6 months ago
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"☆11Oct 11, 2024Updated last year
- The MMFT ISO Designer is a tool that validates and generates microfluidic chip designs conforming to the ISO 22916 standard.☆15Feb 5, 2026Updated last week
- Official Code for What Makes and Breaks Safety Fine-tuning? A Mechanistic Study (NeurIPS 2024)☆12Oct 31, 2024Updated last year
- Boilerplate using one of the 'better' ways to build MCP Servers. Written using FastMCP☆17Apr 20, 2025Updated 9 months ago
- Simple MoE - Day 17 of 365 Days of Repos☆16Jan 17, 2025Updated last year
- [ICLR 2025] This repository contains the code to reproduce the results from our paper From Sparse Dependence to Sparse Attention: Unveili…☆12Mar 7, 2025Updated 11 months ago
- A tool to view the total transactions, received, sent, and current balance of Bitcoin wallets 👁☆17Aug 19, 2025Updated 5 months ago
- Reachy2 Unity package to mirror a real or fake robot's state☆19Jul 18, 2025Updated 6 months ago
- This repository provides the source code for the paper Reinforcement Learning for Active Perception in Autonomous Navigation.☆38Feb 5, 2026Updated last week
- A Jupyter-style custom node for executing Python code and plotting within ComfyUI workflows.☆35Dec 16, 2025Updated last month
- This sample demonstrates how to create an appointment setting agent for the Business Messages platform using Dialogflow and the Node.js S…☆17Feb 5, 2026Updated last week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆12Nov 27, 2023Updated 2 years ago
- Ops files for https://github.com/meta-llama/llama-stack☆16Jun 28, 2025Updated 7 months ago
- Code for the paper: Proving Theorems Recursively☆12May 23, 2024Updated last year
- Links to resources for the Lean Theorem Prover☆12Dec 3, 2019Updated 6 years ago
- 📲 An agent for sourcing, curating, and scheduling social media posts with human-in-the-loop.☆12Apr 18, 2025Updated 9 months ago
- 🔬 MCP server to query KumoRFM in your agentic flows☆29Feb 2, 2026Updated last week
- Covid Center Bot with Wit.AI☆12Nov 30, 2020Updated 5 years ago
- exercise for transformers-benchmarks, add 3090 benchmark☆13Feb 3, 2023Updated 3 years ago
- ⚔️ [ICLR 2026] Official code of "Search Arena: Analyzing Search-Augmented LLMs".☆49Jan 26, 2026Updated 2 weeks ago
- Optimizing diffusion for production-ready speeds☆34Jan 10, 2026Updated last month
- ☆11Jun 20, 2023Updated 2 years ago