☆46 Jul 21, 2025 Updated 7 months ago
Alternatives and similar repositories for assignment3-scaling
Users who are interested in assignment3-scaling are comparing it to the repositories listed below
- ☆41 Jul 21, 2025 Updated 7 months ago
- ☆114 Jul 21, 2025 Updated 7 months ago
- Student version of Assignment 2 for Stanford CS336 - Language Modeling From Scratch ☆179 Jul 25, 2025 Updated 7 months ago
- ☆29 Nov 30, 2025 Updated 3 months ago
- ☆25 Feb 20, 2026 Updated 2 weeks ago
- The Full Spectrum of Deepnet Hessians at Scale: Dynamics with SGD Training and Sample Size ☆19 May 19, 2019 Updated 6 years ago
- Student version of Assignment 1 for Stanford CS336 - Language Modeling From Scratch ☆1,295 Aug 29, 2025 Updated 6 months ago
- ☆35 Jul 5, 2023 Updated 2 years ago
- Easy Setup, File-based, Offline Capable Federated Learning and Computations ☆22 Feb 11, 2026 Updated 3 weeks ago
- Photonic Quantum Machine Learning Framework ☆19 Feb 18, 2026 Updated 2 weeks ago
- ☆14 Dec 20, 2021 Updated 4 years ago
- Code for the paper: Solving and Learning Nonlinear PDEs with Gaussian Processes ☆40 Jul 17, 2025 Updated 7 months ago
- Exploring the minimal architecture required for coherent English language generation. ☆12 Mar 5, 2025 Updated last year
- The MMFT ISO Designer is a tool that validates and generates microfluidic chip designs conforming to the ISO 22916 standard. ☆15 Feb 5, 2026 Updated last month
- This is a repository for RM2021 Software tutorial ☆11 Nov 4, 2020 Updated 5 years ago
- Tutorials for MATH 4432 Statistical Machine Learning, HKUST, Fall 2022 ☆11 Sep 17, 2024 Updated last year
- Simple MoE - Day 17 of 365 Days of Repos ☆17 Jan 17, 2025 Updated last year
- [ACL 2025] Official implementation of the "CoT-ICL Lab" framework ☆11 Oct 10, 2025 Updated 4 months ago
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines" ☆11 Oct 11, 2024 Updated last year
- A collection of high-performance, modular utilities for enhancing testing, transactional consistency, efficiency, security and stability … ☆28 Jan 26, 2026 Updated last month
- Modern utility library and typescript typings for building JSON Schema documents ☆14 Nov 28, 2025 Updated 3 months ago
- codes and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs" ☆10 Dec 30, 2024 Updated last year
- Dark Patterns in Chatbot Design ☆17 Jun 15, 2024 Updated last year
- RCS Business Messaging upgrades SMS with branding, rich media, interactivity, and analytics. With RCS, businesses can bring branded, inte… ☆13 Feb 14, 2026 Updated 3 weeks ago
- This sample demonstrates how to create an appointment setting agent for the Business Messages platform using Dialogflow and the Node.js S… ☆17 Feb 5, 2026 Updated last month
- Boilerplate using one of the 'better' ways to build MCP Servers. Written using FastMCP ☆18 Apr 20, 2025 Updated 10 months ago
- Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative Decoding. ☆13 Nov 19, 2024 Updated last year
- 📲 An agent for sourcing, curating, and scheduling social media posts with human-in-the-loop. ☆12 Apr 18, 2025 Updated 10 months ago
- Code for the paper: Proving Theorems Recursively ☆12 May 23, 2024 Updated last year
- A Jupyter-style custom node for executing Python code and plotting within ComfyUI workflows. ☆35 Dec 16, 2025 Updated 2 months ago
- ☆43 Jan 27, 2026 Updated last month
- Reachy2 Unity package to mirror a real or fake robot's state ☆18 Jul 18, 2025 Updated 7 months ago
- 🔬 MCP server to query KumoRFM in your agentic flows ☆29 Updated this week
- This project demonstrates deploying a secure, scalable Generative AI (GenAI) solution on Azure using a Retrieval-Augmented Generation (RA… ☆17 Feb 27, 2025 Updated last year
- Sister project to OpenLLMetry, but in Ruby. Open-source observability for your LLM application, based on OpenTelemetry ☆14 Feb 9, 2026 Updated 3 weeks ago
- ⚔️ [ICLR 2026] Official code of "Search Arena: Analyzing Search-Augmented LLMs". ☆52 Feb 23, 2026 Updated last week
- to release the source code for reproducing the results reported in our paper: https://arxiv.org/abs/2409.17550 ☆14 Nov 15, 2024 Updated last year
- Optimizing diffusion for production-ready speeds ☆37 Jan 10, 2026 Updated last month
- ☆11 Jun 20, 2023 Updated 2 years ago