thib-s/flash-newton-schulz

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/thib-s/flash-newton-schulz)

thib-s / flash-newton-schulz

My attempt to improve the speed of the newton schulz algorithm, starting from the dion implementation.

☆38

Alternatives and similar repositories for flash-newton-schulz

Users that are interested in flash-newton-schulz are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

nil0x9 / flash-muon
View on GitHub
Flash-Muon: An Efficient Implementation of Muon Optimizer
☆258Jun 15, 2025Updated last year
NoahAmsel / PolarExpress
View on GitHub
☆33Jul 6, 2026Updated 3 weeks ago
zhehangdu / Newton-Muon
View on GitHub
The Newton-Muon optimizer
☆30Jun 5, 2026Updated last month
xie-lab-ml / Mano-Restriking-Manifold-Optimization-for-LLM-Training
View on GitHub
The official code of "Mano: Restriking Manifold Optimization for LLM Training".
☆25Jun 1, 2026Updated last month
LIONS-EPFL / scion
View on GitHub
☆70Apr 8, 2026Updated 3 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Sphere-AI-Lab / poet
View on GitHub
Implementation for POET and POET-X for LLM pretraining
☆38Jun 9, 2026Updated last month
edwardmilsom / function-space-learning-rates-paper
View on GitHub
Code for the paper "Function-Space Learning Rates"
☆23Jun 3, 2025Updated last year
deel-ai / orthogonium
View on GitHub
New implementations of old orthogonal layers unlock large scale training.
☆31Sep 19, 2025Updated 10 months ago
SDLAML / disco
View on GitHub
☆16Dec 11, 2025Updated 7 months ago
damek / specgd
View on GitHub
Code to generate figures of paper "When do spectral gradient updates help in deep learning?"
☆16Dec 3, 2025Updated 7 months ago
Sphere-AI-Lab / pion
View on GitHub
☆36Jul 2, 2026Updated 3 weeks ago
ULTRA-HOI / HOI4-ULTRA-Project
View on GitHub
Github Repository for the HOI4 ULTRA Project.
☆11Updated this week
Unakar / Spectral-Sphere-Optimizer
View on GitHub
Spectral Sphere Optimizer
☆131Mar 23, 2026Updated 4 months ago
ethansmith2000 / TransformerExperiments
View on GitHub
☆19Dec 4, 2025Updated 7 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
leloykun / adaptive-muon
View on GitHub
A single-line modification to any (dualizer-based) optimizer that allows the optimizer to adapt to the scale of the gradients as they cha…
☆19Jan 11, 2025Updated last year
HydraQYH / hp_rms_norm
View on GitHub
High performance RMSNorm Implement by using SM Core Storage(Registers and Shared Memory)
☆30Jan 22, 2026Updated 6 months ago
zhixuan-lin / forgetting-transformer
View on GitHub
[ICLR 2025 & COLM 2025] Official PyTorch implementation of the Forgetting Transformer and Adaptive Computation Pruning
☆150Feb 25, 2026Updated 5 months ago
open-climate-tech / firecam
View on GitHub
Detect wildfires using ML on images from cameras on vantage points
☆11Jun 6, 2026Updated last month
david3684 / AdaRank
View on GitHub
Official codebase for AdaRank: Adaptive Rank Pruning for Enhanced Model Merging (ICLR 2026)
☆20Jan 26, 2026Updated 6 months ago
test-time-training / ttt-tk
View on GitHub
☆45Nov 1, 2025Updated 8 months ago
NVIDIA-NeMo / Emerging-Optimizers
View on GitHub
☆240Updated this week
MLI-lab / early_stopping_double_descent
View on GitHub
Code for reproducing figures and results in the paper ``Early stopping in deep networks: Double descent and how to eliminate it''
☆15Jun 27, 2022Updated 4 years ago
XuezheMax / gecko-llm
View on GitHub
Gecko Architecture
☆16Jan 13, 2026Updated 6 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Infrasys-AI / aiinfra-docs
View on GitHub
☆21Nov 6, 2025Updated 8 months ago
BienLuky / CacheQuant
View on GitHub
[CVPR 2025] The official implementation of "CacheQuant: Comprehensively Accelerated Diffusion Models"
☆48Nov 2, 2025Updated 8 months ago
nikhilvyas / SOAP_MUON
View on GitHub
Combining SOAP and MUON
☆25Feb 11, 2025Updated last year
meta-pytorch / spmd_types
View on GitHub
This module defines a type system for distributed training code, based off of JAX's sharding in types, but adapted for the PyTorch ecosys…
☆35Updated this week
leoandeol / cods
View on GitHub
🐟 CODS - Conformal Object Detection and Segmentation
☆22Updated this week
Dao-AILab / gram-newton-schulz
View on GitHub
Fast Polar Decomposition for Muon
☆169Jul 2, 2026Updated 3 weeks ago
KuangjuX / AttnLink
View on GitHub
An experimental communicating attention kernel based on DeepEP.
☆34Jul 29, 2025Updated last year
tldoan / PCA-OGD
View on GitHub
Code for PCA-OGD (AISTATS 2021)
☆11Mar 16, 2021Updated 5 years ago
KuangjuX / cuda-evolve-oss
View on GitHub
Autonomous GPU kernel optimization system driven by AI agents.
☆31Mar 29, 2026Updated 4 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
xie-lab-ml / Meissonic-Inference
View on GitHub
Bag of Design Choices for Inference of High-Resolution Masked Generative Transformer
☆16Nov 21, 2024Updated last year
osayamenja / FlashMoE
View on GitHub
Distributed MoE in a Single Kernel [NeurIPS '25]
☆280May 5, 2026Updated 2 months ago
Infini-AI-Lab / gsm_infinite
View on GitHub
☆65Jun 12, 2025Updated last year
mit-han-lab / patch_conv
View on GitHub
Patch convolution to avoid large GPU memory usage of Conv2D
☆95Jan 23, 2025Updated last year
RLG-Leiden / edugym
View on GitHub
☆15Sep 22, 2023Updated 2 years ago
CarperAI / nmmo-environment
View on GitHub
Neural MMO - A Massively Multiagent Environment for Artificial Intelligence Research
☆15May 30, 2024Updated 2 years ago
leloykun / steepest-descent-lean
View on GitHub
Deriving steepest descent convergence bounds and hyperparameter scaling laws in machine learning optimization from first principles, form…
☆16Apr 11, 2026Updated 3 months ago