evanatyourservice/kron_torch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/evanatyourservice/kron_torch)

evanatyourservice / kron_torch

An implementation of PSGD Kron second-order optimizer for PyTorch

☆102

Alternatives and similar repositories for kron_torch

Users that are interested in kron_torch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

lixilinx / psgd_torch
View on GitHub
Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition…
☆199May 30, 2026Updated last month
riverstone496 / awesome-second-order-optimization
View on GitHub
☆32May 17, 2026Updated 2 months ago
evanatyourservice / psgd_jax
View on GitHub
Implementation of PSGD optimizer in JAX
☆36Dec 31, 2024Updated last year
HomebrewML / HeavyBall
View on GitHub
Efficient optimizers
☆336Updated this week
opooladz / Preconditioned-Stochastic-Gradient-Descent
View on GitHub
A repo based on XiLin Li's PSGD repo that extends some of the experiments.
☆14Oct 7, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
evanatyourservice / llm-jax
View on GitHub
Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.
☆19Jul 24, 2025Updated last year
cloneofsimo / min-max-in-dit
View on GitHub
☆27May 3, 2024Updated 2 years ago
modula-systems / modula
View on GitHub
🧱 Modula software package
☆337Aug 18, 2025Updated 11 months ago
mwatkins1970 / SAE_Feature_Interpretability_Tool
View on GitHub
A tool to assist in the interpretation of learned features in sparse autoencoders (in particular the four SAE's trained by Joseph Bloom o…
☆19Oct 4, 2024Updated last year
nikhilvyas / SOAP
View on GitHub
☆275Dec 2, 2024Updated last year
Chillee / lit-llama
View on GitHub
Simple (fast) transformer inference in PyTorch with torch.compile + lit-llama code
☆10Aug 29, 2023Updated 2 years ago
kkyuhun94 / dalda
View on GitHub
[ECCV'24 Workshops Oral] DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling
☆33Feb 6, 2026Updated 5 months ago
nreimers / se-pytorch-xla
View on GitHub
☆21Sep 6, 2021Updated 4 years ago
moucheng2017 / SOP-LVM-ICL-Ensemble
View on GitHub
[NeurIPS VLM workshop 2024] In-Context Ensemble Learning from Pseudo Labels Improves Video-Language Models for Low-Level Workflow Underst…
☆23Mar 16, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
crypdick / timm-lr-scheduler-explorer
View on GitHub
A dashboard for exploring timm learning rate schedulers
☆20Nov 22, 2024Updated last year
fal-ai-community / llmdifftracker
View on GitHub
Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models)
☆32Feb 27, 2025Updated last year
AhmedZgaren / Save
View on GitHub
☆33Oct 2, 2025Updated 9 months ago
Gengzigang / TokenSet
View on GitHub
Official PyTorch implementation of TokenSet.
☆129Mar 21, 2025Updated last year
xdit-project / mochi-xdit
View on GitHub
faster parallel inference of mochi-1 video generation model
☆123Feb 25, 2025Updated last year
theAdamColton / ijepa-enhanced
View on GitHub
recipe for training fully-featured self supervised image jepa models
☆14Jun 4, 2025Updated last year
andravin / spio
View on GitHub
Experimental CUDA kernel framework unifying typed dimensions, NVRTC JIT specialization, and ML‑guided tuning.
☆47Feb 9, 2026Updated 5 months ago
maxjeblick / llm-docstring-generator
View on GitHub
☆21Apr 13, 2024Updated 2 years ago
erfanzar / Spectrax
View on GitHub
SpecTrax is a JAX-native library for neural networks and graph learning, built for performance, composability and modularity.
☆42Jul 13, 2026Updated 2 weeks ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
NVlabs / STL
View on GitHub
Official Pytorch Implementation of Self-emerging Token Labeling
☆35Mar 27, 2024Updated 2 years ago
apple / ml-dataset-decomposition
View on GitHub
Official repo of dataset-decomposition paper [NeurIPS 2024]
☆21Jan 8, 2025Updated last year
erfanzar / eformer
View on GitHub
(EasyDel Former) is a utility library designed to simplify and enhance the development in JAX
☆33Updated this week
tianyi-lab / C3PO
View on GitHub
[COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"
☆21Apr 9, 2025Updated last year
alenic / timm-models-explorer
View on GitHub
Timm model explorer
☆42Apr 12, 2024Updated 2 years ago
marin-community / haliax
View on GitHub
Named Tensors for Legible Deep Learning in JAX
☆227Nov 8, 2025Updated 8 months ago
nabla-ml / nabla
View on GitHub
Nabla: High-Performance Scientific Computing
☆345Mar 6, 2026Updated 4 months ago
dvruette / gidd-easydel
View on GitHub
☆25Dec 16, 2025Updated 7 months ago
young-geng / mintext
View on GitHub
Minimal but scalable implementation of large language models in JAX
☆34Nov 28, 2025Updated 8 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
KellerJordan / cifar10-airbench
View on GitHub
CIFAR-10 speedruns: 94% in 2.6 seconds and 96% in 27 seconds
☆386Nov 15, 2025Updated 8 months ago
cloneofsimo / zeroshampoo
View on GitHub
☆33Sep 10, 2024Updated last year
HomebrewML / Olmax
View on GitHub
HomebrewNLP in JAX flavour for maintable TPU-Training
☆50Jan 20, 2024Updated 2 years ago
VatsaDev / NanoPoor
View on GitHub
NanoGPT-speedrunning for the poor T4 enjoyers
☆72Apr 22, 2025Updated last year
ethansmith2000 / fsdp_optimizers
View on GitHub
supporting pytorch FSDP for optimizers
☆84Dec 8, 2024Updated last year
FLAIROx / cultural-accumulation
View on GitHub
☆16Jul 16, 2024Updated 2 years ago
jsikyoon / OCRL
View on GitHub
Object-Centric-Representation Library (OCRL): This repo is to explore OCR on various downstream tasks from supervised learning tasks to R…
☆12Feb 23, 2024Updated 2 years ago