HomebrewML/TrueGrad

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/HomebrewML/TrueGrad)

HomebrewML / TrueGrad

PyTorch interface for TrueGrad Optimizers

☆43

Alternatives and similar repositories for TrueGrad

Users that are interested in TrueGrad are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ClashLuke / tpucare
View on GitHub
Automatically take good care of your preemptible TPUs
☆37May 15, 2023Updated 3 years ago
GallagherCommaJack / modulax
View on GitHub
☆18Aug 24, 2024Updated last year
nathanbreitsch / torchmocks
View on GitHub
Test pytorch code with minimal computational overhead
☆26Jun 8, 2023Updated 3 years ago
cloneofsimo / repa-rf
View on GitHub
☆32Nov 4, 2024Updated last year
kvfrans / matrix-whitening
View on GitHub
Code for "What really matters in matrix-whitening optimizers?"
☆25Oct 31, 2025Updated 8 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
proger / nanokitchen
View on GitHub
Parallel Associative Scan for Language Models
☆18Jan 8, 2024Updated 2 years ago
zqOuO / GWT
View on GitHub
☆13May 4, 2026Updated 2 months ago
ClashLuke / PerfTorch
View on GitHub
High performance pytorch modules
☆18Jan 14, 2023Updated 3 years ago
ethansmith2000 / TransformerExperiments
View on GitHub
☆19Dec 4, 2025Updated 7 months ago
kaiokendev / cutoff-len-is-context-len
View on GitHub
Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit
☆62Jun 21, 2023Updated 3 years ago
maximzubkov / fft-scan
View on GitHub
Efficient PScan implementation in PyTorch
☆17Jan 2, 2024Updated 2 years ago
subho406 / agalite
View on GitHub
AGaLiTe: Approximate Gated Linear Transformers for Online Reinforcement Learning (Published in TMLR)
☆24Oct 15, 2024Updated last year
fal-ai-community / NativeSparseAttention
View on GitHub
research impl of Native Sparse Attention (2502.11089)
☆62Feb 19, 2025Updated last year
yoyolicoris / philtorch
View on GitHub
Fast linear discrete time filtering in PyTorch.
☆31Updated this week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
lucidrains / nim-tokenizer
View on GitHub
Implementation of a simple BPE tokenizer, but in Nim
☆22Jul 2, 2023Updated 3 years ago
zaidbhat1234 / StyleGAN2-ADA
View on GitHub
This is an implementation of Image2StyleGAN embedding algorithm and various experiments using StyleGAN2-ADA as backbone.
☆17Sep 2, 2021Updated 4 years ago
esraaelelimy / rtus
View on GitHub
Real-Time RTUs
☆12Mar 20, 2026Updated 4 months ago
cognizant-ai-labs / autoinit
View on GitHub
AutoInit: Analytic Signal-Preserving Weight Initialization for Neural Networks
☆30Oct 26, 2022Updated 3 years ago
jquesnelle / ctranslate2-rs
View on GitHub
Rust bindings for CTranslate2
☆14Jun 21, 2023Updated 3 years ago
fal-ai-community / llmdifftracker
View on GitHub
Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models)
☆32Feb 27, 2025Updated last year
teddykoker / learning-to-learn-jax
View on GitHub
JAX implementation of Learning to learn by gradient descent by gradient descent
☆29Aug 5, 2025Updated 11 months ago
kaloureyes3 / v4-clients
View on GitHub
☆10Apr 5, 2024Updated 2 years ago
evanatyourservice / llm-jax
View on GitHub
Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.
☆19Jul 24, 2025Updated 11 months ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
davisyoshida / lorax
View on GitHub
LoRA for arbitrary JAX models and functions
☆143Feb 26, 2024Updated 2 years ago
JesseFarebro / flax-mup
View on GitHub
Maximal Update Parametrization (μP) with Flax & Optax.
☆16Dec 27, 2023Updated 2 years ago
teddykoker / e3nn.c
View on GitHub
Pure C implementation of e3nn
☆25Mar 17, 2025Updated last year
HomebrewML / HomebrewNLP-torch
View on GitHub
A case study of efficient training of large language models using commodity hardware.
☆67Aug 4, 2022Updated 3 years ago
NMS05 / DinoV2-BERT-CLIP
View on GitHub
A simple PyTorch implementation of CLIP model using DinoV2 and BERT
☆16Sep 26, 2023Updated 2 years ago
gpuweb / tree-sitter-wgsl
View on GitHub
☆13Nov 27, 2025Updated 7 months ago
mgrankin / minGPT
View on GitHub
minGPT in JAX
☆49Jan 10, 2022Updated 4 years ago
nestordemeure / flaxOptimizers
View on GitHub
A collection of optimizers, some arcane others well known, for Flax.
☆29Aug 6, 2021Updated 4 years ago
sanderland / script_tok
View on GitHub
Code for the paper "BPE stays on SCRIPT", "Which Pieces Does Unigram Tokenization Really Need?" and MinGram
☆18Jun 26, 2026Updated 3 weeks ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
teddykoker / tinyloader
View on GitHub
☆71Mar 24, 2025Updated last year
brianfitzgerald / jax-mmdit
View on GitHub
Implementation of Diffusion Transformers and Rectified Flow in Jax
☆27Jul 9, 2024Updated 2 years ago
arthurliu1998 / GAN-inversion
View on GitHub
Image2StyleGAN and Image2StyleGAN++ implementation
☆28Jul 15, 2021Updated 5 years ago
cloneofsimo / scaling-guide
View on GitHub
WIP
☆96Aug 13, 2024Updated last year
lucidrains / panoptic-transformer
View on GitHub
Another attempt at a long-context / efficient transformer by me
☆38Apr 11, 2022Updated 4 years ago
vaskonov / burvec
View on GitHub
Word Embeddings for Low Resource Languages: The Case of Buryat
☆10Mar 12, 2025Updated last year
kyo-takano / chinchilla
View on GitHub
A toolkit for scaling law research ⚖
☆68Jan 27, 2025Updated last year