cloneofsimo/min-max-in-dit

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/cloneofsimo/min-max-in-dit)

cloneofsimo / min-max-in-dit

☆27

Alternatives and similar repositories for min-max-in-dit

Users that are interested in min-max-in-dit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

cloneofsimo / imagenet.int8
View on GitHub
☆40Apr 27, 2024Updated 2 years ago
cloneofsimo / project_RF
View on GitHub
☆24Jun 4, 2024Updated 2 years ago
cloneofsimo / efae
View on GitHub
☆24Jun 18, 2024Updated 2 years ago
Laz4rz / mup
View on GitHub
Minimal (truly) muP implementation, consistent with TP4 and TP5 papers notation
☆14Jan 2, 2026Updated 6 months ago
cloneofsimo / min-max-gpt
View on GitHub
Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training
☆132Apr 17, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
SwayStar123 / reimei
View on GitHub
☆28Oct 7, 2025Updated 9 months ago
cloneofsimo / fim-llama-deepspeed
View on GitHub
☆33Jan 1, 2024Updated 2 years ago
cloneofsimo / ptar
View on GitHub
☆13Jun 3, 2024Updated 2 years ago
Chillee / lit-llama
View on GitHub
Simple (fast) transformer inference in PyTorch with torch.compile + lit-llama code
☆10Aug 29, 2023Updated 2 years ago
fal-ai-community / nano-mdm
View on GitHub
Tiny re-implementation of MDM in style of LLaDA and nano-gpt speedrun
☆57Mar 10, 2025Updated last year
cloneofsimo / minDinoV2
View on GitHub
☆24Oct 15, 2024Updated last year
mingukkang / FlashDecoder
View on GitHub
Official FlashDecoder Github
☆17Apr 4, 2026Updated 3 months ago
cloneofsimo / minRF
View on GitHub
Minimal implementation of scalable rectified flow transformers, based on SD3's approach
☆641Jul 1, 2024Updated 2 years ago
robincourant / blunf
View on GitHub
☆11Sep 13, 2023Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
ethansmith2000 / clip-text-directions
View on GitHub
☆20May 29, 2026Updated last month
Eliyas0007 / Pytorch-Intention
View on GitHub
Unofficial implementation of paper : Exploring the Space of Key-Value-Query Models with Intention
☆12May 24, 2023Updated 3 years ago
Helw150 / levanter
View on GitHub
Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax
☆16Jun 16, 2024Updated 2 years ago
lucidrains / holodeck-pytorch
View on GitHub
Implementation of a holodeck, written in Pytorch
☆19Nov 1, 2023Updated 2 years ago
romainloiseau / BoiteAOutilsLegistique
View on GitHub
📖 Application développée pour simplifier l'analyse et la gestion des textes juridiques français et européens en utilisant des modèles d'…
☆15Aug 20, 2025Updated 11 months ago
glassroom / heinsen_attention
View on GitHub
Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024)
☆25Jun 6, 2024Updated 2 years ago
evanatyourservice / kron_torch
View on GitHub
An implementation of PSGD Kron second-order optimizer for PyTorch
☆102Jul 24, 2025Updated last year
Infatoshi / driftin
View on GitHub
Single-step image generation at 306 FPS. Drifting vs Diffusion head-to-head on CIFAR-10.
☆45Feb 13, 2026Updated 5 months ago
cloneofsimo / karras-power-ema-tutorial
View on GitHub
☆53Jan 6, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
fal-ai / diffusion-speedrun
View on GitHub
Focused on fast experimentation and simplicity
☆77Dec 24, 2024Updated last year
robincourant / jaws
View on GitHub
☆15Oct 10, 2023Updated 2 years ago
cloneofsimo / scaling-guide
View on GitHub
WIP
☆96Aug 13, 2024Updated last year
mushanshanshan / ESLTTS
View on GitHub
ESLTTS dataset
☆16Feb 6, 2025Updated last year
graphcore-research / unit-scaling
View on GitHub
A library for unit scaling in PyTorch
☆135Jul 11, 2025Updated last year
SwayStar123 / SpeedrunDiT
View on GitHub
SR-DiT Speedrunning ImageNet Diffusion
☆139Apr 6, 2026Updated 3 months ago
ethansmith2000 / TransformerExperiments
View on GitHub
☆19Dec 4, 2025Updated 7 months ago
Dogacel / Attention-Drift
View on GitHub
Code for the paper *Attention Drift: What Speculative Decoding Models Learn*.
☆27May 12, 2026Updated 2 months ago
cloneofsimo / vqgan-training
View on GitHub
Train VAE like a boss
☆313Oct 21, 2024Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
jbaron34 / torchwindow
View on GitHub
Display tensors directly from GPU
☆12Oct 12, 2025Updated 9 months ago
flukeskywalker / nanoDD
View on GitHub
Simple Scalable Discrete Diffusion for text in PyTorch
☆37Sep 27, 2024Updated last year
SwayStar123 / microdiffusion
View on GitHub
☆49Feb 23, 2025Updated last year
pietro-sillano / SindyPendulum
View on GitHub
☆13Jun 16, 2026Updated last month
catlab-team / fantasticstyles
View on GitHub
Repository for Fantastic Style Channels and Where to Find Them: A Submodular Framework for Discovering Diverse Directions in GANs
☆28Mar 17, 2022Updated 4 years ago
john-rocky / CoreML-StyleGAN
View on GitHub
The sample project how to use MobileStyleGAN in iOS.
☆19Dec 26, 2021Updated 4 years ago
tae898 / vae-diffusion
View on GitHub
☆34Jul 8, 2025Updated last year