cloneofsimo / insightful-nn-papersLinks

These papers will provide unique insightful concepts that will broaden your perspective on neural networks and deep learning

☆48

Alternatives and similar repositories for insightful-nn-papers

Users that are interested in insightful-nn-papers are comparing it to the libraries listed below

Sorting:

cloneofsimo / ezmup
Simple implementation of muP, based on Spectral Condition for Feature Learning. The implementation is SGD only, dont use it for Adam
☆85Updated last year
cloneofsimo / karras-power-ema-tutorial
☆53Updated last year
cloneofsimo / scaling-guide
WIP
☆93Updated last year
cloneofsimo / min-max-gpt
Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training
☆132Updated last year
cloneofsimo / min-fsdp
☆91Updated last year
graphcore-research / pytorch-tensor-tracker
Flexibly track outputs and grad-outputs of torch.nn.Module.
☆13Updated 2 years ago
surkovv / sdxl-unbox
Sparse Autoencoders for Stable Diffusion XL models.
☆76Updated 3 weeks ago
ethansmith2000 / fsdp_optimizers
supporting pytorch FSDP for optimizers
☆83Updated 11 months ago
cloneofsimo / zeroshampoo
☆34Updated last year
stanislavfort / dissect-git-re-basin
Replicating and dissecting the git-re-basin project in one-click-replication Colabs
☆35Updated 3 years ago
cloneofsimo / efae
☆23Updated last year
gregorbachmann / scaling_mlps
☆52Updated last year
cloneofsimo / minSAE
☆30Updated 11 months ago
ClashLuke / tpucare
Automatically take good care of your preemptible TPUs
☆37Updated 2 years ago
mlfoundations / scaling
Language models scale reliably with over-training and on downstream tasks
☆100Updated last year
fal-ai-community / nano-mdm
Tiny re-implementation of MDM in style of LLaDA and nano-gpt speedrun
☆57Updated 8 months ago
louaaron / Reflected-Diffusion
[ICML 2023] Reflected Diffusion Models (https://arxiv.org/abs/2304.04740)
☆158Updated 2 years ago
EleutherAI / nanoGPT-mup
The simplest, fastest repository for training/finetuning medium-sized GPTs.
☆172Updated 4 months ago
patil-suraj / stable-diffusion-jax
☆90Updated 3 years ago
themrzmaster / git-re-basin-pytorch
Git Re-Basin: Merging Models modulo Permutation Symmetries in PyTorch
☆78Updated 2 years ago
igul222 / plaid
☆108Updated 2 years ago
patil-suraj / vit-vqgan
JAX implementation ViT-VQGAN
☆82Updated 3 years ago
borisdayma / clip-jax
Train vision models using JAX and 🤗 transformers
☆99Updated 2 weeks ago
fal-ai / diffusion-speedrun
Focused on fast experimentation and simplicity
☆75Updated 10 months ago
facebookresearch / EvalGIM
🦾 EvalGIM (pronounced as "EvalGym") is an evaluation library for generative image models. It enables easy-to-use, reproducible automatic…
☆88Updated 11 months ago
crowsonkb / consistency-models
A JAX implementation of the continuous time formulation of Consistency Models
☆84Updated 2 years ago
lucidrains / discrete-key-value-bottleneck-pytorch
Implementation of Discrete Key / Value Bottleneck, in Pytorch
☆88Updated 2 years ago
tmabraham / ddpo-pytorch
Reproduction of DDPO paper (RLHF for diffusion)
☆93Updated 2 years ago
crowsonkb / cloob-training
CLOOB training (JAX) and inference (JAX and PyTorch)
☆74Updated 3 years ago
dvruette / barrel-rec-pytorch
☆53Updated last year