cloneofsimo / insightful-nn-papers
These papers will provide unique insightful concepts that will broaden your perspective on neural networks and deep learning
☆47Updated last year
Alternatives and similar repositories for insightful-nn-papers:
Users that are interested in insightful-nn-papers are comparing it to the libraries listed below
- Simple implementation of muP, based on Spectral Condition for Feature Learning. The implementation is SGD only, dont use it for Adam☆73Updated 5 months ago
- ☆51Updated last year
- WIP☆92Updated 5 months ago
- ☆75Updated 6 months ago
- ☆33Updated 4 months ago
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆121Updated 9 months ago
- supporting pytorch FSDP for optimizers☆75Updated last month
- Flexibly track outputs and grad-outputs of torch.nn.Module.☆13Updated last year
- Official code implementation for the work Preference Alignment with Flow Matching (NeurIPS 2024)☆20Updated 2 months ago
- ☆44Updated 2 months ago
- ☆21Updated 7 months ago
- ☆51Updated 7 months ago
- ☆13Updated 7 months ago
- Focused on fast experimentation and simplicity☆64Updated 3 weeks ago
- ☆24Updated last month
- ☆31Updated 4 months ago
- My take on Flow Matching☆30Updated last week
- Model Stock: All we need is just a few fine-tuned models☆99Updated 3 months ago
- [ICML 2023] Reflected Diffusion Models (https://arxiv.org/abs/2304.04740)☆157Updated last year
- ☆37Updated 8 months ago
- Replicating and dissecting the git-re-basin project in one-click-replication Colabs☆36Updated 2 years ago
- Official code for the ICML 2024 paper "The Entropy Enigma: Success and Failure of Entropy Minimization"☆47Updated 7 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆90Updated last month
- Minimal Implementation of a D3PM in pytorch☆192Updated 8 months ago
- PyTorch implementation for "Parallel Sampling of Diffusion Models", NeurIPS 2023 Spotlight☆129Updated last year
- Train VAE like a boss☆252Updated 2 months ago
- Experiment of using Tangent to autodiff triton☆74Updated 11 months ago
- ☆26Updated 8 months ago
- Official repo for Detecting, Explaining, and Mitigating Memorization in Diffusion Models (ICLR 2024)☆62Updated 9 months ago
- Code for ICLR 2023 Paper, "Stable Target Field for Reduced Variance Score Estimation in Diffusion Models”☆70Updated last year