cloneofsimo / insightful-nn-papers
These papers will provide unique insightful concepts that will broaden your perspective on neural networks and deep learning
☆47Updated last year
Alternatives and similar repositories for insightful-nn-papers:
Users that are interested in insightful-nn-papers are comparing it to the libraries listed below
- Simple implementation of muP, based on Spectral Condition for Feature Learning. The implementation is SGD only, dont use it for Adam☆73Updated 7 months ago
- ☆51Updated last year
- WIP☆93Updated 7 months ago
- ☆33Updated 6 months ago
- ☆49Updated 4 months ago
- A general framework for inference-time scaling and steering of diffusion models with arbitrary rewards.☆103Updated 3 weeks ago
- ☆34Updated 6 months ago
- ☆75Updated 8 months ago
- ☆21Updated 8 months ago
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆122Updated 10 months ago
- ☆26Updated 10 months ago
- Official repo for Detecting, Explaining, and Mitigating Memorization in Diffusion Models (ICLR 2024)☆68Updated 11 months ago
- Flexibly track outputs and grad-outputs of torch.nn.Module.☆13Updated last year
- Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO).☆71Updated 9 months ago
- Language models scale reliably with over-training and on downstream tasks☆96Updated 11 months ago
- Replicating and dissecting the git-re-basin project in one-click-replication Colabs☆36Updated 2 years ago
- ☆24Updated 3 months ago
- A JAX implementation of the continuous time formulation of Consistency Models☆84Updated last year
- supporting pytorch FSDP for optimizers☆79Updated 3 months ago
- ☆51Updated 9 months ago
- CLOOB training (JAX) and inference (JAX and PyTorch)☆70Updated 2 years ago
- Model Stock: All we need is just a few fine-tuned models☆105Updated 5 months ago
- [ICML 2023] Reflected Diffusion Models (https://arxiv.org/abs/2304.04740)☆158Updated last year
- PyTorch implementation for "Parallel Sampling of Diffusion Models", NeurIPS 2023 Spotlight☆132Updated last year
- ☆13Updated 9 months ago
- ☆90Updated last year
- Train VAE like a boss☆270Updated 4 months ago
- Focused on fast experimentation and simplicity☆69Updated 2 months ago