cloneofsimo / insightful-nn-papers
These papers will provide unique insightful concepts that will broaden your perspective on neural networks and deep learning
☆47Updated last year
Alternatives and similar repositories for insightful-nn-papers:
Users that are interested in insightful-nn-papers are comparing it to the libraries listed below
- Simple implementation of muP, based on Spectral Condition for Feature Learning. The implementation is SGD only, dont use it for Adam☆73Updated 6 months ago
- ☆51Updated last year
- WIP☆93Updated 6 months ago
- ☆46Updated 3 months ago
- A general framework for inference-time scaling and steering of diffusion models with arbitrary rewards.☆84Updated last month
- ☆75Updated 7 months ago
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆121Updated 9 months ago
- supporting pytorch FSDP for optimizers☆76Updated 2 months ago
- ☆33Updated 5 months ago
- ☆21Updated 7 months ago
- Flexibly track outputs and grad-outputs of torch.nn.Module.☆13Updated last year
- ☆26Updated 9 months ago
- Model Stock: All we need is just a few fine-tuned models☆102Updated 4 months ago
- ☆26Updated 3 weeks ago
- My take on Flow Matching☆36Updated last month
- Censored Sampling of Diffusion Models Using 3 Minutes of Human Feedback☆27Updated last year
- CLOOB training (JAX) and inference (JAX and PyTorch)☆70Updated 2 years ago
- Focused on fast experimentation and simplicity☆65Updated last month
- ☆51Updated 8 months ago
- ☆33Updated 5 months ago
- Minimal Implementation of a D3PM in pytorch☆195Updated 9 months ago
- Official repo for Detecting, Explaining, and Mitigating Memorization in Diffusion Models (ICLR 2024)☆65Updated 10 months ago
- A JAX implementation of the continuous time formulation of Consistency Models☆84Updated last year
- Language models scale reliably with over-training and on downstream tasks☆96Updated 10 months ago
- [ICML 2023] Reflected Diffusion Models (https://arxiv.org/abs/2304.04740)☆157Updated last year
- ☆88Updated 8 months ago
- ☆24Updated 2 months ago
- Official implementation of the paper The Hidden Language of Diffusion Models☆70Updated last year
- Exploration into the proposed "Self Reasoning Tokens" by Felipe Bonetto☆55Updated 8 months ago
- Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO).☆67Updated 8 months ago