cloneofsimo / insightful-nn-papers
These papers provide unique, insightful concepts that will broaden your perspective on neural networks and deep learning.
☆48Updated last year
Alternatives and similar repositories for insightful-nn-papers:
Users who are interested in insightful-nn-papers are comparing it to the repositories listed below
- Simple implementation of muP, based on Spectral Condition for Feature Learning. The implementation is SGD-only; don't use it with Adam.☆75Updated 8 months ago
- ☆51Updated last year
- WIP☆93Updated 8 months ago
- ☆33Updated 7 months ago
- ☆78Updated 9 months ago
- A general framework for inference-time scaling and steering of diffusion models with arbitrary rewards.☆127Updated 2 months ago
- ☆22Updated 10 months ago
- Sparse Autoencoders for Stable Diffusion XL models.☆54Updated 2 weeks ago
- ☆27Updated 11 months ago
- JAX implementation of ViT-VQGAN☆82Updated 2 years ago
- ☆51Updated 10 months ago
- ☆13Updated 10 months ago
- [ICLR-2023] Rarity Score : A New Metric to Evaluate the Uncommonness of Synthesized Images☆65Updated 2 years ago
- Censored Sampling of Diffusion Models Using 3 Minutes of Human Feedback☆27Updated last year
- Minimal (400 LOC) implementation of maximum (multi-node, FSDP) GPT training☆123Updated last year
- ☆37Updated 7 months ago
- ☆27Updated 4 months ago
- A demo for the Direct Ascent Synthesis: Hidden Generative Capabilities in Discriminative Models paper (https://arxiv.org/abs/2502.07753)☆38Updated last month
- Tiny re-implementation of MDM in the style of LLaDA and the nano-gpt speedrun☆48Updated last month
- Supporting PyTorch FSDP for optimizers☆80Updated 4 months ago
- ☆72Updated 2 years ago
- A JAX implementation of the continuous time formulation of Consistency Models☆84Updated 2 years ago
- ☆33Updated 5 months ago
- ☆20Updated 6 months ago
- Focused on fast experimentation and simplicity☆71Updated 4 months ago
- CLOOB training (JAX) and inference (JAX and PyTorch)☆71Updated 2 years ago
- Flexibly track outputs and grad-outputs of torch.nn.Module.☆13Updated last year
- Model Stock: All we need is just a few fine-tuned models☆113Updated 7 months ago
- Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO).☆73Updated 10 months ago
- Official Code Repository for the paper "Continuous Diffusion Model for Language Modeling".☆25Updated last month