BlackHC / neural_net_checklist
☆150 · Updated last year
Alternatives and similar repositories for neural_net_checklist
Users who are interested in neural_net_checklist are comparing it to the repositories listed below.
- An implementation of PSGD Kron second-order optimizer for PyTorch ☆96 · Updated last month
- Diffusion models in PyTorch ☆107 · Updated 2 months ago
- TensorHue is a Python library that allows you to visualize tensors right in your console, making understanding and debugging tensor conte… ☆119 · Updated 6 months ago
- Getting crystal-like representations with harmonic loss ☆194 · Updated 4 months ago
- ☆275 · Updated last year
- 🧱 Modula software package ☆225 · Updated last week
- The boundary of neural network trainability is fractal ☆215 · Updated last year
- Universal Tensor Operations in Einstein-Inspired Notation for Python. ☆399 · Updated 4 months ago
- ☆207 · Updated 8 months ago
- Exca - Execution and caching tool for python ☆99 · Updated last week
- Efficient optimizers ☆254 · Updated 3 weeks ago
- Just some miscellaneous utility functions / decorators / modules related to Pytorch and Accelerate to help speed up implementation of new… ☆124 · Updated last year
- Minimal GPT (~350 lines with a simple task to test it) ☆62 · Updated 8 months ago
- Run PyTorch in JAX. 🤝 ☆277 · Updated last week
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources ☆143 · Updated 3 months ago
- A Jax-based library for building transformers, includes implementations of GPT, Gemma, LlaMa, Mixtral, Whisper, SWin, ViT and more. ☆290 · Updated 11 months ago
- ☆307 · Updated last year
- Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition… ☆179 · Updated last week
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training ☆131 · Updated last year
- Implementation of Diffusion Transformer (DiT) in JAX ☆291 · Updated last year
- Scalable and Performant Data Loading ☆291 · Updated this week
- ☆65 · Updated 9 months ago
- For optimization algorithm research and development. ☆530 · Updated last week
- ☆21 · Updated last year
- supporting pytorch FSDP for optimizers ☆84 · Updated 8 months ago
- Uncertainty quantification with PyTorch ☆369 · Updated 4 months ago
- Interactive textbook on state-space models ☆197 · Updated last year
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients" ☆101 · Updated 8 months ago
- ☆115 · Updated 2 months ago
- An implementation of the Llama architecture, to instruct and delight ☆21 · Updated 2 months ago