BlackHC / neural_net_checklistLinks
☆150Updated last year
Alternatives and similar repositories for neural_net_checklist
Users that are interested in neural_net_checklist are comparing it to the libraries listed below
Sorting:
- An implementation of PSGD Kron second-order optimizer for PyTorch☆96Updated 3 months ago
- Diffusion models in PyTorch☆112Updated last week
- The boundary of neural network trainability is fractal☆217Updated last year
- Getting crystal-like representations with harmonic loss☆192Updated 7 months ago
- TensorHue is a Python library that allows you to visualize tensors right in your console, making understanding and debugging tensor conte…☆121Updated 8 months ago
- For optimization algorithm research and development.☆543Updated last week
- ☆117Updated 2 weeks ago
- Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition…☆187Updated 3 weeks ago
- Efficient optimizers☆276Updated 3 weeks ago
- ☆222Updated 11 months ago
- Just some miscellaneous utility functions / decorators / modules related to Pytorch and Accelerate to help speed up implementation of new…☆123Updated last year
- 🧱 Modula software package☆303Updated 2 months ago
- Exca - Execution and caching tool for python☆108Updated last week
- The AdEMAMix Optimizer: Better, Faster, Older.☆186Updated last year
- ☆21Updated last year
- ☆285Updated last year
- ☆68Updated last year
- A Jax-based library for building transformers, includes implementations of GPT, Gemma, LlaMa, Mixtral, Whisper, SWin, ViT and more.☆297Updated last year
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources☆148Updated last month
- Uncertainty quantification with PyTorch☆375Updated last month
- Replicating and dissecting the git-re-basin project in one-click-replication Colabs☆35Updated 3 years ago
- Reliable, minimal and scalable library for pretraining foundation and world models☆82Updated this week
- A State-Space Model with Rational Transfer Function Representation.☆82Updated last year
- Minimal GPT (~350 lines with a simple task to test it)☆63Updated 11 months ago
- Universal Notation for Tensor Operations in Python.☆447Updated 7 months ago
- Deep Learning, an Energy Approach☆219Updated 5 months ago
- ☆91Updated last year
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆132Updated last year
- CIFAR-10 speedruns: 94% in 2.6 seconds and 96% in 27 seconds☆322Updated 3 months ago
- Parametric differentiable curves with PyTorch for KANs, continuous embeddings, or shape-restricted models☆38Updated this week