BlackHC / neural_net_checklistLinks
β150Updated last year
Alternatives and similar repositories for neural_net_checklist
Users that are interested in neural_net_checklist are comparing it to the libraries listed below
Sorting:
- An implementation of PSGD Kron second-order optimizer for PyTorchβ97Updated 2 months ago
- π§± Modula software packageβ277Updated last month
- β215Updated 10 months ago
- The boundary of neural network trainability is fractalβ218Updated last year
- Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation preconditionβ¦β183Updated last week
- Exca - Execution and caching tool for pythonβ106Updated this week
- Efficient optimizersβ265Updated this week
- Just some miscellaneous utility functions / decorators / modules related to Pytorch and Accelerate to help speed up implementation of newβ¦β125Updated last year
- Diffusion models in PyTorchβ111Updated 2 weeks ago
- TensorHue is a Python library that allows you to visualize tensors right in your console, making understanding and debugging tensor conteβ¦β120Updated 7 months ago
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resourcesβ146Updated this week
- The AdEMAMix Optimizer: Better, Faster, Older.β186Updated last year
- Getting crystal-like representations with harmonic lossβ194Updated 6 months ago
- β58Updated last year
- supporting pytorch FSDP for optimizersβ84Updated 9 months ago
- β281Updated last year
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT trainingβ132Updated last year
- β120Updated 3 months ago
- β89Updated last year
- β21Updated last year
- Supporting code for the blog post on modular manifolds.β39Updated last week
- β309Updated last year
- For optimization algorithm research and development.β539Updated last week
- A Jax-based library for building transformers, includes implementations of GPT, Gemma, LlaMa, Mixtral, Whisper, SWin, ViT and more.β293Updated last year
- Universal Notation for Tensor Operations in Python.β434Updated 5 months ago
- Maximal Update Parametrization (ΞΌP) with Flax & Optax.β16Updated last year
- Because we don't have enough time to read everythingβ89Updated last year
- The simplest, fastest repository for training/finetuning medium-sized GPTs.β164Updated 3 months ago
- Easily run PyTorch on multiple GPUs & machinesβ47Updated 3 months ago
- A State-Space Model with Rational Transfer Function Representation.β81Updated last year