BlackHC / neural_net_checklist
☆150 Updated 11 months ago
Alternatives and similar repositories for neural_net_checklist
Users who are interested in neural_net_checklist are comparing it to the libraries listed below.
- An implementation of the PSGD Kron second-order optimizer for PyTorch ☆94 Updated last week
- Getting crystal-like representations with harmonic loss ☆192 Updated 4 months ago
- PyTorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition… ☆180 Updated last week
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources ☆143 Updated 2 months ago
- ☆206 Updated 8 months ago
- The boundary of neural network trainability is fractal ☆213 Updated last year
- TensorHue is a Python library that allows you to visualize tensors right in your console, making understanding and debugging tensor conte… ☆119 Updated 5 months ago
- Efficient optimizers ☆252 Updated last week
- 🧱 Modula software package ☆210 Updated last week
- Diffusion models in PyTorch ☆107 Updated last month
- Scalable and Performant Data Loading ☆290 Updated last week
- ☆304 Updated last year
- ☆43 Updated 2 months ago
- ☆275 Updated last year
- Exca - Execution and caching tool for Python ☆89 Updated this week
- Just some miscellaneous utility functions / decorators / modules related to PyTorch and Accelerate to help speed up implementation of new… ☆124 Updated last year
- Universal Tensor Operations in Einstein-Inspired Notation for Python. ☆392 Updated 3 months ago
- For optimization algorithm research and development. ☆525 Updated this week
- ☆65 Updated 8 months ago
- ☆115 Updated last month
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients" ☆101 Updated 7 months ago
- The AdEMAMix Optimizer: Better, Faster, Older. ☆184 Updated 10 months ago
- A JAX-based library for building transformers, includes implementations of GPT, Gemma, LlaMa, Mixtral, Whisper, SWin, ViT and more. ☆291 Updated 11 months ago
- CIFAR-10 speedruns: 94% in 2.6 seconds and 96% in 27 seconds ☆274 Updated 2 weeks ago
- A State-Space Model with Rational Transfer Function Representation. ☆79 Updated last year
- WIP ☆93 Updated 11 months ago
- Supporting PyTorch FSDP for optimizers ☆84 Updated 7 months ago
- σ-GPT: A New Approach to Autoregressive Models ☆67 Updated 11 months ago
- Dion optimizer algorithm ☆193 Updated this week
- ☆21 Updated last year