BlackHC / neural_net_checklistLinks
☆210Updated last year
Alternatives and similar repositories for neural_net_checklist
Users that are interested in neural_net_checklist are comparing it to the libraries listed below
Sorting:
- An implementation of PSGD Kron second-order optimizer for PyTorch☆97Updated 4 months ago
- TensorHue is a Python library that allows you to visualize tensors right in your console, making understanding and debugging tensor conte…☆122Updated 9 months ago
- The boundary of neural network trainability is fractal☆221Updated last year
- ☆152Updated last month
- The AdEMAMix Optimizer: Better, Faster, Older.☆186Updated last year
- Getting crystal-like representations with harmonic loss☆192Updated 8 months ago
- Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition…☆188Updated last month
- Diffusion models in PyTorch☆116Updated this week
- 🧱 Modula software package☆307Updated 3 months ago
- Reliable, minimal and scalable library for pretraining foundation and world models☆98Updated 2 weeks ago
- Efficient optimizers☆276Updated 3 weeks ago
- Just some miscellaneous utility functions / decorators / modules related to Pytorch and Accelerate to help speed up implementation of new…☆125Updated last year
- Universal Notation for Tensor Operations in Python.☆450Updated 7 months ago
- For optimization algorithm research and development.☆547Updated 2 weeks ago
- Exca - Execution and caching tool for python☆109Updated last week
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources☆148Updated 2 months ago
- ☆285Updated last year
- ☆314Updated last year
- A Jax-based library for building transformers, includes implementations of GPT, Gemma, LlaMa, Mixtral, Whisper, SWin, ViT and more.☆297Updated last year
- Named tensors with first-class dimensions for PyTorch☆332Updated 2 years ago
- Official JAX implementation of xLSTM including fast and efficient training and inference code. 7B model available at https://huggingface.…☆104Updated 10 months ago
- Implementation of Diffusion Transformer (DiT) in JAX☆297Updated last year
- Running Jax in PyTorch Lightning☆114Updated 11 months ago
- Highly commented implementations of Transformers in PyTorch☆139Updated 2 years ago
- supporting pytorch FSDP for optimizers☆84Updated 11 months ago
- ☆68Updated last year
- ☆61Updated last year
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"☆103Updated 11 months ago
- MLCommons Algorithmic Efficiency is a benchmark and competition measuring neural network training speedups due to algorithmic improvement…☆401Updated this week
- A simple implimentation of Bayesian Flow Networks (BFN)☆240Updated last year