BlackHC / neural_net_checklist
☆139Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for neural_net_checklist
- ☆128Updated this week
- Scalable neural net training via automatic normalization in the modular norm.☆121Updated 3 months ago
- Diffusion models in PyTorch☆87Updated last month
- The AdEMAMix Optimizer: Better, Faster, Older.☆172Updated 2 months ago
- ☆292Updated 4 months ago
- The boundary of neural network trainability is fractal☆161Updated 9 months ago
- For optimization algorithm research and development.☆449Updated this week
- Just some miscellaneous utility functions / decorators / modules related to Pytorch and Accelerate to help speed up implementation of new…☆119Updated 3 months ago
- A State-Space Model with Rational Transfer Function Representation.☆70Updated 6 months ago
- Sequence Modeling with Multiresolution Convolutional Memory (ICML 2023)☆120Updated last year
- Universal Tensor Operations in Einstein-Inspired Notation for Python.☆328Updated last month
- Official Implementation of "ADOPT: Modified Adam Can Converge with Any β2 with the Optimal Rate"☆325Updated last week
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources☆95Updated 2 weeks ago
- Scalable and Performant Data Loading☆66Updated this week
- Multidimensional indexing for tensors☆113Updated last year
- σ-GPT: A New Approach to Autoregressive Models☆59Updated 3 months ago
- Implementation of Diffusion Transformer (DiT) in JAX☆252Updated 5 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆84Updated last week
- A simple implimentation of Bayesian Flow Networks (BFN)☆239Updated 10 months ago
- Library for Jacobian descent with PyTorch. It enables optimization of neural networks with multiple losses (e.g. multi-task learning).☆154Updated this week
- Efficient optimizers☆79Updated this week
- Uncertainty quantification with PyTorch☆328Updated 2 weeks ago
- Muon optimizer for neural networks: >30% extra sample efficiency, <3% wallclock overhead☆109Updated last week
- WIP☆89Updated 3 months ago
- Run PyTorch in JAX. 🤝☆200Updated last year
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"☆85Updated 2 months ago
- ☆197Updated 4 months ago
- ☆53Updated 10 months ago
- A MAD laboratory to improve AI architecture designs 🧪☆95Updated 6 months ago
- Accelerated First Order Parallel Associative Scan☆163Updated 3 months ago