stas00 / ml-ways
ML/DL Math and Method notes
☆60Updated last year
Alternatives and similar repositories for ml-ways:
Users that are interested in ml-ways are comparing it to the libraries listed below
- This code repository contains the code used for my "Optimizing Memory Usage for Training LLMs and Vision Transformers in PyTorch" blog po …☆91Updated last year
- Make triton easier☆47Updated 10 months ago
- Repository containing awesome resources regarding Hugging Face tooling.☆46Updated last year
- Experiment of using Tangent to autodiff triton☆78Updated last year
- Python tools☆12Updated last year
- Context Manager to profile the forward and backward times of PyTorch's nn.Module☆83Updated last year
- A place to store reusable transformer components of my own creation or found on the interwebs☆49Updated last week
- Custom kernels in Triton language for accelerating LLMs☆18Updated last year
- train with kittens!☆57Updated 6 months ago
- ☆43Updated last year
- 🤝 Trade any tensors over the network☆30Updated last year
- ☆78Updated 9 months ago
- Train, tune, and infer Bamba model☆88Updated this week
- Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*☆82Updated last year
- PyTorch centric eager mode debugger☆47Updated 4 months ago
- Learn CUDA with PyTorch☆20Updated 2 months ago
- Cray-LM unified training and inference stack.☆22Updated 2 months ago
- Utilities for Training Very Large Models☆58Updated 7 months ago
- Slides and recordings of talks hosted by our community☆20Updated 10 months ago
- Demo of the unit_scaling library, showing how a model can be easily adapted to train in FP8.☆45Updated 9 months ago
- Write a fast kernel and run it on Discord. See how you compare against the best!☆41Updated this week
- Highly commented implementations of Transformers in PyTorch☆136Updated last year
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te…☆42Updated last year
- ☆20Updated last year
- Proof-of-concept of global switching between numpy/jax/pytorch in a library.☆18Updated 10 months ago
- Template repo for Python projects, especially those focusing on machine learning and/or deep learning.☆15Updated last week
- Collection of autoregressive model implementation☆85Updated 2 months ago
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆17Updated last month
- Gzip and nearest neighbors for text classification☆56Updated last year
- The simplest implementation of recent Sparse Attention patterns for efficient LLM inference.☆59Updated 3 months ago