warner-benjamin / commented-transformers
Highly commented implementations of Transformers in PyTorch
☆136Updated last year
Alternatives and similar repositories for commented-transformers
Users that are interested in commented-transformers are comparing it to the libraries listed below
Sorting:
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines☆198Updated last year
- A miniture AI training framework for PyTorch☆42Updated 3 months ago
- deep learning with pytorch lightning☆1Updated 6 months ago
- Source notebook code for the course, stripped of all information. Please consider puchasing the course at https://store.walkwithfastai.co…☆37Updated last year
- ☆150Updated 9 months ago
- Just some miscellaneous utility functions / decorators / modules related to Pytorch and Accelerate to help speed up implementation of new…☆120Updated 9 months ago
- ☆92Updated last year
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆82Updated last year
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆231Updated 6 months ago
- A set of scripts and notebooks on LLM finetunning and dataset creation☆109Updated 7 months ago
- ☆78Updated 11 months ago
- I learn about and explain quantization☆26Updated last year
- ☆91Updated last month
- A comprehensive deep dive into the world of tokens☆223Updated 10 months ago
- Train fastai models faster (and other useful tools)☆68Updated 11 months ago
- An interactive exploration of Transformer programming.☆263Updated last year
- Fast bare-bones BPE for modern tokenizer training☆154Updated last month
- Functional local implementations of main model parallelism approaches☆95Updated 2 years ago
- ML/DL Math and Method notes☆60Updated last year
- Simple Transformer in Jax☆136Updated 10 months ago
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆132Updated 4 months ago
- Solve puzzles. Learn CUDA.☆64Updated last year
- Outlining techniques for improving the training performance of your PyTorch model without compromising its accuracy☆129Updated 2 years ago
- An introduction to LLM Sampling☆78Updated 5 months ago
- Complete implementation of Llama2 with/without KV cache & inference 🚀☆46Updated 11 months ago
- Helpers and such for working with Lambda Cloud☆51Updated last year
- Fast, Modern, Memory Efficient, and Low Precision PyTorch Optimizers☆92Updated 10 months ago
- ☆123Updated 6 months ago
- git extension for {collaborative, communal, continual} model development☆213Updated 6 months ago
- ☆302Updated 10 months ago