warner-benjamin / commented-transformers
Highly commented implementations of Transformers in PyTorch
☆132Updated last year
Alternatives and similar repositories for commented-transformers:
Users that are interested in commented-transformers are comparing it to the libraries listed below
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines☆196Updated 8 months ago
- A miniture AI training framework for PyTorch☆39Updated this week
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆82Updated last year
- Fast bare-bones BPE for modern tokenizer training☆142Updated 3 months ago
- An introduction to LLM Sampling☆75Updated last month
- Source notebook code for the course, stripped of all information. Please consider puchasing the course at https://store.walkwithfastai.co…☆36Updated 11 months ago
- A set of scripts and notebooks on LLM finetunning and dataset creation☆101Updated 4 months ago
- Train fastai models faster (and other useful tools)☆64Updated 7 months ago
- Functional local implementations of main model parallelism approaches☆95Updated last year
- A comprehensive deep dive into the world of tokens☆215Updated 7 months ago
- ☆77Updated 8 months ago
- Helpers and such for working with Lambda Cloud☆51Updated last year
- ☆149Updated 5 months ago
- Just some miscellaneous utility functions / decorators / modules related to Pytorch and Accelerate to help speed up implementation of new…☆119Updated 6 months ago
- ☆163Updated 7 months ago
- Puzzles for exploring transformers☆331Updated last year
- A puzzle to learn about prompting☆123Updated last year
- ☆92Updated last year
- ☆147Updated last month
- An interactive exploration of Transformer programming.☆256Updated last year
- Fast, Modern, Memory Efficient, and Low Precision PyTorch Optimizers☆78Updated 6 months ago
- I learn about and explain quantization☆26Updated 9 months ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆99Updated last year
- ☆49Updated 8 months ago
- Outlining techniques for improving the training performance of your PyTorch model without compromising its accuracy☆126Updated last year
- Gzip and nearest neighbors for text classification☆56Updated last year
- ☆198Updated 11 months ago
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆121Updated 9 months ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆225Updated 2 months ago
- ☆83Updated last year