Highly commented implementations of Transformers in PyTorch
☆138Aug 2, 2023Updated 2 years ago
Alternatives and similar repositories for commented-transformers
Users who are interested in commented-transformers are comparing it to the libraries listed below.
- ☆30Mar 10, 2024Updated last year
- A proxy for minimax-m2, enabling interleaved thinking, and tool calls.☆39Nov 21, 2025Updated 3 months ago
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines☆196May 6, 2024Updated last year
- Comprehensive analysis of the performance differences between QLoRA, LoRA, and full fine-tunes.☆83Sep 10, 2023Updated 2 years ago
- A place to store reusable transformer components of my own creation or found on the interwebs☆73Updated this week
- ☆24May 19, 2024Updated last year
- Train fastai models faster (and other useful tools)☆73Jun 4, 2025Updated 8 months ago
- ☆10Feb 12, 2024Updated 2 years ago
- Implementation of popular SOTA self-supervised learning algorithms as Fastai Callbacks.☆327Apr 12, 2023Updated 2 years ago
- Experiments with self-supervised learning☆11Mar 9, 2020Updated 5 years ago
- Apps that run on modal.com☆13Sep 14, 2025Updated 5 months ago
- ☆12Apr 22, 2024Updated last year
- Source notebook code for the course, stripped of all information. Please consider purchasing the course at https://store.walkwithfastai.co…☆36Feb 14, 2024Updated 2 years ago
- Machine Learning Ops Project☆30Mar 19, 2024Updated last year
- BERT Probe: A Python package for probing attention-based robustness to character- and word-based adversarial evaluation. Also, with recipe…☆18Jun 24, 2022Updated 3 years ago
- Transformers training on a supercomputer with the 🤗 Stack and Slurm☆15May 9, 2024Updated last year
- An AI/ML solution that provides a probability that a hard drive will fail within some pre-defined time period.☆14Dec 9, 2022Updated 3 years ago
- ☆177Feb 3, 2024Updated 2 years ago
- ☆94Oct 5, 2023Updated 2 years ago
- ☆15Feb 28, 2022Updated 4 years ago
- Documentation Sprint for the fastai deep learning library☆15May 11, 2022Updated 3 years ago
- 🛠 Python project template with unit tests, code coverage, linting, type checking, Makefile wrapper, and GitHub Actions.☆154Apr 14, 2024Updated last year
- ☆162Dec 2, 2024Updated last year
- IPython notebook copy of Andrej Karpathy's llama2.c☆23Sep 5, 2023Updated 2 years ago
- Memory-efficient transformer. Work in progress.☆19Sep 17, 2022Updated 3 years ago
- [Early Release] Quarto Extension for Automatic Language Tabs☆24Nov 26, 2024Updated last year
- 🤖 A PyTorch library of curated Transformer models and their composable components☆894Apr 17, 2024Updated last year
- Port of Andrej Karpathy's nanoGPT to Apple MLX framework.☆117Feb 12, 2024Updated 2 years ago
- CUDA and Triton implementations of Flash Attention with SoftmaxN.☆73May 26, 2024Updated last year
- This contains notebooks and scripts used to support my writing in WILMOTT Magazine.☆17May 9, 2024Updated last year
- Exploring Applications of GRPO☆251Aug 25, 2025Updated 6 months ago
- LLM plugin providing access to Mistral models using the Mistral API☆208Jul 22, 2025Updated 7 months ago
- ☆82Apr 16, 2024Updated last year
- ImageNet-12k subset of ImageNet-21k (fall11)☆21Jun 13, 2023Updated 2 years ago
- ☆27Dec 17, 2024Updated last year
- ☆19Feb 20, 2023Updated 3 years ago
- When Reasoning Meets Its Laws☆35Jan 2, 2026Updated 2 months ago
- Useful LLM contexts ready to be used in AIMagic☆32Jan 29, 2026Updated last month
- ☆31Jan 18, 2025Updated last year