Highly commented implementations of Transformers in PyTorch
☆139Aug 2, 2023Updated 2 years ago
Alternatives and similar repositories for commented-transformers
Users that are interested in commented-transformers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆24May 19, 2024Updated 2 years ago
- ☆30Mar 10, 2024Updated 2 years ago
- Implementation of popular SOTA self-supervised learning algorithms as Fastai Callbacks.☆327Apr 12, 2023Updated 3 years ago
- Documentation Sprint for the fastai deep learning library☆15May 11, 2022Updated 4 years ago
- A place to store reusable transformer components of my own creation or found on the interwebs☆78May 30, 2026Updated 2 weeks ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines☆197May 6, 2024Updated 2 years ago
- A proxy for minimax-m2, enabling interleaved thinking, and tool calls.☆39Nov 21, 2025Updated 6 months ago
- Source notebook code for the course, stripped of all information. Please consider puchasing the course at https://store.walkwithfastai.co…☆36Feb 14, 2024Updated 2 years ago
- An AI/ML solution that provides a probability that a hard drive will fail within some pre-defined time period.☆13Dec 9, 2022Updated 3 years ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆83Sep 10, 2023Updated 2 years ago
- Experiments with self-supervised learning☆11Mar 9, 2020Updated 6 years ago
- In-place debugger for the fastai library and pytorch☆28Aug 25, 2021Updated 4 years ago
- Machine Learning Ops Project☆30Mar 19, 2024Updated 2 years ago
- Analysis code for knowledge discovery project☆12Sep 25, 2018Updated 7 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆83Apr 16, 2024Updated 2 years ago
- Transformers training in a supercomputer with the 🤗 Stack and Slurm☆15May 9, 2024Updated 2 years ago
- Research repo for code that may or may not end up in fastai3☆50Jun 15, 2021Updated 4 years ago
- ☆10Feb 12, 2024Updated 2 years ago
- Map (deep learning) model weights between different model implementations.☆19Mar 9, 2026Updated 3 months ago
- ☆161Dec 2, 2024Updated last year
- [NAACL(2019)] Generating Knowledge Graph Paths from Textual Definitions using Sequence-to-Sequence Models☆11Apr 27, 2022Updated 4 years ago
- Code for training a Resnet model for the Human Protein Atlas Image Classification competition☆50Nov 20, 2018Updated 7 years ago
- ☆11Oct 21, 2023Updated 2 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- ☆69May 20, 2026Updated 3 weeks ago
- ☆170Jun 3, 2024Updated 2 years ago
- ☆17Jul 28, 2023Updated 2 years ago
- Apps that run on modal.com☆13Sep 14, 2025Updated 8 months ago
- ☆94Oct 5, 2023Updated 2 years ago
- ☆15Feb 28, 2022Updated 4 years ago
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆32Sep 22, 2024Updated last year
- Utilities for PyTorch distributed☆25Feb 27, 2025Updated last year
- ☆178Feb 3, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 🤖 A PyTorch library of curated Transformer models and their composable components☆895Apr 17, 2024Updated 2 years ago
- Useful LLM contexts ready to be used in AIMagic☆32Apr 6, 2026Updated 2 months ago
- 🛠 Python project template with unit tests, code coverage, linting, type checking, Makefile wrapper, and GitHub Actions.☆154Apr 14, 2024Updated 2 years ago
- Exploring Applications of GRPO☆253Aug 25, 2025Updated 9 months ago
- Getting started with diffusion☆688Mar 27, 2024Updated 2 years ago
- ConvGQR: Generative Query Reformulation for Conversational Search. A codebase for ACL 2023 accepted paper.☆35Mar 5, 2024Updated 2 years ago
- Port of Andrej Karpathy's nanoGPT to Apple MLX framework.☆120Feb 12, 2024Updated 2 years ago