Highly commented implementations of Transformers in PyTorch
☆138Aug 2, 2023Updated 2 years ago
Alternatives and similar repositories for commented-transformers
Users that are interested in commented-transformers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- NLP with Transformers Study Group Materials & Resources☆11Jun 26, 2023Updated 2 years ago
- ☆24May 19, 2024Updated last year
- ☆30Mar 10, 2024Updated 2 years ago
- A place to store reusable transformer components of my own creation or found on the interwebs☆76Mar 27, 2026Updated 2 weeks ago
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines☆197May 6, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Source notebook code for the course, stripped of all information. Please consider puchasing the course at https://store.walkwithfastai.co…☆36Feb 14, 2024Updated 2 years ago
- An AI/ML solution that provides a probability that a hard drive will fail within some pre-defined time period.☆13Dec 9, 2022Updated 3 years ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆83Sep 10, 2023Updated 2 years ago
- Experiments with self-supervised learning☆11Mar 9, 2020Updated 6 years ago
- In-place debugger for the fastai library and pytorch☆28Aug 25, 2021Updated 4 years ago
- Machine Learning Ops Project☆30Mar 19, 2024Updated 2 years ago
- Analysis code for knowledge discovery project☆12Sep 25, 2018Updated 7 years ago
- ☆83Apr 16, 2024Updated last year
- Transformers training in a supercomputer with the 🤗 Stack and Slurm☆15May 9, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Research repo for code that may or may not end up in fastai3☆50Jun 15, 2021Updated 4 years ago
- ☆10Feb 12, 2024Updated 2 years ago
- Map (deep learning) model weights between different model implementations.☆19Mar 9, 2026Updated last month
- ☆162Dec 2, 2024Updated last year
- Code for training a Resnet model for the Human Protein Atlas Image Classification competition☆50Nov 20, 2018Updated 7 years ago
- ☆170Jun 3, 2024Updated last year
- ☆17Jul 28, 2023Updated 2 years ago
- Apps that run on modal.com☆13Sep 14, 2025Updated 6 months ago
- ☆94Oct 5, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Fast AI Practical Deep Learning for Coders experiments in Stable Diffusion☆24Nov 10, 2022Updated 3 years ago
- ☆15Feb 28, 2022Updated 4 years ago
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆32Sep 22, 2024Updated last year
- Utilities for PyTorch distributed☆25Feb 27, 2025Updated last year
- ☆176Feb 3, 2024Updated 2 years ago
- 🤖 A PyTorch library of curated Transformer models and their composable components☆895Apr 17, 2024Updated last year
- 🛠 Python project template with unit tests, code coverage, linting, type checking, Makefile wrapper, and GitHub Actions.☆154Apr 14, 2024Updated last year
- Exploring Applications of GRPO☆252Aug 25, 2025Updated 7 months ago
- Materials for "Transformers from the Ground Up" at PyData Jeddah on August 5, 2021☆20Aug 5, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Getting started with diffusion☆685Mar 27, 2024Updated 2 years ago
- Handling long-running processes (like ML model predictions) inside a Flask app using Celery.☆12Jan 13, 2021Updated 5 years ago
- ☆63Sep 23, 2024Updated last year
- ConvGQR: Generative Query Reformulation for Conversational Search. A codebase for ACL 2023 accepted paper.☆34Mar 5, 2024Updated 2 years ago
- Port of Andrej Karpathy's nanoGPT to Apple MLX framework.☆120Feb 12, 2024Updated 2 years ago
- 🎬 analyzing three decades of movie data☆15Dec 9, 2022Updated 3 years ago
- ☆26Jan 9, 2026Updated 3 months ago