Well-documented, unit-tested, type-checked, and formatted implementation of a vanilla transformer, for educational purposes.
☆289Mar 27, 2026Updated last month
Alternatives and similar repositories for transformer-from-scratch
Users who are interested in transformer-from-scratch are comparing it to the libraries listed below.
- Simple transformer implementation from scratch in pytorch. (archival, latest version on codeberg)☆1,097Mar 20, 2025Updated last year
- Repository collecting resources and best practices to improve experimental rigour in deep learning research.☆27Mar 30, 2023Updated 3 years ago
- Distributed Communication-Optimal LU-factorization Algorithm☆12Aug 1, 2021Updated 4 years ago
- A Simplified PyTorch Implementation of Vision Transformer (ViT)☆252Jun 10, 2024Updated last year
- ☆531Apr 29, 2024Updated 2 years ago
- Code and data for the paper "Disentangling Uncertainty in Machine Translation Evaluation", accepted at EMNLP 2022.☆23Jun 23, 2023Updated 2 years ago
- Fast interpolative decompositions in Python☆10Jan 4, 2021Updated 5 years ago
- Code base for the EMNLP 2021 Findings paper: Cartography Active Learning☆14Jun 3, 2025Updated 11 months ago
- PyTorch training at CSCS☆22Jul 4, 2025Updated 10 months ago
- Teaching Models to Express Their Uncertainty in Words☆38May 26, 2022Updated 3 years ago
- Pacmed Labs experiments on uncertainty estimation, focusing on unbalanced tabular data and classification tasks.☆21May 26, 2021Updated 4 years ago
- ☆10Jun 14, 2023Updated 2 years ago
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.☆32Apr 2, 2025Updated last year
- Discrete Bayesian optimization with LLMs, PEFT finetuning methods, and the Laplace approximation.☆23Jul 30, 2024Updated last year
- ☆10Sep 13, 2021Updated 4 years ago
- A community repository for benchmarking Bayesian methods☆11May 25, 2023Updated 2 years ago
- A dataset of alignment research and code to reproduce it☆78Jun 22, 2023Updated 2 years ago
- Multiple Generalized Additive Models implemented in Python (EBM, XGB, Spline, FLAM). Code for our KDD 2021 paper "How Interpretable and T…☆14Aug 15, 2021Updated 4 years ago
- CausalNLP is a practical toolkit for causal inference with text as treatment, outcome, or "controlled-for" variable.☆157Feb 6, 2025Updated last year
- Model zoo for different kinds of uncertainty quantification methods used in Natural Language Processing, implemented in PyTorch.☆55May 5, 2023Updated 3 years ago
- Code for ICML 2025 paper | Joint Localization and Activation Editing for Low-Resource Fine-Tuning☆28Jun 18, 2025Updated 11 months ago
- ☆16May 9, 2022Updated 4 years ago
- An example app that demos how to use TFLite to do automatic speech recognition on-device☆17Oct 21, 2021Updated 4 years ago
- Kotlin LLM Prompts and User Interface☆36May 8, 2026Updated last week
- Introduction to Generative Adversarial Networks☆22Oct 22, 2020Updated 5 years ago
- Continuous Bag-of-Words (CBOW) model implemented in PyTorch☆14Feb 27, 2018Updated 8 years ago
- ☆10Oct 27, 2020Updated 5 years ago
- R package for calculation of 22 CAnonical Time-series CHaracteristics☆22Oct 3, 2024Updated last year
- Riemannian metrics to measure distances in latent space of VAEs☆14Jan 7, 2019Updated 7 years ago
- Dataset used to evaluate Skill Extraction systems based on the ESCO skills taxonomy.☆17Jul 18, 2024Updated last year
- The code for Meta Learning for SGMCMC☆25Feb 21, 2019Updated 7 years ago
- This repository contains examples of using PaliGemma for tasks such as object detection, segmentation, image captioning, etc.☆22Feb 17, 2025Updated last year
- Graphical tools for Bayesian inference and posterior predictive checks☆21Sep 21, 2021Updated 4 years ago
- ☆31Sep 7, 2023Updated 2 years ago
- Imshow - Flexible and Customizable Image Display with Python☆14Dec 27, 2025Updated 4 months ago
- ☆11Aug 3, 2021Updated 4 years ago
- ☆12Oct 17, 2022Updated 3 years ago
- Monitoring of AI Regulations☆19May 30, 2021Updated 4 years ago
- ☆10Oct 4, 2022Updated 3 years ago