Well-documented, unit-tested, type-checked, and formatted implementation of a vanilla transformer, for educational purposes.
☆294 · Mar 27, 2026 · Updated 3 months ago
Alternatives and similar repositories for transformer-from-scratch
Users that are interested in transformer-from-scratch are comparing it to the libraries listed below.
- Simple transformer implementation from scratch in pytorch. (archival, latest version on codeberg) ☆1,098 · Mar 20, 2025 · Updated last year
- Overview of corpora/datasets for Germanic low-resource languages and dialects. Accompanies "A Survey of Corpora for Germanic Low-Resource… ☆27 · Feb 16, 2026 · Updated 4 months ago
- Repository collecting resources and best practices to improve experimental rigour in deep learning research. ☆27 · Mar 30, 2023 · Updated 3 years ago
- Distributed Communication-Optimal LU-factorization Algorithm ☆12 · Aug 1, 2021 · Updated 4 years ago
- Repository for "BLEU Meets COMET: Combining Lexical and Neural Metrics Towards Robust Machine Translation Evaluation", accepted at EAMT 2… ☆21 · Jul 19, 2023 · Updated 2 years ago
- Code and data for the paper "Disentangling Uncertainty in Machine Translation Evaluation", accepted at EMNLP 2022. ☆23 · Jun 23, 2023 · Updated 3 years ago
- How certain is your transformer? ☆25 · Apr 25, 2021 · Updated 5 years ago
- Python package to augment multilingual data ☆15 · Feb 15, 2023 · Updated 3 years ago
- Teaching Models to Express Their Uncertainty in Words ☆38 · May 26, 2022 · Updated 4 years ago
- Pacmed Labs experiments on uncertainty estimation, focusing on unbalanced tabular data and classification tasks. ☆21 · May 26, 2021 · Updated 5 years ago
- OpenCopilot flows editor ☆12 · Oct 31, 2023 · Updated 2 years ago
- Light C++11 graph library ☆13 · Sep 16, 2021 · Updated 4 years ago
- Discrete Bayesian optimization with LLMs, PEFT finetuning methods, and the Laplace approximation. ☆23 · Jul 30, 2024 · Updated last year
- ☆13 · Jul 26, 2023 · Updated 2 years ago
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and … ☆50 · Dec 6, 2024 · Updated last year
- ☆23 · May 6, 2020 · Updated 6 years ago
- A community repository for benchmarking Bayesian methods ☆11 · May 25, 2023 · Updated 3 years ago
- Laplace Redux -- Effortless Bayesian Deep Learning ☆45 · Jun 6, 2025 · Updated last year
- A dataset of alignment research and code to reproduce it ☆80 · Jun 22, 2023 · Updated 3 years ago
- Multiple Generalized Additive Models implemented in Python (EBM, XGB, Spline, FLAM). Code for our KDD 2021 paper "How Interpretable and T… ☆14 · Aug 15, 2021 · Updated 4 years ago
- [NAACL 2022] TIE: Topological Information Enhanced Structural Reading Comprehension on Web Pages ☆22 · Jun 3, 2022 · Updated 4 years ago
- A very minimal ml project template that uses HF transformers and wandb to train a simple NN and evaluate it, in a stateless manner compat… ☆45 · Apr 8, 2023 · Updated 3 years ago
- Code for ICML 2025 paper | Joint Localization and Activation Editing for Low-Resource Fine-Tuning ☆28 · Jun 18, 2025 · Updated last year
- Mutual information estimators and benchmark ☆62 · Sep 16, 2025 · Updated 9 months ago
- Introduction to Generative Adversarial Networks ☆22 · Oct 22, 2020 · Updated 5 years ago
- ☆10 · Oct 27, 2020 · Updated 5 years ago
- Presentations from meetups and conferences ☆18 · Sep 4, 2020 · Updated 5 years ago
- Cross-modal Coherence Modeling for Caption Generation ☆11 · Jul 24, 2020 · Updated 5 years ago
- Riemannian metrics to measure distances in latent space of VAEs ☆14 · Jan 7, 2019 · Updated 7 years ago
- A repository for the EMNLP 2021 paper "Is Information Density Uniform in Task-Oriented Dialogues?" and for the CoNLL 2021 paper "Analysin… ☆10 · Jun 17, 2024 · Updated 2 years ago
- The code for Meta Learning for SGMCMC ☆25 · Feb 21, 2019 · Updated 7 years ago
- This repository contains examples of using PaliGemma for tasks such as object detection, segmentation, image captioning, etc. ☆22 · Feb 17, 2025 · Updated last year
- ☆13 · Mar 22, 2023 · Updated 3 years ago
- ☆31 · Sep 7, 2023 · Updated 2 years ago
- CrossRE: A Cross-Domain Dataset for Relation Extraction (Findings of EMNLP 2022) ☆50 · Aug 20, 2024 · Updated last year
- Imshow - Flexible and Customizable Image Display with Python ☆14 · Dec 27, 2025 · Updated 6 months ago
- ☆10 · Oct 4, 2022 · Updated 3 years ago
- FeelingBlue: A Corpus for Understanding the Emotional Connotation of Color in Context, accepted at TACL 2022, presented at ACL 2023 ☆13 · Dec 28, 2023 · Updated 2 years ago
- A graphical editor for directed graphs used for Abstract Meaning Representation (AMR) ☆13 · May 9, 2026 · Updated last month