Well documented, unit tested, type checked and formatted implementation of a vanilla transformer - for educational purposes.
☆293Mar 27, 2026Updated 2 months ago
Alternatives and similar repositories for transformer-from-scratch
Users that are interested in transformer-from-scratch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Simple transformer implementation from scratch in pytorch. (archival, latest version on codeberg)☆1,098Mar 20, 2025Updated last year
- The code for the video tutorial series on building a Transformer from scratch: https://www.youtube.com/watch?v=XR4VDnJzB8o☆19Apr 15, 2023Updated 3 years ago
- Repository collecting resources and best practices to improve experimental rigour in deep learning research.☆27Mar 30, 2023Updated 3 years ago
- Distributed Communication-Optimal Shuffle and Transpose Algorithm☆14Apr 18, 2026Updated last month
- Transformer implementation from scratch (in PyTorch)☆19Jun 17, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Distributed Communication-Optimal LU-factorization Algorithm☆12Aug 1, 2021Updated 4 years ago
- Repository for "BLEU Meets COMET: Combining Lexical and Neural Metrics Towards Robust Machine Translation Evaluation", accepted at EAMT 2…☆21Jul 19, 2023Updated 2 years ago
- Communication Avoiding Numerical Dense Matrix Computations☆11Dec 20, 2020Updated 5 years ago
- Code and data for the paper "Disentangling Uncertainty in Machine Translation Evaluation", accepted at EMNLP 2022.☆23Jun 23, 2023Updated 2 years ago
- How certain is your transformer?☆25Apr 25, 2021Updated 5 years ago
- Code base for the EMNLP 2021 Findings paper: Cartography Active Learning☆14Jun 3, 2025Updated last year
- Python package to augment multilingual data☆15Feb 15, 2023Updated 3 years ago
- Quantum Algorithms and Quantum Error Correction codes.☆13Feb 14, 2024Updated 2 years ago
- Code for Neural Estimation of the Rate-Distortion Function With Applications to Operational Source Coding☆14Nov 2, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Teaching Models to Express Their Uncertainty in Words☆38May 26, 2022Updated 4 years ago
- Pacmed Labs experiments on uncertainty estimation, focusing on unbalanced tabular data and classification tasks.☆21May 26, 2021Updated 5 years ago
- Discrete Bayesian optimization with LLMs, PEFT finetuning methods, and the Laplace approximation.☆23Jul 30, 2024Updated last year
- Code for "Theoretical Foundations of Deep Selective State-Space Models" (NeurIPS 2024)☆16Jan 7, 2025Updated last year
- ☆13Jul 26, 2023Updated 2 years ago
- ☆18Mar 20, 2019Updated 7 years ago
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and …☆50Dec 6, 2024Updated last year
- ☆22May 6, 2020Updated 6 years ago
- A community repository for benchmarking Bayesian methods☆11May 25, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Laplace Redux -- Effortless Bayesian Deep Learning☆45Jun 6, 2025Updated last year
- A dataset of alignment research and code to reproduce it☆79Jun 22, 2023Updated 2 years ago
- Multiple Generalized Additive Models implemented in Python (EBM, XGB, Spline, FLAM). Code for our KDD 2021 paper "How Interpretable and T…☆14Aug 15, 2021Updated 4 years ago
- Code for ICML 2025 paper | Joint Localization and Activation Editing for Low-Resource Fine-Tuning☆28Jun 18, 2025Updated 11 months ago
- Research on DeepSeek Sparse Attention☆42Oct 8, 2025Updated 8 months ago
- Cross-modal Coherence Modeling for Caption Generation☆11Jul 24, 2020Updated 5 years ago
- R package for calculation of 22 CAnonical Time-series CHaracteristics☆22Oct 3, 2024Updated last year
- A repository for the EMNLP 2021 paper "Is Information Density Uniform in Task-Oriented Dialogues?" and for the CoNLL 2021 paper "Analysin…☆10Jun 17, 2024Updated last year
- Graphical tools for Bayesian inference and posterior predictive checks☆21Sep 21, 2021Updated 4 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- ☆13Mar 22, 2023Updated 3 years ago
- ☆31Sep 7, 2023Updated 2 years ago
- This repository mirrors the principal Gitlab repository of the Chebyshev Accelerated Subspace iteration Eigensolver. If you want to contr…☆19May 5, 2026Updated last month
- ☆19Apr 28, 2021Updated 5 years ago
- Imshow - Flexible and Customizable Image Display with Python☆14Dec 27, 2025Updated 5 months ago
- FeelingBlue: A Corpus for Understanding the Emotional Connotation of Color in Context, accepted at TACL 2022, presented at ACL 2023☆13Dec 28, 2023Updated 2 years ago
- ☆12Mar 1, 2024Updated 2 years ago