Well documented, unit tested, type checked and formatted implementation of a vanilla transformer - for educational purposes.
☆287Mar 27, 2026Updated last week
Alternatives and similar repositories for transformer-from-scratch
Users that are interested in transformer-from-scratch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Simple transformer implementation from scratch in pytorch. (archival, latest version on codeberg)☆1,093Mar 20, 2025Updated last year
- Overview of corpora/datasets for Germanic low-resource languages and dialects. Accompanies "A Survey of Corpora for Germanic Low-Resource…☆27Feb 16, 2026Updated last month
- Repository collecting resources and best practices to improve experimental rigour in deep learning research.☆27Mar 30, 2023Updated 3 years ago
- Distributed Communication-Optimal LU-factorization Algorithm☆12Aug 1, 2021Updated 4 years ago
- Repository for "BLEU Meets COMET: Combining Lexical and Neural Metrics Towards Robust Machine Translation Evaluation", accepted at EAMT 2…☆20Jul 19, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Communication Avoiding Numerical Dense Matrix Computations☆11Dec 20, 2020Updated 5 years ago
- Code and data for the paper "Disentangling Uncertainty in Machine Translation Evaluation", accepted at EMNLP 2022.☆23Jun 23, 2023Updated 2 years ago
- Fast interpolative decompositions in Python☆10Jan 4, 2021Updated 5 years ago
- Code base for the EMNLP 2021 Findings paper: Cartography Active Learning☆14Jun 3, 2025Updated 10 months ago
- Python package to augment multilingual data☆15Feb 15, 2023Updated 3 years ago
- CSC Training: High-Level GPU Programming☆14Oct 16, 2025Updated 5 months ago
- A curated list of awesome quantum computing resources. Inspired by the various awesome-* projects☆11Feb 26, 2017Updated 9 years ago
- OpenCopilot flows editor☆12Oct 31, 2023Updated 2 years ago
- Discrete Bayesian optimization with LLMs, PEFT finetuning methods, and the Laplace approximation.☆22Jul 30, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆10Sep 13, 2021Updated 4 years ago
- ☆13Jul 26, 2023Updated 2 years ago
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and …☆50Dec 6, 2024Updated last year
- A community repository for benchmarking Bayesian methods☆11May 25, 2023Updated 2 years ago
- Multiple Generalized Additive Models implemented in Python (EBM, XGB, Spline, FLAM). Code for our KDD 2021 paper "How Interpretable and T…☆13Aug 15, 2021Updated 4 years ago
- Model zoo for different kinds of uncertainty quantification methods used in Natural Language Processing, implemented in PyTorch.☆55May 5, 2023Updated 2 years ago
- Source code for paper "Discrete Latent Factor Model for Cross-Modal Hashing"☆18Aug 21, 2020Updated 5 years ago
- A WGAN-GP that utilizes a compositional pattern producing network as the generator☆11Sep 9, 2021Updated 4 years ago
- Experimentation with Streamlit for personal LLM tool☆15Jun 19, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Dataset used to evaluate Skill Extraction systems based on the ESCO skills taxonomy.☆17Jul 18, 2024Updated last year
- NLP course @ CS Faculty, HSE☆15Mar 4, 2020Updated 6 years ago
- ☆13Mar 22, 2023Updated 3 years ago
- ☆31Sep 7, 2023Updated 2 years ago
- Code for Paper "PMAES: Prompt-mapping Contrastive Learning for Cross-prompt Automated Essay Scoring" ACL2023☆11Oct 6, 2023Updated 2 years ago
- Developing a Korean LLM model : Hate Speech Filtering, Improving conversational skills, Finetuning with the RLHF method☆20May 27, 2025Updated 10 months ago
- FeelingBlue: A Corpus for Understanding the Emotional Connotation of Color in Context, accepted at TACL 2022, presented at ACL 2023☆13Dec 28, 2023Updated 2 years ago
- SKILLSPAN: Competences as Spans for Skill Extraction from Job Postings☆67Feb 13, 2025Updated last year
- Code files for my medium blog☆17Oct 20, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- VCWorld: A Biological World Model for Virtual Cell Simulation☆41Feb 7, 2026Updated 2 months ago
- Experiments with representation engineering☆14Feb 28, 2024Updated 2 years ago
- A repository with the code related to experiments around context-aware machine translation☆51Sep 22, 2025Updated 6 months ago
- The application is a end-user training and evaluation system for standard knowledge graph embedding models. It was developed to optimise …☆18Mar 12, 2026Updated 3 weeks ago
- ☆12Dec 8, 2022Updated 3 years ago
- Fork of cyclops-community/ctf repository updated haphazardly, previously this was main repo location☆10Aug 7, 2018Updated 7 years ago
- A code for the paper Learning Representations for Soft Skill Matching☆12Nov 17, 2023Updated 2 years ago