A codebase implementing a simple GPT-like model from scratch, based on the paper "Attention Is All You Need".
☆71Jan 9, 2024Updated 2 years ago
Alternatives and similar repositories for transformer
Users that are interested in transformer are comparing it to the libraries listed below.
- ☆12Dec 14, 2024Updated last year
- Generates video game music using neural networks.☆12Jun 9, 2022Updated 3 years ago
- A min-caml port to Rust☆26Feb 8, 2025Updated last year
- Sampling techniques for Candle.☆19Apr 3, 2024Updated last year
- Latest Evaluation Toolkit (LatestEval). Assessing the language models with latest, uncontaminated materials.☆29Feb 17, 2025Updated last year
- ☆12Oct 28, 2024Updated last year
- The official implementation of HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization☆18Mar 7, 2025Updated last year
- Following [An Incremental Approach to Compiler Construction](http://scheme2006.cs.uchicago.edu/11-ghuloum.pdf)☆70Nov 15, 2025Updated 4 months ago
- ☆12Sep 27, 2017Updated 8 years ago
- Code to train Sentence BERT Japanese model for Hugging Face Model Hub☆11Aug 8, 2021Updated 4 years ago
- A summarizer for Japanese articles (but ChatGPT is better)☆10Aug 1, 2022Updated 3 years ago
- A rust wrapper for HIP☆12Jun 10, 2025Updated 9 months ago
- ☆15Nov 27, 2018Updated 7 years ago
- T2NER: Transformers based Transfer Learning Framework for Named Entity Recognition (EACL 2021)☆11Sep 24, 2022Updated 3 years ago
- GoldFinch and other hybrid transformer components☆45Jul 20, 2024Updated last year
- ☆18Dec 28, 2018Updated 7 years ago
- EasyRLHF aims to provide an easy and minimal interface to train aligned language models, using off-the-shelf solutions and datasets☆10Dec 12, 2023Updated 2 years ago
- Mini Model Daemon☆12Nov 9, 2024Updated last year
- ☆17Jun 9, 2024Updated last year
- A neural network that creates new classes for humans☆24Jul 9, 2019Updated 6 years ago
- Creative mode WebGL voxel game. Runs in Chrome, with multiplayer functionality and very few dependencies☆28Updated this week
- test images with not appropriate labels in MNIST dataset☆10Mar 3, 2018Updated 8 years ago
- GoldFinch and other hybrid transformer components☆12Dec 9, 2025Updated 3 months ago
- GPU based FFT written in Rust and CubeCL☆30Feb 23, 2026Updated last month
- Chicago Social Interaction Model (chiSIM) framework repository☆12Aug 9, 2023Updated 2 years ago
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀☆20May 20, 2025Updated 10 months ago
- ☆28Oct 2, 2025Updated 5 months ago
- Implementation of Monte Carlo Tree Search☆15Aug 4, 2022Updated 3 years ago
- website for the voxel.js project☆83Oct 5, 2020Updated 5 years ago
- Fairly simple wall running code for first person or third person character controller in the Godot game engine☆30Apr 29, 2020Updated 5 years ago
- RWKV-7 mini☆12Mar 29, 2025Updated last year
- Direct Preference Optimization for RWKV, aiming for RWKV-5 and 6.☆11Mar 1, 2024Updated 2 years ago
- MIL-RBERT: A Data-driven Approach for Noise Reduction in Distantly Supervised Biomedical Relation Extraction (BioNLP @ ACL 2020)☆21Jun 12, 2023Updated 2 years ago
- ☆18Jun 9, 2025Updated 9 months ago
- ☆25Dec 8, 2025Updated 3 months ago
- Pretraining and finetuning for visual instruction following with Mixture of Experts☆16Jan 30, 2024Updated 2 years ago
- ☆13Feb 26, 2023Updated 3 years ago
- Bootstrapped Unsupervised Sentence Representation Learning (ACL 2021)☆30Apr 27, 2022Updated 3 years ago
- RWKV Wiki website (archived, please visit official wiki)☆11Mar 26, 2023Updated 3 years ago