Re-implementation of Andrej Karpathy's nanoGPT
☆16Feb 16, 2023Updated 3 years ago
Alternatives and similar repositories for GPT-from-scratch
Users that are interested in GPT-from-scratch are comparing it to the libraries listed below
Sorting:
- Helper package for matching addresses with an integration to Ordnance Survey API.☆10Jan 7, 2025Updated last year
- Enable moe for nanogpt.☆35Dec 11, 2023Updated 2 years ago
- Mixture of Experts from scratch☆13Apr 12, 2024Updated last year
- Simple proxy app made with HTML, Css, Javascript. Get random free Http/Https proxies.☆12Aug 25, 2024Updated last year
- Multi-turn dataset management tool for LLM trainers☆12Mar 31, 2025Updated 11 months ago
- ☆16Jul 7, 2025Updated 8 months ago
- ☆18Mar 3, 2026Updated last week
- AI21 Typescript SDK☆13Dec 18, 2025Updated 2 months ago
- Examples of how machine learning and deep learning can be applied in practice☆12Dec 8, 2022Updated 3 years ago
- gpt from 0 -> 1☆11Oct 9, 2025Updated 5 months ago
- Implement FlashAttention v2 with minimal code to learn.☆15Jun 12, 2024Updated last year
- ☆11Dec 26, 2022Updated 3 years ago
- fork of karparthy's nanogpt with custom datasets☆10Jul 25, 2023Updated 2 years ago
- Adsensor is an anti-fraud and cloaking tool built in PHP and JS☆14Nov 8, 2018Updated 7 years ago
- A YellowPage scraper is a Python program/script that extracts data from the YellowPages.com website using the Python programming language…☆11Apr 14, 2023Updated 2 years ago
- An official implementation for the EMNLP 2023 Findings paper "Prompt-Based Editing for Text Style Transfer"☆13Dec 9, 2023Updated 2 years ago
- rotating proxy server☆11Sep 17, 2024Updated last year
- GLADIS: A General and Large Acronym Disambiguation Benchmark (EACL 23)☆18Jun 24, 2024Updated last year
- MLOPS examples☆12Mar 22, 2023Updated 2 years ago
- a package to collapse the outputs of CARET's confusion matrix for storage in database tables☆16Jun 6, 2024Updated last year
- ☆13Apr 15, 2024Updated last year
- Continuous Machine Learning with Kubeflow, published by BPB Publications☆14Jul 5, 2022Updated 3 years ago
- ZLUDA PTX test suite☆21Feb 17, 2026Updated 3 weeks ago
- 博客代码:快过年了,搞个AI作曲,用TensorFlow训练midi文件☆17Dec 24, 2022Updated 3 years ago
- ☆14Nov 30, 2021Updated 4 years ago
- tensorflow implementation of GHM-C Loss☆12Apr 26, 2019Updated 6 years ago
- Performance of the C++ interface of flash attention and flash attention v2 in large language model (LLM) inference scenarios.☆15Aug 31, 2023Updated 2 years ago
- Metabypass | Javascript-based easy implementation for solving any type of captcha by Metabypass☆21Jun 21, 2023Updated 2 years ago
- Super scalar Processor design☆21Sep 7, 2014Updated 11 years ago
- 🧠 A study guide to learn about Transformers☆12Jan 11, 2024Updated 2 years ago
- Large-scale exact string matching tool☆17Mar 7, 2025Updated last year
- ☆15Dec 28, 2020Updated 5 years ago
- Start your own CAPTCHA solving business portal like https://captchas.io☆11Nov 26, 2025Updated 3 months ago
- Codes for "NAST: A Non-Autoregressive Generator with Word Alignment for Unsupervised Text Style Transfer" (ACL 2021 findings)☆15Nov 3, 2021Updated 4 years ago
- Finetune and Inference Qwen3-0.6B.☆28May 5, 2025Updated 10 months ago
- This repository open-sources our GEC system submitted by THU KELab (sz) in the CCL2023-CLTC Track 1: Multidimensional Chinese Learner Tex…☆15Nov 25, 2023Updated 2 years ago
- A program to create proxies using the Vultr API.☆18May 13, 2018Updated 7 years ago
- ☆15Mar 17, 2021Updated 4 years ago
- Code for paper: A Neural Span-Based Continual Named Entity Recognition Model☆18Dec 11, 2023Updated 2 years ago