A trio of Google-Colab notebooks (ipynb) for training a GPT-2 (127M) model from scratch (useful for other / non-English languages) using gpt-2-simple
☆17Jun 29, 2020Updated 5 years ago
Alternatives and similar repositories for TrainGPT2-127M-FromScratch
Users that are interested in TrainGPT2-127M-FromScratch are comparing it to the libraries listed below.
- Bunch of notebooks for pre-training custom Saiga-like LLM☆12Feb 9, 2024Updated 2 years ago
- Using Conditional Random Fields for segmenting Latin words written in scriptio continua☆10May 30, 2018Updated 7 years ago
- This is a Python project that uses Selenium and OpenAI to scrape data from the web, process it with GPT-3, and generate reports based on …☆12Oct 28, 2025Updated 6 months ago
- Creates a CMM script that can be directly executed on Kaggle from an easy merge script☆14Mar 6, 2026Updated 2 months ago
- Synthesizing and manipulating 2048x1024 images with conditional GANs☆33Oct 20, 2022Updated 3 years ago
- Learning Deep Disentangled Embeddings with the F-Statistic Loss (NIPS 2018)☆10Oct 17, 2018Updated 7 years ago
- Discover how to build vision transformer from scratch with this comprehensive tutorial. Follow our step-by-step guide to create your own …☆11Apr 14, 2023Updated 3 years ago
- A copy of the DirectX Headers from MinGW-64.☆14Sep 7, 2023Updated 2 years ago
- Vite + Mantine + Vanilla extract template☆12Apr 27, 2026Updated last week
- ☆13Apr 12, 2023Updated 3 years ago
- Python API service for headless Chromium☆20Jul 19, 2020Updated 5 years ago
- ☆36Jan 28, 2021Updated 5 years ago
- propositional satisfiability problem (SAT) goes neural and deep☆12Aug 17, 2021Updated 4 years ago
- a mac app that converts videos to the azimuthal projection (tiny planet)☆13Jun 20, 2017Updated 8 years ago
- ☆10Nov 7, 2017Updated 8 years ago
- ☆17Feb 12, 2025Updated last year
- First instruction-tuning dataset distilled from Claude2 (52k Alpaca prompts)!☆13Oct 22, 2023Updated 2 years ago
- Postman & Chatbot Arena for inference benchmarking.☆14Jun 19, 2025Updated 10 months ago
- Fourth edition of VNN COMP (2023)☆16Apr 12, 2023Updated 3 years ago
- Explore a floating dice by moving your iPhone around.☆17Jul 30, 2013Updated 12 years ago
- FastSpec: Scalable Generation and Detection of Spectre Gadgets Using Neural Embeddings☆13Apr 12, 2023Updated 3 years ago
- Heterogeneous, Task- and Domain-Specific Benchmark for Unsupervised Sentence Embeddings used in the TSDAE paper: https://arxiv.org/abs/210…☆29Jan 4, 2022Updated 4 years ago
- Apple DeviceCheck server implementation on Cloudflare Workers☆21Feb 27, 2026Updated 2 months ago
- Code for making #GANterpretations☆23Nov 30, 2020Updated 5 years ago
- Team project using natural language to query blockchain data☆13Apr 17, 2023Updated 3 years ago
- Implementation of Kronecker Attention in Pytorch☆20Sep 12, 2020Updated 5 years ago
- iOS Expense Tracker App in SwiftUI☆11Apr 10, 2024Updated 2 years ago
- Eval LLMs☆11May 12, 2024Updated last year
- DenseQMC: A bit-slice implementation of the Quine-McCluskey algorithm☆16Dec 30, 2025Updated 4 months ago
- A sample iOS project for a real-time capture effect with OpenCV (Swift & Objective-C)☆14Sep 21, 2015Updated 10 years ago
- Fast Artistic Videos in pyTorch☆14Oct 3, 2023Updated 2 years ago
- Reverse AVAsset Video☆11Feb 6, 2016Updated 10 years ago
- Deep learning library implemented from scratch in numpy. Mixtral, Mamba, LLaMA, GPT, ResNet, and other experiments.☆55Apr 12, 2024Updated 2 years ago
- HeBERT: Pre-training BERT for modern Hebrew☆81Jun 15, 2023Updated 2 years ago
- A clone of Twitter made using React, Firebase☆13May 17, 2021Updated 4 years ago
- protein embedding project☆12May 3, 2018Updated 8 years ago
- ☆21Jul 25, 2024Updated last year
- Data visualization workshop☆11May 12, 2020Updated 5 years ago
- AR app that tracks the location of a car☆17Mar 10, 2018Updated 8 years ago