Norod / TrainGPT2-127M-FromScratch
A trio of Google-Colab notebooks (ipynb) for training a GPT-2 (127M) model from scratch (useful for other / non-English languages) using gpt-2-simple
☆17 Jun 29, 2020 Updated 5 years ago
Alternatives and similar repositories for TrainGPT2-127M-FromScratch
Users that are interested in TrainGPT2-127M-FromScratch are comparing it to the libraries listed below
- Heterogeneous, Task- and Domain-Specific Benchmark for Unsupervised Sentence Embeddings used in the TSDAE paper: https://arxiv.org/abs/210… ☆29 Jan 4, 2022 Updated 4 years ago
- Creates a CMM script that can be directly executed on Kaggle from an easy merge script ☆13 Jan 12, 2026 Updated last month
- Tool for sentiment analysis annotation ☆13 Mar 26, 2025 Updated 10 months ago
- protein embedding project ☆12 May 3, 2018 Updated 7 years ago
- Optimized Circuit Generation for Secure Multiparty Computation ☆12 Nov 25, 2019 Updated 6 years ago
- QMK Homebrew Formulae ☆16 May 28, 2025 Updated 8 months ago
- An old webgl learning project ☆10 Jan 14, 2015 Updated 11 years ago
- Analysis on stop reasons ☆10 Jun 17, 2024 Updated last year
- Vite + Mantine + Vanilla extract template ☆12 Feb 1, 2026 Updated last week
- The repository of the hands-on introduction to machine learning workshop of the DataLearn 2019 track at DataHack 2019. ☆10 Sep 1, 2019 Updated 6 years ago
- ☆10 Jun 29, 2021 Updated 4 years ago
- ☆14 Dec 10, 2025 Updated 2 months ago
- Diffusion for EEG ☆11 Jan 2, 2023 Updated 3 years ago
- This is a Python project that uses Selenium and OpenAI to scrape data from the web, process it with GPT-3, and generate reports based on … ☆12 Oct 28, 2025 Updated 3 months ago
- Data visualization workshop ☆11 May 12, 2020 Updated 5 years ago
- A jailbreak tweak to respring your device using the hardware buttons ☆11 Jun 9, 2020 Updated 5 years ago
- ☆13 Apr 23, 2025 Updated 9 months ago
- A command-line benchmarking tool to measure the startup times of programs in various languages ☆14 Oct 17, 2020 Updated 5 years ago
- Reverse Engineering the Tabstate files for Windows Notepad ☆10 May 1, 2024 Updated last year
- Eval LLMs ☆11 May 12, 2024 Updated last year
- XDeFi Yield Farming & XDEX on Ethereum ☆12 Apr 20, 2021 Updated 4 years ago
- incremental symbol learning for natural language understanding ☆10 Jun 12, 2023 Updated 2 years ago
- pretrained LookingGlass language model for biological read-length DNA sequences, and related models derived from transfer learning ☆15 Sep 8, 2022 Updated 3 years ago
- Using Xaml in the Win32 app model using DesktopWindowXamlSource ☆16 Jul 19, 2024 Updated last year
- 📖 A review of KGEM packages and frameworks at https://pykeen.github.io/kgem-software-review. ☆12 Jun 24, 2024 Updated last year
- ☆12 Feb 13, 2025 Updated last year
- A python tool to examine datasets for consistency. Performs approximately 150 tests. For EDA (Exploratory Data Analysis) and interpretabl… ☆10 Apr 4, 2024 Updated last year
- Bunch of notebooks for pre-training custom Saiga-like LLM ☆12 Feb 9, 2024 Updated 2 years ago
- Apple1 Replica in Javascript (Node & Browser) - https://stid.me ☆14 Oct 4, 2025 Updated 4 months ago
- Go implementation of the Peer-to-Peer Streaming Peer Protocol (rfc7574) ☆11 Sep 24, 2017 Updated 8 years ago
- Deepseek-CoT ☆10 Oct 6, 2024 Updated last year
- ☆10 Mar 20, 2021 Updated 4 years ago
- Skillset Challenge for the Apprenticeship Program, June 2021. ☆10 Jan 8, 2022 Updated 4 years ago
- A CSV formatted file using data from the Yelp Academic Dataset. ☆12 May 7, 2016 Updated 9 years ago
- An ES6 Map wrapper for the synchronous userscript storage API