A trio of Google-Colab notebooks (ipynb) for training a GPT-2 (127M) model from scratch (useful for other / non-English languages) using gpt-2-simple
☆17Jun 29, 2020Updated 5 years ago
Alternatives and similar repositories for TrainGPT2-127M-FromScratch
Users that are interested in TrainGPT2-127M-FromScratch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Ancient greek dictionary☆12Feb 14, 2016Updated 10 years ago
- Human Readable Unique Identifiers for Python☆14Nov 2, 2019Updated 6 years ago
- A collection of handy ML and data visualization and validation tools. Go ahead and train, evaluate and validate your ML models and data w…☆23Aug 12, 2025Updated 9 months ago
- Hebrew text generation models based on EleutherAI's gpt-neo. Each was trained on a TPUv3-8 made avilable via TPU Research Cloud Program.☆22Jul 6, 2022Updated 3 years ago
- Bunch of notebooks for pre-training custom Saiga-like LLM☆12Feb 9, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Using Conditional Random Fields for segmenting Latin words written in scriptio continua☆10May 30, 2018Updated 7 years ago
- Colab Notebooks for using Instant-NGP with View Control☆11Oct 7, 2023Updated 2 years ago
- memo☆13Dec 22, 2022Updated 3 years ago
- Synthesizing and manipulating 2048x1024 images with conditional GANs☆33Oct 20, 2022Updated 3 years ago
- Wrapper for Shikimori API☆10Apr 17, 2023Updated 3 years ago
- Grammar exercises generated from books & subtitles☆21Jan 9, 2024Updated 2 years ago
- A copy of the DirectX Headers from MinGW-64.☆14Sep 7, 2023Updated 2 years ago
- ☆36Jan 28, 2021Updated 5 years ago
- Residual Dense Network for Super Resolution implementation in Keras☆16Oct 18, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Fourth edition of VNN COMP (2023)☆16Apr 12, 2023Updated 3 years ago
- A tool for the geospatial analysis, literary network visualization, and plot mapping of ancient texts☆15Sep 16, 2018Updated 7 years ago
- Code for making #GANterpretations☆23Nov 30, 2020Updated 5 years ago
- Eval LLMs☆11May 12, 2024Updated 2 years ago
- UnRPA + UnRPYC + CMD = De_RenPy [afternoon project]☆16Aug 18, 2024Updated last year
- A simple web-app for generating glassmorphism UI effect!☆12Aug 5, 2023Updated 2 years ago
- Fast Artistic Videos in pyTorch☆14Oct 3, 2023Updated 2 years ago
- GPT-2 User Interface based on HuggingFace's Pytorch Implementation☆56Jul 25, 2024Updated last year
- ☆10Jun 29, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- HeBERT: Pre-training BERT for modern Hebrew☆80Jun 15, 2023Updated 2 years ago
- ☆13Updated this week
- protein embedding project☆12May 3, 2018Updated 8 years ago
- CascadER: Cross-Modal Cascading for Knowledge Graph Link Prediction (arXiv 22)☆13Jun 17, 2022Updated 3 years ago
- My personal cheat sheets☆12May 5, 2026Updated 3 weeks ago
- Predict people interest in renting specific NYC apartments. The challenge combines structured data, geolocalization, time data, free text…☆18Nov 4, 2017Updated 8 years ago
- Analysis on stop reasons☆10Jun 17, 2024Updated last year
- 🏛 Classical studies (Latin and Ancient Greek) resources: software, code and raw data☆22May 11, 2016Updated 10 years ago
- A library for your Ren'Py 8+ project☆15Oct 3, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Simply localize text inside your HTML webpage.☆20Oct 31, 2022Updated 3 years ago
- pretrained LookingGlass language model for biological read-length DNA sequences, and related models derived from transfer learning☆17Feb 19, 2026Updated 3 months ago
- This repository contains material of a teaching innovation project in Universitat de Barcelona: "Intelligent Support System for Tutor of …☆10Jun 30, 2020Updated 5 years ago
- ☆19Nov 10, 2024Updated last year
- The GIF-to-Chatter app you didn't know you needed!☆15Feb 12, 2022Updated 4 years ago
- In-browser OCR of Ancient Greek and Latin☆27May 18, 2026Updated last week
- The advanced TypeScript Discord Moderation & Utilities bot made for big public server(s). Fully written in TypeScript and discord.js.☆10Aug 3, 2023Updated 2 years ago