A trio of Google-Colab notebooks (ipynb) for training a GPT-2 (127M) model from scratch (useful for other / non-English languages) using gpt-2-simple
☆17Jun 29, 2020Updated 5 years ago
Alternatives and similar repositories for TrainGPT2-127M-FromScratch
Users that are interested in TrainGPT2-127M-FromScratch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Disease Pattern Miner is a free, open-source mining framework for interactively discovering sequential disease patterns in medical health…☆12Mar 21, 2019Updated 7 years ago
- A collection of handy ML and data visualization and validation tools. Go ahead and train, evaluate and validate your ML models and data w…☆22Aug 12, 2025Updated 7 months ago
- Hebrew text generation models based on EleutherAI's gpt-neo. Each was trained on a TPUv3-8 made avilable via TPU Research Cloud Program.☆22Jul 6, 2022Updated 3 years ago
- Kanban board made with TailwindCSS☆11Jun 10, 2021Updated 4 years ago
- Streamlit OpenAI app to chat with custom text documents of all kinds☆13Sep 26, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Creates CMM script that can directly executed on Kaggle from easy merge script☆14Mar 6, 2026Updated 3 weeks ago
- Synthesizing and manipulating 2048x1024 images with conditional GANs☆33Oct 20, 2022Updated 3 years ago
- Boids simulation implemented with Compute Shader and VFX Graph.☆14May 15, 2024Updated last year
- ☆36Jan 28, 2021Updated 5 years ago
- A Model Agnostic function to directly remove specified layers from the LLM☆10May 23, 2024Updated last year
- Code for the NeurIPS 2020 paper Efficient Exact Verification of Binarized Neural Networks☆13Jun 30, 2022Updated 3 years ago
- Postman & Chatbot Arena for inference benchmarking.☆14Jun 19, 2025Updated 9 months ago
- ☆34Dec 8, 2014Updated 11 years ago
- A translation of Apple's sample code SceneKit State of the Union Demo into Swift☆15Apr 10, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- How to make a custom SCNGeometry☆19Nov 27, 2022Updated 3 years ago
- Playing with CSM☆22Mar 14, 2025Updated last year
- HeBERT: Pre-training BERT for modern Hebrew☆81Jun 15, 2023Updated 2 years ago
- A clone of Twitter made using React, Firebase☆13May 17, 2021Updated 4 years ago
- ☆10Sep 16, 2015Updated 10 years ago
- An application that brings together several anime streaming platforms☆11Mar 1, 2025Updated last year
- ☆18Dec 30, 2016Updated 9 years ago
- ☆28Feb 25, 2026Updated last month
- Data visualization workshop☆11May 12, 2020Updated 5 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- CascadER: Cross-Modal Cascading for Knowledge Graph Link Prediction (arXiv 22)☆13Jun 17, 2022Updated 3 years ago
- Sample code for my blog post about Custom SceneKit Geometry☆18Jun 3, 2013Updated 12 years ago
- Predict people interest in renting specific NYC apartments. The challenge combines structured data, geolocalization, time data, free text…☆18Nov 4, 2017Updated 8 years ago
- Analysis on stop reasons☆10Jun 17, 2024Updated last year
- KAN (Kolmogorov–Arnold Networks) in the MLX framework for Apple Silicon☆31Jun 18, 2025Updated 9 months ago
- Build and sign Windows MSIX packages and bundles with Rust☆24Mar 18, 2026Updated last week
- pretrained LookingGlass language model for biological read-length DNA sequences, and related models derived from transfer learning☆15Feb 19, 2026Updated last month
- The official implementation of ImageBind-LLM and Whisper-LLM from the paper "Dynamic-SUPERB: Towards A Dynamic, Collaborative, and Compre…☆21Oct 30, 2023Updated 2 years ago
- Deepseek-CoT☆10Oct 6, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- HLS proxy server in iOS app.☆21Jan 25, 2016Updated 10 years ago
- A machine learning based object detector☆16Feb 12, 2017Updated 9 years ago
- ☆18Nov 10, 2024Updated last year
- Separating Axis Theorem test using SFML/C++.☆11Apr 7, 2017Updated 8 years ago
- A simple chat application based on Socket.IO, React, and Express.☆15Dec 15, 2021Updated 4 years ago
- ☆18Sep 10, 2025Updated 6 months ago
- The GIF-to-Chatter app you didn't know you needed!☆15Feb 12, 2022Updated 4 years ago