A trio of Google-Colab notebooks (ipynb) for training a GPT-2 (127M) model from scratch (useful for other / non-English languages) using gpt-2-simple
☆17Jun 29, 2020Updated 5 years ago
Alternatives and similar repositories for TrainGPT2-127M-FromScratch
Users that are interested in TrainGPT2-127M-FromScratch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Abstractive text summarization done with the help of LSTMs using encoder-decoder model which was able to achieve accuracy of 77.27% on t…☆10Sep 22, 2020Updated 5 years ago
- Ancient greek dictionary☆12Feb 14, 2016Updated 10 years ago
- Code for the paper "Scene-to-Patch Earth Observation: Multiple Instance Learning for Land Cover Classification".☆14Nov 16, 2022Updated 3 years ago
- Dataset: Fighting the COVID-19 Infodemic: Modeling the Perspective of Journalists, Fact-Checkers, Social Media Platforms, Policy Makers, …☆12Nov 29, 2021Updated 4 years ago
- Small tutorial on how you can use BERT for Topic Modeling☆18Jun 1, 2021Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A text summarizer using Seq2Seq model☆14Sep 7, 2021Updated 4 years ago
- Kanban board made with TailwindCSS☆11Jun 10, 2021Updated 5 years ago
- Creates CMM script that can directly executed on Kaggle from easy merge script☆14Mar 6, 2026Updated 3 months ago
- Land Use Classification CNN model using satellite imagery☆16Feb 7, 2026Updated 4 months ago
- Learning Deep Disentangled Embeddings with the F-Statistic Loss (NIPS 2018)☆10Oct 17, 2018Updated 7 years ago
- propositional satisfiability problem (SAT) goes neural and deep☆12Aug 17, 2021Updated 4 years ago
- ☆36Jan 28, 2021Updated 5 years ago
- Code for the NeurIPS 2020 paper Efficient Exact Verification of Binarized Neural Networks☆13Jun 30, 2022Updated 3 years ago
- Residual Dense Network for Super Resolution implementation in Keras☆16Oct 18, 2020Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Fourth edition of VNN COMP (2023)☆16Apr 12, 2023Updated 3 years ago
- ControlNet control image preprocess library☆15Feb 27, 2023Updated 3 years ago
- Sparse Autoencoders (SAE) vs CLIP fine-tuning fun.☆18Dec 19, 2024Updated last year
- Code for making #GANterpretations☆23Nov 30, 2020Updated 5 years ago
- Using Pubmed data and tf.keras to predict if an ophthalmology paper will make it into a top 15 journal☆10May 4, 2020Updated 6 years ago
- DenseQMC: A bit-slice implementation of the Quine-McCluskey algorithm☆16Dec 30, 2025Updated 5 months ago
- A simple web-app for generating glassmorphism UI effect!☆12Aug 5, 2023Updated 2 years ago
- ☆10Jun 29, 2021Updated 4 years ago
- HeBERT: Pre-training BERT for modern Hebrew☆80Jun 15, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆13May 24, 2026Updated 3 weeks ago
- A clone of Twitter made using React, Firebase☆13May 17, 2021Updated 5 years ago
- protein embedding project☆12May 3, 2018Updated 8 years ago
- ☆18Mar 30, 2025Updated last year
- ☆25Feb 25, 2023Updated 3 years ago
- Document classification into four defined categories (World, Sports, Business, Sci/Tech). Text Pre-processing using NLTK. Trained with di…☆26Aug 18, 2020Updated 5 years ago
- List of direct speech-to-speech translation papers.☆39Jan 31, 2023Updated 3 years ago
- Data visualization workshop☆11May 12, 2020Updated 6 years ago
- Predict people interest in renting specific NYC apartments. The challenge combines structured data, geolocalization, time data, free text…☆18Nov 4, 2017Updated 8 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Analysis on stop reasons☆10Jun 17, 2024Updated last year
- pretrained LookingGlass language model for biological read-length DNA sequences, and related models derived from transfer learning☆17Feb 19, 2026Updated 3 months ago
- ☆19Nov 10, 2024Updated last year
- The GIF-to-Chatter app you didn't know you needed!☆15Feb 12, 2022Updated 4 years ago
- Scripts to process aerial imagery☆33Dec 8, 2020Updated 5 years ago
- Researching the forward-backward algorithm☆11Aug 3, 2018Updated 7 years ago
- The advanced TypeScript Discord Moderation & Utilities bot made for big public server(s). Fully written in TypeScript and discord.js.☆10Aug 3, 2023Updated 2 years ago