A trio of Google Colab notebooks (.ipynb) for training a GPT-2 (127M) model from scratch using gpt-2-simple (useful for non-English languages)
☆17 · Jun 29, 2020 · Updated 5 years ago
Alternatives and similar repositories for TrainGPT2-127M-FromScratch
Users that are interested in TrainGPT2-127M-FromScratch are comparing it to the libraries listed below.
- Abstractive text summarization done with the help of LSTMs using encoder-decoder model which was able to achieve accuracy of 77.27% on t… ☆10 · Sep 22, 2020 · Updated 5 years ago
- Code for the paper "Scene-to-Patch Earth Observation: Multiple Instance Learning for Land Cover Classification". ☆14 · Nov 16, 2022 · Updated 3 years ago
- Dataset: Fighting the COVID-19 Infodemic: Modeling the Perspective of Journalists, Fact-Checkers, Social Media Platforms, Policy Makers, … ☆12 · Nov 29, 2021 · Updated 4 years ago
- ☆25 · Sep 7, 2023 · Updated 2 years ago
- Hebrew text generation models based on EleutherAI's gpt-neo. Each was trained on a TPUv3-8 made available via TPU Research Cloud Program. ☆22 · Jul 6, 2022 · Updated 3 years ago
- Bunch of notebooks for pre-training custom Saiga-like LLM ☆12 · Feb 9, 2024 · Updated 2 years ago
- ☆15 · Feb 18, 2023 · Updated 3 years ago
- A text summarizer using Seq2Seq model ☆14 · Sep 7, 2021 · Updated 4 years ago
- Simple and easy tool for installing and managing openFrameworks libraries and projects ☆24 · Aug 24, 2014 · Updated 11 years ago
- ☆16 · May 25, 2019 · Updated 6 years ago
- iPad app draws beautiful 2D and 3D images using Swift and Metal ☆11 · Jan 14, 2020 · Updated 6 years ago
- Colab Notebooks for using Instant-NGP with View Control ☆11 · Oct 7, 2023 · Updated 2 years ago
- Streamlit OpenAI app to chat with custom text documents of all kinds ☆13 · Apr 8, 2026 · Updated last week
- Using Explainable Artificial Intelligence (XAI) for sentiment analysis (NLP) ☆14 · Mar 28, 2022 · Updated 4 years ago
- Implementation and helper scripts for the BART-TL model - https://www.aclweb.org/anthology/2021.eacl-main.121/ ☆17 · May 20, 2021 · Updated 4 years ago
- Wrapper for Shikimori API ☆10 · Apr 17, 2023 · Updated 2 years ago
- A copy of the DirectX Headers from MinGW-64. ☆14 · Sep 7, 2023 · Updated 2 years ago
- This lib provides you readymade video editor free hand tool such as lines, free line, angle drawing. After editing it, you can also store… ☆17 · Apr 10, 2024 · Updated 2 years ago
- Vite + Mantine + Vanilla extract template ☆12 · Apr 6, 2026 · Updated last week
- An old webgl learning project ☆10 · Jan 14, 2015 · Updated 11 years ago
- A Model Agnostic function to directly remove specified layers from the LLM ☆10 · May 23, 2024 · Updated last year
- ☆36 · Jan 28, 2021 · Updated 5 years ago
- propositional satisfiability problem (SAT) goes neural and deep ☆12 · Aug 17, 2021 · Updated 4 years ago
- a mac app that converts videos to the azimuthal projection (tiny planet) ☆13 · Jun 20, 2017 · Updated 8 years ago
- A UIImageView subclass that ignores touches on transparent pixels. Based on OBShapedButton by Ole Begemann. ☆84 · Aug 24, 2013 · Updated 12 years ago
- Promise.swift - A Promise implementation written in Swift ☆11 · Dec 15, 2015 · Updated 10 years ago
- A little project to create pixelated images. ☆16 · Mar 27, 2020 · Updated 6 years ago
- FastSpec: Scalable Generation and Detection of Spectre Gadgets Using Neural Embeddings ☆13 · Apr 12, 2023 · Updated 3 years ago
- Starter kit for full stack apps on Cloudflare developer platform. No framework lock-in. ☆22 · Apr 7, 2026 · Updated last week
- Sparse Autoencoders (SAE) vs CLIP fine-tuning fun. ☆18 · Dec 19, 2024 · Updated last year
- Code for making #GANterpretations ☆23 · Nov 30, 2020 · Updated 5 years ago
- Minimized version of the Orchis server hosted at https://orchis.cherrymint.live ☆10 · Nov 27, 2023 · Updated 2 years ago
- Latin text dataset for machine learning and procedural text generation ☆20 · Jun 3, 2024 · Updated last year
- Using Pubmed data and tf.keras to predict if an ophthalmology paper will make it into a top 15 journal ☆10 · May 4, 2020 · Updated 5 years ago
- Learn advanced iOS development by building a clone of the Whale App ☆10 · May 14, 2019 · Updated 6 years ago
- DenseQMC: A bit-slice implementation of the Quine-McCluskey algorithm ☆16 · Dec 30, 2025 · Updated 3 months ago
- A Sample iOS project for Realtime Capture Effect of OpenCV. (Swift and Objective-C) ☆14 · Sep 21, 2015 · Updated 10 years ago
- Fast Artistic Videos in pyTorch ☆14 · Oct 3, 2023 · Updated 2 years ago
- Python library to collect performance events ☆14 · Jan 30, 2023 · Updated 3 years ago