Norod / TrainGPT2-127M-FromScratch
A trio of Google-Colab notebooks (ipynb) for training a GPT-2 (127M) model from scratch (useful for other / non-English languages) using gpt-2-simple
☆15Updated 4 years ago
Alternatives and similar repositories for TrainGPT2-127M-FromScratch:
Users that are interested in TrainGPT2-127M-FromScratch are comparing it to the libraries listed below
- ☆14Updated last year
- Experimental sampler to make LLMs more creative☆30Updated last year
- ☆48Updated last year
- Using short models to classify long texts☆21Updated last year
- ☆27Updated last year
- ☆15Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated 11 months ago
- Fast AI Practical Deep Learning for Coders experiments in Stable Diffusion☆23Updated 2 years ago
- Merge LLM that are split in to parts☆26Updated last year
- Zephyr 7B beta RAG Demo inside a Gradio app powered by BGE Embeddings, ChromaDB, and Zephyr 7B Beta LLM.☆34Updated last year
- Tool to take your ML model from local to production with one-line of code.☆25Updated last year
- ☆28Updated last year
- A repository re-creating the PromptBreeder Evolutionary Algorithm from the DeepMind Paper in Python using LMQL as the backend.☆27Updated last year
- ☆32Updated last year
- ☆17Updated last year
- ☆27Updated last year
- Create soft prompts for fairseq 13B dense, GPT-J-6B and GPT-Neo-2.7B for free in a Google Colab TPU instance☆27Updated last year
- Cog wrapper for collabora/WhisperSpeech☆25Updated 11 months ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆14Updated 11 months ago
- ☆54Updated last year
- ☆13Updated last year
- Image restoration with neural networks but without learning.☆46Updated 2 years ago
- Floral Diffusion is a custom diffusion model trained by jags using a DD 5.6 version☆26Updated 2 years ago
- ☆26Updated 11 months ago
- Modified Beam Search with periodical restart☆12Updated 5 months ago
- Text-writing denoising diffusion (and much more)☆30Updated last year
- ☆21Updated 4 years ago
- ☆24Updated last year
- ☆20Updated 2 months ago
- Fast inference of Instruct tuned LLaMa on your personal devices.☆22Updated last year