sleepingcat4 / TinyStoriesLinks

code to train a gpt-2 model to train it on tiny stories dataset according to the TinyStories paper

☆39

Alternatives and similar repositories for TinyStories

Users that are interested in TinyStories are comparing it to the libraries listed below

Sorting:

hydrallm / llama-moe-v1
☆96Updated 2 years ago
kernelmachine / cbtm
Code repository for the c-BTM paper
☆107Updated last year
zphang / minimal-llama
☆460Updated last year
epfml / landmark-attention
Landmark Attention: Random-Access Infinite Context Length for Transformers
☆423Updated last year
ethanyanjiali / minChatGPT
A minimum example of aligning language models with RLHF similar to ChatGPT
☆221Updated last year
imoneoi / multipack
Multipack distributed sampler for fast padding-free training of LLMs
☆199Updated last year
jondurbin / bagel
A bagel, with everything.
☆324Updated last year
HazyResearch / TART
TART: A plug-and-play Transformer module for task-agnostic reasoning
☆200Updated 2 years ago
LLM360 / amber-train
Pre-training code for Amber 7B LLM
☆167Updated last year
sabetAI / BLoRA
batched loras
☆345Updated last year
IST-DASLab / qmoe
Code for the paper "QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models".
☆277Updated last year
Digitous / LLM-SLERP-Merge
Spherical Merge Pytorch/HF format Language Models with minimal feature loss.
☆136Updated last year
jondurbin / qlora
QLoRA: Efficient Finetuning of Quantized LLMs
☆78Updated last year
persimmon-ai-labs / adept-inference
Inference code for Persimmon-8B
☆415Updated last year
leehanchung / lora-instruct
Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA
☆104Updated 3 months ago
qwopqwop200 / gptqlora
GPTQLoRA: Efficient Finetuning of Quantized LLMs with GPTQ
☆103Updated 2 years ago
IBM / ModuleFormer
ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward exp…
☆223Updated last year
LAION-AI / Open-Instruction-Generalist
Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks
☆208Updated last year
luohongyin / SAIL
SAIL: Search Augmented Instruction Learning
☆157Updated last month
sanjeevanahilan / nanoChatGPT
A crude RLHF layer on top of nanoGPT with Gumbel-Softmax trick
☆290Updated last year
rmihaylov / mpttune
Tune MPTs
☆84Updated 2 years ago
orhonovich / unnatural-instructions
☆180Updated 2 years ago
KyujinHan / Sakura-SOLAR-DPO
Sakura-SOLAR-DPO: Merge, SFT, and DPO
☆116Updated last year
declare-lab / flan-alpaca
This repository contains code for extending the Stanford Alpaca synthetic instruction tuning to existing instruction-tuned models such as…
☆353Updated 2 years ago
taprosoft / llm_finetuning
Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GTPQ, bitsandbytes…
☆145Updated last year
CarperAI / autocrit
A repository for transformer critique learning and generation
☆90Updated last year
BlinkDL / nanoRWKV
RWKV in nanoGPT style
☆193Updated last year
ayaka14732 / llama-2-jax
JAX implementation of the Llama 2 model
☆219Updated last year
nlpxucan / evol-instruct
☆273Updated 2 years ago
RahulSChand / llama2.c-for-dummies
Step by step explanation/tutorial of llama2.c
☆223Updated last year