akshat0123 / GPT-1

PyTorch implementation of GPT-1

☆18

Alternatives and similar repositories for GPT-1:

Users who are interested in GPT-1 are comparing it to the repositories listed below.

shouxieai / seq2seq_translation
seq2seq_translation
☆26 Updated 3 years ago
percent4 / llm_math_solver
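This project covers dataset synthesis, model training, and evaluation for the math problem-solving ability of large language models, along with notes on related articles.
☆73 Updated 5 months ago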
Fyfy1996 / LLM_reviews
LLM fundamentals and standard interview questions
☆86 Updated 10 months ago
chunhuizhang / personal_chatgpt
personal chatgpt
☆337 Updated 2 months ago
owenliang / qwen-dpo
DPO training for Tongyi Qianwen (Qwen)
☆33 Updated 5 months ago
firechecking / CleanTransformer
An implementation of Transformer, BERT, GPT, and diffusion models for learning purposes
☆151 Updated 4 months ago
l294265421 / alpaca-rlhf
Finetuning LLaMA with RLHF (Reinforcement Learning with Human Feedback) based on DeepSpeed Chat
☆112 Updated last year
wxl1999 / PLMPapers
A paper list of pre-trained language models (PLMs).
☆80 Updated 3 years ago
taishan1994 / pytorch-distributed-NLP
PyTorch distributed training
☆63 Updated last year
lansinuote / Simple_RLHF_Llama3
☆29 Updated 6 months ago
RethinkFun / trian_ppo
☆37 Updated 4 months ago
chunhuizhang / bert_t5_gpt
☆62 Updated last month
akaihaoshuai / baby-llama2-chinese_cybertron
Train an LLM from scratch on a single 24 GB GPU
☆50 Updated 3 months ago
yuanzhoulvpi2017 / SentenceEmbedding
☆102 Updated 7 months ago
LolitaSian / Getting-Started-with-Google-BERT
Getting Started with Google BERT, published by Packt
☆18 Updated last year
jiahe7ay / infini-mini-transformer
This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…
☆56 Updated 10 months ago
ArtificialZeng / llama3_explained
The newest version of llama3, with source code explained line by line in Chinese
☆22 Updated 10 months ago
wdndev / personal
Personal learning project implementing some modules
☆24 Updated 6 months ago
chunhuizhang / llm_aigc
everything about llm & aigc
☆56 Updated last month
RethinkFun / LLM
☆48 Updated 6 months ago
Lisennlp / TinyBert
A simple, easy-to-use TinyBert: a pre-trained language model built by knowledge distillation from BERT
☆257 Updated 4 years ago
moon-hotel / BertWithPretrained
An implementation of the BERT model and its related downstream tasks based on the PyTorch framework
☆579 Updated 3 months ago
WindyLee0822 / CTG
Source code of “Reinforcement Learning with Token-level Feedback for Controllable Text Generation” (NAACL 2024)
☆11 Updated 2 months ago
chunhuizhang / pytorch_distribute_tutorials
PyTorch distributed training tutorials
☆104 Updated this week
beichao1314 / Open-Llama
The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.
☆64 Updated last year
hengjiUSTC / learn-llm
☆104 Updated 3 months ago
GCYZSL / MoLA
☆123 Updated 6 months ago
lansinuote / Transformer_Example
☆159 Updated 3 years ago
Wangmerlyn / MCTS-GSM8k-Demo
This is a repo for showcasing using MCTS with LLMs to solve gsm8k problems
☆49 Updated last month