akshat0123 / GPT-1
PyTorch implementation of GPT-1
☆18Updated 2 years ago
Alternatives and similar repositories for GPT-1:
Users who are interested in GPT-1 are comparing it to the libraries listed below:
- seq2seq_translation☆26Updated 3 years ago
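- Dataset synthesis, model training and evaluation, and related article notes for the mathematical problem-solving ability of large models.☆73Updated 5 months ago
- Large-model fundamentals and standard interview questions☆86Updated 10 months ago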
- personal chatgpt☆337Updated 2 months ago
- DPO training for Tongyi Qianwen (Qwen)☆33Updated 5 months ago
- an implementation of transformer, bert, gpt, and diffusion models for learning purposes☆151Updated 4 months ago
- Fine-tuning LLaMA with RLHF (Reinforcement Learning from Human Feedback) based on DeepSpeed Chat☆112Updated last year
- A paper list of pre-trained language models (PLMs).☆80Updated 3 years ago
- PyTorch distributed training☆63Updated last year
- ☆29Updated 6 months ago
- ☆37Updated 4 months ago
- ☆62Updated last month
- Training an LLM from scratch on a single 24 GB GPU☆50Updated 3 months ago
- ☆102Updated 7 months ago
- Getting Started with Google BERT, published by Packt☆18Updated last year
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…☆56Updated 10 months ago
- The newest version of Llama 3, with source code explained line by line in Chinese☆22Updated 10 months ago
- Individual learning to implement some modules☆24Updated 6 months ago
- everything about llm & aigc☆56Updated last month
- ☆48Updated 6 months ago
- A simple, easy-to-use TinyBERT: a pre-trained language model built by knowledge distillation from BERT☆257Updated 4 years ago
- An implementation of the BERT model and its related downstream tasks based on the PyTorch framework☆579Updated 3 months ago
- Source code of “Reinforcement Learning with Token-level Feedback for Controllable Text Generation” (NAACL 2024)☆11Updated 2 months ago
- PyTorch distributed training tutorials☆104Updated this week
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.☆64Updated last year
- ☆104Updated 3 months ago
- ☆123Updated 6 months ago
- ☆159Updated 3 years ago
- A repo showcasing the use of MCTS with LLMs to solve GSM8K problems☆49Updated last month