coaxsoft / pytorch_bert
Tutorial for how to build BERT from scratch
☆81 · Updated 3 months ago
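The repo above builds BERT from scratch; the core operation inside every BERT encoder layer is scaled dot-product self-attention. The sketch below shows that one operation in plain Python. It is illustrative only, not code from the repo: the toy token vectors and dimensions are made up, and a real implementation would use batched tensor ops.

```python
import math

def softmax(xs):
    # Numerically stable softmax over a list of floats.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def self_attention(Q, K, V):
    """Scaled dot-product attention: attention(Q, K, V) = softmax(QK^T / sqrt(d)) V.

    Q, K, V are lists of d-dimensional vectors (one per token); each output
    vector is a weighted mix of the value vectors.
    """
    d = len(Q[0])
    out = []
    for q in Q:
        # Similarity of this query against every key, scaled by sqrt(d).
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d) for k in K]
        weights = softmax(scores)
        # Weighted sum of value vectors.
        out.append([sum(w * v[j] for w, v in zip(weights, V)) for j in range(d)])
    return out

# Two toy 2-d token embeddings attending over themselves (self-attention).
tokens = [[1.0, 0.0], [0.0, 1.0]]
mixed = self_attention(tokens, tokens, tokens)
```

Each output row is a convex combination of the inputs (its weights sum to 1), and each token attends most strongly to itself here because its query aligns best with its own key.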
Related projects:
- LoRA and DoRA from Scratch Implementations ☆179 · Updated 6 months ago
- LLaMA 2 implemented from scratch in PyTorch ☆216 · Updated 11 months ago
- Early solution for the Google AI4Code competition ☆75 · Updated 2 years ago
- An open collection of implementation tips, tricks, and resources for training large language models ☆455 · Updated last year
- LoRA: Low-Rank Adaptation of Large Language Models, implemented in PyTorch ☆72 · Updated last year
- Efficient Attention for Long Sequence Processing ☆84 · Updated 9 months ago
- NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1 GPU + 1 Day ☆248 · Updated 10 months ago
- Code from the blog post: https://fkodom.substack.com/p/transformers-from-scratch-in-pytorch ☆90 · Updated last year
- Well-documented, unit-tested, type-checked, and formatted implementation of a vanilla transformer, for educational purposes ☆211 · Updated 5 months ago
- A Simplified PyTorch Implementation of Vision Transformer (ViT) ☆123 · Updated 3 months ago
- Prune transformer layers ☆60 · Updated 3 months ago
- Code used for the "Optimizing Memory Usage for Training LLMs and Vision Transformers in PyTorch" blog po… ☆84 · Updated last year
- Recurrent Memory Transformer ☆148 · Updated last year
- A set of scripts and notebooks on LLM fine-tuning and dataset creation ☆89 · Updated last week
- Implementation of CALM from the paper "LLM Augmented LLMs: Expanding Capabilities through Composition", out of Google DeepMind ☆161 · Updated last week
- Exploring fine-tuning public checkpoints on filtered 8K sequences from the Pile ☆115 · Updated last year
- 🧠 A study guide to learn about Transformers ☆10 · Updated 8 months ago
- Implementation of ST-MoE, the latest incarnation of MoE after years of research at Brain, in PyTorch ☆278 · Updated 3 months ago
- Implementation of the conditionally routed attention in the CoLT5 architecture, in PyTorch ☆222 · Updated 2 weeks ago
- Some notebooks for NLP ☆186 · Updated 10 months ago
- Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback ☆89 · Updated last year
- Fine-tune a T5 transformer model using PyTorch & Transformers 🤗 ☆192 · Updated 3 years ago
- Sequence modeling with Mega ☆296 · Updated last year
- Distributed training (multi-node) of a Transformer model ☆36 · Updated 5 months ago
- Defines Transformer, T5, and RoBERTa encoder-decoder models for product-name generation ☆47 · Updated 2 years ago
- An implementation of the LLaMA 2 (Large Language Model Meta AI) model, a Generative Pretrained Transformer (GPT)… ☆47 · Updated 11 months ago
- A simplified version of Meta's Llama 3 model, to be used for learning ☆26 · Updated 3 months ago
- A simple and working implementation of ELECTRA, the fastest way to pretrain language models from scratch, in PyTorch ☆222 · Updated last year
- LLM Workshop by Sourab Mangrulkar ☆322 · Updated 3 months ago
- Implementation of the first paper on word2vec ☆196 · Updated 2 years ago