AliHaiderAhmad001 / BERT-from-Scratch-with-PyTorchLinks

Implementation of BERT-based Language Models

☆19

Alternatives and similar repositories for BERT-from-Scratch-with-PyTorch

Users that are interested in BERT-from-Scratch-with-PyTorch are comparing it to the libraries listed below

Sorting:

coaxsoft / pytorch_bert
Tutorial for how to build BERT from scratch
☆97Updated last year
hkproj / pytorch-lora
LORA: Low-Rank Adaptation of Large Language Models implemented using PyTorch
☆112Updated 2 years ago
rasbt / dora-from-scratch
LoRA and DoRA from Scratch Implementations
☆207Updated last year
jsbaan / transformer-from-scratch
Well documented, unit tested, type checked and formatted implementation of a vanilla transformer - for educational purposes.
☆254Updated last year
hkproj / pytorch-llama
LLaMA 2 implemented from scratch in PyTorch
☆345Updated last year
hkproj / pytorch-transformer-distributed
Distributed training (multi-node) of a Transformer model
☆76Updated last year
cmu-l3 / anlp-spring2025-code
Advanced NLP, Spring 2025 https://cmu-l3.github.io/anlp-spring2025/
☆61Updated 4 months ago
neubig / minllama-assignment
☆90Updated 10 months ago
hkproj / rlhf-ppo
Notes and commented code for RLHF (PPO)
☆101Updated last year
wolfecameron / nanoMoE
An extension of the nanoGPT repository for training small MOE models.
☆164Updated 4 months ago
tianlinxu312 / Everything-about-LLMs
A work in progress. Trying to write about all interesting or necessary pieces in the current development of LLMs and generative AI. Gra…
☆195Updated last year
genaibook / genaibook
Contains the public resources of Hands on GenAI book
☆184Updated 7 months ago
ChanCheeKean / DataScience
☆84Updated last year
huggingface / transformers-research-projects
Research projects built on top of Transformers
☆69Updated 4 months ago
lucasdelimanogueira / PyNorch
Recreating PyTorch from scratch (C/C++, CUDA, NCCL and Python, with multi-GPU support and automatic differentiation!)
☆151Updated last year
EurekaLabsAI / mlp
The Multilayer Perceptron Language Model
☆557Updated 11 months ago
AIMO-CMU-MATH / CMU_MATH-AIMO
☆76Updated last year
hkproj / dpo-notes
Notes on Direct Preference Optimization
☆21Updated last year
hkproj / transformer-from-scratch-notes
Notes about "Attention is all you need" video (https://www.youtube.com/watch?v=bCz4OMemCcA)
☆299Updated 2 years ago
aju22 / LLaMA2
This repository contains an implementation of the LLaMA 2 (Large Language Model Meta AI) model, a Generative Pretrained Transformer (GPT)…
☆70Updated last year
hkproj / triton-flash-attention
☆184Updated 7 months ago
markhliu / DGAI
Learn Generative AI with PyTorch (Manning Publications, 2024)
☆116Updated 2 months ago
bkitano / llama-from-scratch
Llama from scratch, or How to implement a paper without crying
☆574Updated last year
rasbt / cvpr2023
☆133Updated last year
HumanSignal / RLHF
Collection of links, tutorials and best practices of how to collect the data and build end-to-end RLHF system to finetune Generative AI m…
☆224Updated 2 years ago
rasbt / faster-pytorch-blog
Outlining techniques for improving the training performance of your PyTorch model without compromising its accuracy
☆128Updated 2 years ago
PacktPublishing / Accelerate-Model-Training-with-PyTorch-2.X
Accelerate Model Training with PyTorch 2.X, published by Packt
☆46Updated last year
FareedKhan-dev / create-million-parameter-llm-from-scratch
Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.
☆181Updated last year
slds-lmu / seminar_multimodal_dl
https://slds-lmu.github.io/seminar_multimodal_dl/
☆170Updated 2 years ago
fangyuan-ksgk / Tiny-GRPO
minimal GRPO implementation from scratch
☆94Updated 4 months ago